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(57) Abstract: The present invention provides 
amino acid sequences of proteins that are encoded by 
genes within the Drosophila melanogdster genome, 
the GPCR proteins of the present invention. The 
present invention specifically provides isolated 
protein and nucleic acid molecules, methods of 
identifying orthologs and paralogs of the GPCR 
proteins and methods of identifying modulators of 
the GPCR proteins for use as insecticides. 
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A5AACC7TT CTTTT CT AAAT3C ATTTCCCCCAT ACT ATCCCACCA? AAAAAATATAAC ACGAACA TTTT AAATAATTGAAATC ASTATCC ATTTT AAC AAAATTAACTT AT A 
AAAAAAAATATTireCTATCATWAATAAGTCT^CCACTGGlVrTCAAAATCTT^^ 

CCAAf.TCTCP3CrGCCCGTGCTCGTCGCCATCTCTCCACT ATT ACT CCCGGCCGnCGGCCGCCn: ACT AAACTGCCCAGCAACTTGTTCACCACCCTCCCAATCCCAAAOCT 

3ACATC0?GAT11^C*AGCAC^CAC«AAGWGACCACAAA3C^ 

CtWT*GSCCA01*CC*>CTCTTTX*CTCCCAACAGCCCrr«^ 

TTCCC ACGATCACTCGCAGT ACGTTGATCAGAAAAAC TGTTGSGTAAAATGC ATAT AGCTGAGC AAT A TTGT AC TTTT AAAGAAAATT AT5CAT ATTAC7 AAAGCTCGCCAG 
CMCCAAAGACTTATGCCCACCt»CM0ATCWaMU7T»5A0ACTCT 

CtCTT AAT AAATCCAATCC AAAAACCAAATCAATCAGC ATTATGTC TTGTCTAAAATfCTCAAAATAACTTACT GC TT A TT CTCCOGACT ACTCAAATCCCTOGCCAGMXC 
TAGACGATGGCTATQGaTATSaQTGAAAACCAGCTGATGACAATAAACCAGCGCATCACAATCGTCT CCTTCAC AAAGACCTCCCGA* TCCAAASAACAACCCGCGTCACC A 
TrrOAACCW^CATrCCTt^CATCTCAAC^TCACOT 
CTCCACCAAATCACATCTOXXTGCACACAA^GCCW 

AGGTGt AGS TX3C^GGCCCT - CGCAAAACA7CCAGAAATACTTCACCAWCATAAAGTAt;TQCACCACCACATCC AGCCC AATGC ACCAC ACCTCAAAGn AGTTATCTTTAJ CC 
TTTTCTAT AATTGCCTAGC T ACTTGT ACCCCTATGTTATTTTC A rAG7TACTTTTCTCACTCCATCTT ACCGGATTTTCGGCTATOOTT TCCCAAC BCTCC ACCACCAGACC 
ATACCATAATATCCAGT^ACJOIGCTI^JU^XACCCA^C 
At^AJCGAAAAJG^TATTICira^TCAiyiTAgCG^ 

CACTT ACCTCC AGGTCC TC ATACTCC ACCCACTTGCT GT AATTCCACC ACCTCrCArTGCT AAKGCATCMKTACCA<^GCCGTrKTCUCCCAGCT 
T SCTT ACCAT AATTGTGAC TCSCTAAlTrCCC^ACTCATCCGTXAAGOTCCTftTTTCTAACAAAKXCCT^ 
TCACTrCCCCTCTAAAGW«TE»ACJKTATOT 

ACTTCAAGAASSTTTTATT TT-TCTTTC ATTTTEC ATCATCTTTTTCTGAGATATGCTGCGACTTTATttTAATAGACACCAATGTAAACTCTATAGGATT TTCCCTGTTTGT A 

QCSCACTCAT AC AT AAAC A TT0TATT7 TATGA3ACAC AGUCACATT AAAGCAAATGCC9AAT ATGIXAACAT ATTTTCAAATTT ATI^TC AAACSAAT ATTAU 

GAGC?TATTTGACCATTCGAAATCCTCAA<KMTCAAAATCtATCCTACTOTG 

TTCTGTTGAArCCTTCCAC<UAATCGqSACAArATTGACTCAGCA2AI^ 

CTCCrXCTCAT7T1JT«K«TC<;rC«CACTGACGAWKnGOTCA 

<XATTCTCCAWtATC&C«^crttSXTAAAimCrCCC^^ 

t atagc a7tttt aat aat act atctat6coctoc ataactcaaactgatgt aattcacttttc aagttcaattcaattgcata: ttcc tt t ac atatctaaaat aaactgct 
a ttfc aaatctftcat aac7tgcttaac at actt a ataga tc atcgttct tt at agcagggt ataattca7 aaa ttctcgttc aaaatat atgtacatttcc t aaaggtcttg 
agtcctattt aaat7tat aaatttctcc7c agatctxaatc a vttgca7 tcaaacgaaaa£cc atgtttaaa7ttccatc cccaacaactaagtcaatcattttgtatata t 
ttaaatacatt actttcaaatgc attaaatg aaatcaga70ggctgtaa ttt aatttaatt c tc t tc t c tttcttatctctcttatqcctctcct at^tttcgcggcjjfca'f 
caccwtacccctcjtmccr^actcacctotktccotaag 
tcctttaaaaatat«ccacaitatcct«:gtttto^tk«tccmcatccac^ 

taact ctt ctcaacgcctaarccact aaaaaaaaatataai aat at aaatat aattcaaacaatt ac ggcataaataggataatcaactqcaataatttatcacagvcaatt 

Tl^AAGCrnTT\AACAACTCrr&rrrACCCAT.\G\VAC7TC7ATCJ»^ 

TTTCGCCA TAGCGAAGC G AACTTGAGTCC CTTGTGGCCGTCC ATTCGTC TXiTSCATTTCTAATTTBiTTA BGCTTCAAAG AGTT CCCT GAGTTCGGTTCATATTTG rTCIGC 
OGTCC TG7CTGCGGOG AAATGGTGCC AAAC 3AAATG0AAT ATCCACACSGGGCACACAC ACAC ACTCAAAAT A rCGAAGCAC CTCAGTCTCCTGniACCCnACTCACCTACr 
CCTCTCTCATATGTTGTAAATCACTTTATTCTCTAATTAAT^^CCTCC™ 

GAGACAAAOC AAATGC C ACOWTTAAAAAC TCAAATCAAATCC AC TTTGATCGTGGGGGCACACAAACACACAAAC AC AT AAAC ACACTCACACACCCGC ACCCAACACCTG 
QCCCAMACGCGGSATGAGC TGC AAAGTTCTTGCC T AACTTGOTT ATATTTTTAATCAAAACT CATTTC ACTTGATTaAC TT TCO ACAG AACCC TCCACC AGGC ACTGGAAA 
AlCTAGCAT«AC»T*CATACWACTAAC*?CATTTTreCCeT 

ATGACGCCTGCCT AAACATTCTT CCATGTT ATATATATT TTI GCAftGTCTCTCGGCCAGGC ** TTAGCCCC ACCTAACCCCACC 
(SGO ID BO: 1) 

Hem 4236..40G2 

Eaoni U30..12JQ 

CasBi 2«H..IS8« 

Eiom 2518.. 2427 

Exon. 2323.. lt?9 

Excn: nio..iooi 

Start ATO; I23S (H«v*rs* *tr«nd: CAT) 
Transcript «o. : CM121 

ATGGCCACAACCTCTTCA6 A7TC5GA5TO5C AAAATGI5GATG TGGC ATCGC AGGCCC AAACO C AGGAT aatcTGCGC ATAT TTTT AAAGC ATTTCTBTGCGCAGTSCCTCT 
TCCGATACC Aty^TCTCACAT ACCAT ACCSAC&ACCCATCAri WCAC TTCCCCC AGC AAC AG ACTACCATTCC CATTTCCCGSCAACn? ACCCCAGTCCCGCGATATCT 
GCAGAATeCCGCGATGAACGAGSCeCTTATTGACATGCGAAACGTCGJlCCMC^ 

C AGAAQGAC AACCCACrCTtTTCCCCCCTGAAC TT-7CACGGAT A7CTCTCCTGC-CC AAGWC TCCG GCCGGC AC T GTGCTGA GTCAATATTGTC CCGAT TTCCrCGAAGGAT 
T C AACACAAAATTTCTGCCCC AC AAGACCTCCCTGGAGAACCCCTCCTCCTACCGTC ATCCC GTT ACCAATCAGACCT GGTC CAATT ACAC C AACTGCGT9GACTArQACG A 
CCTGCACTTTCCCCAGTTCATC AACCAGCTCT ACGTGAAGCOTT ATGCCCTCTCCCTTCTI^CCTGerCATATCGAT T ATT A TTTTCTTCCCCTTCAAATCCCTCCCCTCC 
ACCCSCArCCCCATCCATCTAC ATCT CTTCCCCTCOCTGGCCTOC ACC rCTCTGGC CTCGATAT7 ATGGTATCOT CTOGTGC TOO AGCGTTCCG AAACC AT AGCCGAAAA7C 
CGGTAAGATGCACTGA 5AAAAGT AACTATGAAAAT AAC AT AGC CC T AC AAG7 ACC T AGGC AAT7 AT ACAAAACC ATAAACAT AJ^TAAC TTTCAGC TGTGGTGC ATTGGGCT 
GC ATCTGGTG07GC ACTACrf TAT G T T GS T C AACT ATTTCTGCA TGT TTTCCGAGttGCCTCCACCTCC ACCTGCT GCTCCTT CTCCTCACTTATCCTT ACTCCTVTCCCAAT 
CTOOCAATTCC CTGCACAAG7C TOCAT TCTTTATGCGCGACTTCTCTGCATCCCACATCTOATTTC CTGCACCCGC TGAC AT GfiC AATCGTTC ACCCCC AGC A A TTTGTCTC 
AGACC AAA7CGC AACGTTCGCC AACTCCTTT AC GTCTCC GTCAAGTTGAGATCTC AGGAATGTCCCTT CAGATC C7CACGCC CCTTCTTCT TTCCATTCGCC AGGTCTTTGT 
C AAGGACACGATTGTG ArOCGCTGGTTIATTCTCATCAGCTCGTTTrC ACtCATACC C AT AOCC ATCGTCTAC G TTTTTCTC A7C AA CCT ACTCCCACTCATCCTCCGAAAC 
TTACAiaxCACrcCGCL^U:CC«»ttGTT&crcATCC^^ 

CCCATGCCGGC ACC CAGCTGGACC ACTTCT ATC ACATCCT ATCCGTSC rCTTGSTC AGCCTGC AGCCCrrTCTCGTCTCCTTCCTCTTCTGCTT CGCC AATC ACGATCTC A c 
CTTTGCCATTC GC A CGCTGCTGAAC AACTTGCTGCC C AGTTTA GTGACCCCCCCG C CGCCCGGGAGT AAT ACTGCACAGArf; GCC ACGACC ACCCCC ACCCGAGAG TTGG CC 
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ISOLATED G-PROTEIN COUPLED RECEPTORS, NUCLEIC ACID MOLECULES 
ENCODING GPCR PROTEINS, AND USES THEREOF AS INSECTICIDAL TARGETS 

RELATED APPLICATIONS 

The present application claims priority to U.S. Serial No. 60/191,638, filed March 23, 
2000 (Atty. Docket CL000388) and U.S. Serial No. 09/618,893, filed July 18, 2000 (Atty. 
Docket CL000733). 

FIELD OF THE INVENTION 

The present invention is in the field of G-Protein coupled receptors (GPCRs), 
recombinant DNA molecules and protein production. The present invention specifically 
provides novel GPCR proteins and nucleic acid molecules encoding such protein molecules, for 
use in the development of insecticide and insecticide targets and as a source for identifying 
human therapeutics and human therapeutic development. 

BACKGROUND OF THE INVENTION 

The Drosophila melanogaster genome is 165 Mb, with about 120 Mb of this being 
euchromatic. The genome is organized in 4 chromosome pairs and is estimated to contain 10 - 
12,000 genes. Model organisms, such as Drosophila melanogaster, share many genes with 
humans whose sequences and functions have been conserved. In addition to myriad similarities 
in cellular structure and function, humans and Drosophila share pathways for intercellular 
signaling, developmental patterning, learning and behavior, as well as tumor formation and 
metastasis. 

The genes involved in the development of Drosophila, with few exceptions, are the same 
as those involved in the development of higher organisms. Developmental biology studies the 
sequential activation and interaction of genes, in relation to developing morphology. Right now, 
Drosophila is the only organism for which one can begin with a list of genes active in the egg 
and follow the morphological changes and gene activations through to adulthood. 

Drosophila studies have provided the widest knowledge base available for any single 
organism; accordingly, developmental biologists use the fly to ferret out the activity of genes 
with similar functions in higher organisms. Despite its small size, the fly is by no means a small 
developmental problem. If you know the genes involved in the development of the fly, you also 

1 
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know, to a reasonable approximation, the genes involved in the development of the worm, the 
fish, the mouse, and humans. 

A major goal in drug/insecticide development is to understand and elucidate the 
molecular mechanisms that govern cell signaling and cell-cell interactions in higher eukaryotes. 
5 GPCRs form major links in cellular communication/response systems. A complete list of GPCR 
proteins from Drosophila would therefore be invaluable in developing human therapeutic targets. 
Not only will the proteins serve as models for human cellular signaling and response, such 
molecules will also serve as molecular keys in identifying therapeutically important human 
orthologs. 

10 

G-protein coupled receptors 

G-protein coupled receptors (GPCRs) constitute a major class of proteins responsible for 
transducing a signal within a cell. GPCRs have three structural domains: an amino terminal 
extracellular domain, a transmembrane domain containing seven transmembrane segments, three 

1 5 extracellular loops, and three intracellular loops, and a carboxy terminal intracellular domain. Upon 
binding of a ligand to an extracellular portion of a GPCR, a signal is transduced; within the cell that 
results in a change in a biological or physiological property of the cell. GPCRs, along with G- 
proteins and effectors (intracellular enzymes and channels modulated by G-proteins), are the 
components of a modular signaling system that connects the state of intracellular second 

20 messengers to extracellular inputs. 

GPCR genes and gene-products are potential causative agents of disease (Spiegel et al, J. 
Clin. Invest. 92:1119-1125 (1993); McKusick et al.,J. Med. Genet. 30:1-26(1993)). Specific 
defects in the rhodopsin gene and the V2 vasopressin receptor gene have been shown to cause 
various forms of retinitis pigmentosum (Nathans et al.,Annu. Rev. Genet. 26:403-424(1 992)), and 

25 nephrogenic diabetes insipidus (Holtzman et al, Hum. Mol. Genet. 2: 1 201 -1 204 (1 993)). These 
receptors are of critical importance to both the central nervous system and peripheral physiological 
processes. Evolutionary analyses suggest that the ancestor of these proteins originally developed in 
concert with complex body plans and nervous systems. 

The GPCR protein superfamily can be divided into five families: Family I, receptors 

30 typified by rhodopsin and the (52-adrenergic receptor and currently represented by over 200 unique 
members (Dohlman et al, Annu. Rev. Biochem. (50:653-688 (1991)); Family II, the parathyroid 
hormone/calcitonin/secretin receptor family (Juppner et al, Science 254:1024-1026 (1991); Lin et 
al, Science 254:1022-1024 (1991)); Family III, the metabotropic glutamate receptor family 
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(Nakanishi, Science 258 597:603 (1992)); Family IV, the cAMP receptor family, important in the 
chemotaxis and development of D. discoideum (Klein el al, Science 247:1467-1472 (1988)); and 
Family V, the fungal mating pheromone receptors such as STE2 (Kurjan, Annu. Rev. Biochem. 
(57:1097-1129(1992)). 

There are also a small number of other proteins, which present seven putative hydrophobic 
segments and appear to be unrelated to GPCRs; they have not been shown to couple to G-proteins. 
Drosophila expresses a photoreceptor-specific protein, bride of sevenless (boss), a seven- 
transmembrane-segment protein, which has been extensively studied and does not show evidence of 
being a GPCR (Hart et al., Proc. Natl. Acad. Sci. USA 90:5047-5051 (1993)). The gene frizzled (Jz) 
in Drosophila is also thought to be a protein with seven transmembrane segments. Like boss, fz has 
not been shown to couple to G-proteins (Vinson et al, Nature 355:263-264 (1989)). 

G proteins represent a family of heterotrimeric proteins composed of a, p and y subunits, 
that bind guanine nucleotides. These proteins are usually linked to cell surface receptors, e.g., 
receptors containing seven transmembrane segments. Following ligand binding to the GPCR, a 
conformational change is transmitted to the G protein, which causes the a-subunit to exchange a 
bound GDP molecule for a GTP molecule and to dissociate from the Py-subunits. The GTP-bound 
form of the a-subunit typically functions as an effector-modulating moiety, leading to the 
production of second messengers, such as cAMP (e.g., by activation of adenyl cyclase), 
diacylglycerol or inositol phosphates. Greater than 20 different types of a-subunits are known in 
humans. These subunits associate with a smaller pool of p and y subunits. Examples of 
mammalian G proteins include Gi, Go, Gq, Gs and Gt. G proteins are described extensively in 
Lodish et al., Molecular Cell Biology, (Scientific American Books Inc., New York, N.Y., 1 995), the 
contents of which are incorporated herein by reference. GPCRs, G proteins and G protein-linked 
effector and second messenger systems have been reviewed in The G-Protein Linked Receptor Fact 
Book, Watson et al, eds., Academic Press (1994). 

Dopamine receptors 

The understanding of the dopaminergic system relevance in brain function and disease 
developed several decades ago from three diverse observations following drug treatments. These 
were the observations that dopamine replacement therapy improved Parkinson's disease symptoms, 
depletion of dopamine and other catecholamines by reserpine caused depression and antipsychotic 
drugs blocked dopamine receptors. The finding that the dopamine receptor binding affinities of 
typical antipsychotic drugs correlate with their clinical potency led to the dopamine overactivity 



3 



SUBSTITUTE SHEET (RULE 26) 





WO 01/070980 



PCT/US01/09341 



10 



15 



20 



25 



hypothesis of schizophrenia (Snyder, S.H., Am J Psychiatry 133, 197-202 (1976); Seeman, P. and 
Lee, T., Science 188, 1217-9 (1975)). Today, dopamine receptors are crucial targets in the 
pharmacological therapy of schizophrenia, Parkinson's disease, Tourette's syndrome, tardive 
dyskinesia and Huntington's disease. The dopaminergic system includes the nigrbstriatal, 
mesocorticolimbic and tuberoinfundibular pathways. The nigrostriatal pathway is part of the striatal 
motor system and its degeneration leads to Parkinson's disease; the mesocorticolimbic pathway 
plays a key role in reinforcement and in emotional expression and is the desired site of action of 
antipsychotic drugs; the tuberoinfundibular pathways regulates prolactin secretion from the 
pituitary. 

Dopamine receptors are members of the G protein coupled receptor superfamily, a large 
group proteins that share a seven helical membrane-spanning structure and transduce signals 
through coupling to heterotrimeric guanine nucleotide-binding regulatory proteins (G proteins). 
Dopamine receptors are classified into subfamilies: Dl-like (Dl and D5) and D2-like (D2, D3 and 
D4) based on their different ligand binding profiles, signal transduction properties, sequence 
homologies and genomic organizations (Civelli, O., Bunzow, J.R. and Grandy, D.K., Anna Rev 
Pharmacol Toxicol 33, 281-307 (1993)). The Dl-like receptors, Dl and D5, stimulate cAMP 
synthesis through coupling with Gs-like proteins and their genes do not contain introns within their 
protein coding regions. On the other hand, the D2-like receptors, D2, D3 and D4, inhibit cAMP 
synthesis through their interaction with Gi-like proteins and share a similar genomic organization 
which includes introns within their protein coding regions. 

Serotonin receptors 

Serotonin (5-Hydroxytryptamine; 5-HT) was first isolated from blood serum, where it was 
shown to promote vasoconstriction (Rapport, M.M., Green, A.A. and Page, I.H., J Biol Chem 176, 
1243-125 1 (1948). Interest on a possible relationship between 5-HT and psychiatric disease was 
spurred by the observations that hallucinogens such as LSD and psilocybin inhibit the actions of 5- 
HT on smooth muscle preparations (Gaddum, J.H. and Hameed, K.A., Br J Pharmacol 9, 240-248 
(1 954)). This observation lead to the hypothesis that brain 5-HT activity might be altered in 
psychiatric disorders (Wooley, D.W. and Shaw, E, Proc Natl Acad Sci USA 40, 228-231 (1954); 
Gaddum, J.H. and Picarelli, Z.P., Br J Pharmacol 12, 323-328 (1957)). This hypothesis was 
strengthened by the introduction of tricyclic antidepressants and monoamine oxidase inhibitors for 
the treatment of major depression and the observation that those drugs affected noradrenaline and 5- 
HT metabolism. Today, drugs acting on the serotoninergic system have been proved to be effective 
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in the pharmacotherapy of psychiatric diseases such as depression, schizophrenia, obsessive- 
compulsive disorder, panic disorder, generalized anxiety disorder and social phobia as well as 
migraine, vomiting induced by cancer chemotherapy and gastric motility disorders. 

Serotonin receptors represent a very large and diverse family of neurotransmitter receptors. 
5 To date thirteen 5-HT receptor proteins coupled to G proteins plus one ligand-gated ion channel 
receptor (5-HT3) have been described in mammals. This receptor diversity is thought to reflect 
serotonin's ancient origin as a neurotransmitter and a hormone as well as the many different roles of 
5-HT in mammals. The 5-HT receptors have been classified into seven subfamilies or groups 
according to their different ligand-binding affinity profiles, molecular structure and intracellular 
10 transduction mechanisms (Hoyer, D. et ah, Pharmacol. Rev. 46, 157-203 (1994)). 

GPCRs are a major target for drug/insecticide action and development. Accordingly, it is 
valuable to the field of pharmaceutical development to identify and characterize previously 
unknown GPCRs. The present invention advances the state of the art by providing a previously 
unidentified drosophila GPCRs. 

15 

Insecticides 

About 10,000 species of the more than 1 million species of insects are crop-eating, and of 
these, approximately 700 species worldwide cause most of the insect damage to man's crops, in 
the field and in storage. 

20 A detailed study of novel proteins from Drosophila and invertebrate orthologs thereof, 

will serve as targets for identifying new members of the known classes of insecticides as well as 
aiding in the identification of new classes of compounds. 



SUMMARY OF THE INVENTION 

25 The present invention is based in part on the identification of nucleic acid sequences that 

encode amino acid sequences of 66 GPCR proteins that are produced by Drosophila 
melanogaster and invertebrate, human and other mammalian orthologs thereof. These unique 
protein sequences, and nucleic acid sequences that encode these proteins, can be used as models 
for the development of human therapeutic targets, aid in the identification of therapeutic proteins 

30 and serve as targets for the development of human therapeutic and insecticidal agents. 

The proteins of the present inventions are GPCRs that participate in signaling pathways. As 
used herein, a "signaling pathway" refers to the modulation (e.g., stimulation or inhibition) of a 
cellular function/activity upon the binding of a ligand to the GPCR protein. Examples of such 
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functions include mobilization of intracellular molecules that participate in a signal transduction 
pathway, e.g., phosphatidylinositol 4,5-bisphosphate (PIP2X inositol 1 ,4,5-triphosphate (IP 3 ) and 
adenylate cyclase; polarization of the plasma membrane; production or secretion of molecules; 
alteration in the structure of a cellular component; cell proliferation, e.g., synthesis of DNA; cell 
5 migration; cell differentiation; and cell survival 

The response mediated by the receptor protein depends on the type of cell. For example, in 
some cells, binding of a ligand to the receptor protein may stimulate an activity such as release of 
compounds, gating of a channel, cellular adhesion, migration, differentiation, etc., through 
phosphatidylinositol or cyclic AMP metabolism and turnover while in other cells, the binding of the 

10 ligand will produce a different result. Regardless of the cellular activity/response modulated by the 
receptor protein, it is universal that the protein is a GPCR and interacts with G proteins to produce 
one or more secondary signals, in a variety of intracellular signal transduction pathways, e.g., 
through phosphatidylinositol or cyclic AMP metabolism and turnover, in a cell. 

As used herein, "phosphatidylinositol turnover and metabolism" refers to the molecules 

1 5 involved in the turnover and metabolism of phosphatidylinositol 4,5-bisphosphate (PIP2) as well as 
to the activities of these molecules. PIP2 is a phospholipid found in the cytosolic leaflet of the 
plasma membrane. Binding of ligand to the receptor activates, in some cells, the plasma-membrane 
enzyme phospholipase C that in turn can hydrolyze PIP2 to produce 1 ,2-diacylglycerol (DAG) and 
inositol 1,4,5-triphosphate (IP3). Once formed IP3 can diffuse to the endoplasmic reticulum surface 

20 where it can bind an IP3 receptor, e.g., a calcium channel protein containing an IP3 binding site. IP 3 
binding can induce opening of the channel, allowing calcium ions to be released into the cytoplasm. 
IP3 can also be phosphorylated by a specific kinase to form inositol 1,3,4,5-tetraphosphate (IP4), a 
molecule that can cause calcium entry into the cytoplasm from the extracellular medium. IP3 and 
U*4can subsequently be hydrolyzed very rapidly to the inactive products inositol 1 ,4-biphosphate 

25 (IP 2 ) and inositol 1 ,3,4-triphosphate, respectively. These inactive products can be recycled by the 
cell to synthesize PIP2. The other second messenger produced by the hydrolysis of PIP2, namely 
1 ,2-diacylglycerol (DAG), remains in the cell membrane where it can serve to activate the enzyme 
protein kinase C. Protein kinase C is usually found soluble in the cytoplasm of the cell, but upon an 
increase in the intracellular calcium concentration, this enzyme can move to the plasma membrane 

30 where it can be activated by DAG. The activation of protein kinase C in different cells results in 
various cellular responses such as the phosphorylation of glycogen synthase, or the phosphorylation 
of various transcription factors, e.g., NF-kB. The language "phosphatidylinositol activity", as used 
herein, refers to an activity of PIP2 or one of its metabolites. 
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Another signaling pathway in which the receptor may participate is the cAMP turnover 
pathway. As used herein, "cyclic AMP turnover and metabolism" refers to the molecules 
involved in the turnover and metabolism of cyclic AMP (cAMP) as well as to the activities of 
these molecules. Cyclic AMP is a second messenger produced in response to ligand-induced 
stimulation of certain G protein coupled receptors. In the cAMP signaling pathway, binding of a 
ligand to a GPCR can lead to the activation of the enzyme adenyl cyclase, which catalyzes the 
synthesis of cAMP. The newly synthesized cAMP can in tum activate a cAMP-dependent 
protein kinase. This activated kinase can phosphorylate a voltage-gated potassium channel 
protein, or an associated protein, and lead to the inability of the potassium channel to open 
during an action potential. The inability of the potassium channel to open results in a decrease in 
the outward flow of potassium, which normally repolarizes the membrane of a neuron, leading to 
prolonged membrane depolarization. 

By targeting an agent to modulate a GPCR, the signaling activity and biological process 
mediated by the receptor can be agonized or antagonized. Such agonism and antagonism serves 
as a basis for modulating a biological activity in a therapeutic context (mammalian therapy) or 
toxic context (anti-invertebrate/insecticidal agent). 

DESCRIPTION OF THE FIGURE SHEETS 

FIGURE SHEETS 1-89 provides genomic nucleic acid sequences from Drosophila 
melanogaster, predicted transcript, amino acid coding regions, information relating to the 
subfamily of GPCR proteins to which the protein sequence belongs, e.g., BLAST hit homology, 
presence of hidden Markov model, etc. Also provided are 5' promoter sequences, transciption 
initiation sites and other structural features. 

DETAILED DESCRIPTION OF THE INVENTION 

General Description 

The present invention is based on the sequencing of the Drosophila melanogaster 
genome. During the sequencing and assembly of the Drosophila melanogaster genome, analysis 
of the sequence information revealed previously unidentified nucleic acid molecules that encode 
proteins that share structural and/or sequence homology to protein/peptide/domains identified 
and characterized within the art as being a GPCR protein or part of a GPCR protein. Based on 
this analysis, the present invention provides amino acid sequences of 66 GPCR proteins that are 
produced by Drosophila melanogaster, nucleic acid sequences that encode these GPCR proteins, 
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and the art known protein/peptide/domain that have structural or sequence homology to the 
Drosophila melanogaster protein. 

In addition to being previously unknown, the proteins and nucleic acid molecules that are 
provided in the present invention are selected based on their ability to be used for the 
development of commercially important products and services. Specifically, the present proteins 
are selected based on homology and/or structural relatedness to known GPCR proteins of 
commercial importance and/or the presence of genetic signals that define the protein as a GPCR 
protein. Some of the more specific features of the proteins of the present invention, and the uses 
thereof, are described in detail below, and are known within the art for each category of GPCR. 

Specific Embodiments 
Peptide Molecules 

In the Figure sheets, the present invention provides nucleic acid sequences that encode 66 
protein molecules that have been identified as being within the Drosophila melanogaster genome 
(SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, encoded by nucleic acid molecules provided in SEQ 
ID NOs: 1, 4, 7, . . . , 190, 193, 196 (genomic sequences) or SEQ ID NOs: 2, 5, 8, .... 191, 
194, 197 (transcript sequences)). These protein sequences will be referred herein as the GPCR 
proteins of the present invention, GPCR proteins, or the peptides or the proteins of the present 
invention. Table 1 provides a summary of the GPCRs of the present invention and the subfamily 
assignment of each one. 

The present invention provides isolated peptide and protein molecules that consist of, 
consist essentially of or are comprised of the amino acid sequences of the GPCR proteins 
disclosed in the Figure sheets (SEQ ID NO: 3, 6, 9, 189, 192, 195, 198, encoded by nucleic acid 
molecules provided in SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 (genomic sequences) or SEQ ID 
NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)), as well as all obvious variants of these 
proteins that are within the art to make and use. Some of these variants are described in detail 
below. 

As used herein, a protein is said to be "isolated" or "purified" when it is substantially free 
of cellular material or free of chemical precursors or other chemicals. The proteins of the present 
invention can be purified to homogeneity or other degrees of purity. The level of purification will 
be based on the intended use. The critical feature is that the preparation allows for the desired 
function of the protein, even if in the presence of considerable amounts of other components. 
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In some uses, "substantially free of cellular material" includes preparations of the protein 
having less than about 30% (by dry weight) other proteins (i.e., contaminating protein), less than 
about 20% other proteins, less than about 1 0% other proteins, or less than about 5% other proteins. 
When the protein is recombinantly produced, it can also be substantially free of culture medium, 
i.e., culture medium represents less than about 20% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the protein in which it is separated from chemical precursors or other chemicals that 
are involved in its synthesis. In one embodiment, the language "substantially free of chemical 
precursors or other chemicals" includes preparations of the GPCR protein having less than about 
30% (by dry weight) chemical precursors or other chemicals, less than about 20% chemical 
precursors or other chemicals, less than about 1 0% chemical precursors or other chemicals, or less 
than about 5% chemical precursors or other chemicals. 

The isolated GPCR protein can be purified from cells that naturally express it, purified from 
cells that have been altered to express it (recombinant), or synthesized using known protein 
synthesis methods. For example, a nucleic acid molecule encoding the GPCR protein is cloned into 
an expression vector, the expression vector introduced into a host cell and the protein expressed in 
the host cell. The protein can then be isolated from the cells by an appropriate purification scheme 
using standard protein purification techniques. Many of these techniques are described in detail 
below. 

Accordingly, the present invention provides proteins that consist of one of the amino acid 
sequences encoded by the nucleic acid sequences shown in the Figure sheets (SEQ ID NOs: 1 , 4, 7, 
. . . , 190, 193, 196 (genomic sequences) or SEQ ID NOs: 2, 5, 8, . ... 191, 194, 197 (transcript 
sequences)). The amino acid sequences of such proteins are provided in the Figure Sheets along 
with an explanation of the protein type/family. A protein consists of an amino acid sequence when 
the amino acid sequence is the final amino acid sequence of the protein. 

The present invention further provides proteins that consist essentially of one of the amino 
acid sequences encoded by the nucleic acid sequences shown in the Figure sheets (SEQ ID NOs: 1 , 
4, 7, . . . , 190, 193, 196 (genomic sequences) or SEQ ID NOs: 2, 5, 8, ..... 191, 194, 197 
(transcript sequences)). A protein consists essentially of an amino acid sequence when such an 
amino acid sequence is present with only a few additional amino acid residues in the final protein. 

The present invention further provides proteins that are comprised of one of the amino acid 
sequences encoded by the nucleic acid sequences shown in the Figure sheets (SEQ ID NOs: 1 , 4, 7, 
. . . , 190, 193, 196 (genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript 
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sequences)). A protein is comprised of an amino acid sequence when the amino acid sequence is at 
least part of the final amino acid sequence of the protein. In such a fashion, the protein can be only 
the protein or have additional amino acid molecules, such as amino acid residues (contiguous 
encoded sequence) that are naturally associated with it or heterologous amino acid residues/peptide 

5 sequences. Such a protein can have a few additional amino acid residues or can comprise several 
hundred or more additional amino acids. The preferred classes of proteins that are comprised of the 
GPCR proteins of the present invention are the naturally occurring mature proteins. 

The GPCR proteins of the present invention can be attached to heterologous sequences to 
form chimeric or fusion proteins. Such chimeric and fusion proteins comprise a GPCR protein 

10 operatively linked to a heterologous protein having an amino acid sequence not substantially 
homologous to the GPCR protein. "Operatively linked" indicates that the GPCR protein and the 
heterologous protein are fused in-frame. The heterologous protein can be fused to the N-terminus 
or C-terminus of the GPCR protein. 



15 example, the fusion protein can include, but is not limited to, enzymatic fusion proteins, for example 
beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions, MYC-tagged, HI- 
tagged and Ig fusions. Such fusion proteins, particularly poly-His fusions, can facilitate the 
purification of recombinant GPCR protein. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of a protein can be increased by using a heterologous signal sequence. 

20 A chimeric or fusion protein can be produced by standard recombinant DNA techniques. 

For example, DNA fragments coding for the different protein sequences are ligated together in- 
frame in accordance with conventional techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers which give rise to 

25 complementary overhangs between two consecutive gene fragments which can subsequently be 
annealed and re-amplified to generate a chimeric gene sequence (see Ausubel et al, Current 
Protocols in Molecular Biology, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST protein). A GPCR protein-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in- 

30 frame to the GPCR protein. 

As mentioned above, the present invention also provides and enables obvious variants of the 
amino acid sequence of the proteins of the present invention, such as naturally occurring mature 
forms of the protein, allelic/sequence variants of the proteins, non-naturally occurring 



In some uses, the fusion protein does not affect the activity of the GPCR protein per se. For 
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recombinantly derived variants of the proteins, and orthologs and paralogs of the proteins. Such 
variants can readily be generated using art know techniques in the fields of recombinant nucleic acid 
technology and protein biochemistry. It is understood, however, that variants exclude any amino 
acid sequences disclosed prior to the invention. 



information disclosed herein. Further, such variants can readily be distinguished from other 
proteins based on sequence and/or structural homology to the GPCR proteins of the present 
invention. The degree of homology/identity present will be based primarily on whether the protein 
is a functional variant or non-functional variant, the amount of divergence present in the paralog 

l o family and the evolutionary distance between the orthologs. 

To determine the percent identity of two amino acid sequences or two nucleic acid 
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal 
alignment and non-homologous sequences can be disregarded for comparison purposes). In a 

15 preferred embodiment, the length of a reference sequence aligned for comparison purposes is at 
least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or more of the length of the reference sequence. 
The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide 
positions are then compared. When a position in the first sequence is occupied by the same 
amino acid residue or nucleotide as the corresponding position in the second sequence, then the 

20 molecules are identical at that position (as used herein amino acid or nucleic acid "identity" is 
equivalent to amino acid or nucleic acid "homology"). The percent identity between the two 
sequences is a function of the number of identical positions shared by the sequences, taking into 
account the number of gaps, and the length of each gap, which need to be introduced for optimal 
alignment of the two sequences. 

25 The comparison of sequences and determination of percent identity and similarity 

between two sequences can be accomplished using a mathematical algorithm. (Computational 
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New 

30 Jersey, 1 994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 
1991). In a preferred embodiment, the percent identity between two amino acid sequences is 
determined using the Needleman and Wunsch (J. Mol. Biol. (48):444-453 (1970)) algorithm 



5 



Such variants can readily be identified/made using molecular techniques and the sequence 
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which has been incorporated into the GAP program in the GCG software package (available at 
http://www.gcg.com), using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight 
of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred 
embodiment, the percent identity between two nucleotide sequences is determined using the 
5 GAP program in the GCG software package (Devereux, J., et al., Nucleic Acids Res. 12(l)\h%l 
(1984)) (available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 
40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the 
percent identity between two amino acid or nucleotide sequences is determined using the 
algorithm of E. Meyers and W. Miller (CABIOS, 4:1 1-17 (1989)) which has been incorporated 

10 into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length 
penalty of 1 2 and a gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be used as a 
"query sequence" to perform a search against sequence databases to, for example, identify other 
family members or related sequences. Such searches can be performed using the NBLAST and 

15 XBLAST programs (version 2.0) of Altschul, et al. (J. Mol. Biol. 215:403-10 (1990)). BLAST 
nucleotide searches can be performed with the NBLAST program, score = 100, wordlength = 12 
to obtain nucleotide sequences homologous to the nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, wordlength = 
3 to obtain amino acid sequences homologous to the proteins of the invention. To obtain gapped 

20 alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et 
al. (Nucleic Acids Res. 25(17):3389-3402 (1997)). When utilizing BLAST and gapped BLAST 
programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can 

be used. See http://www.ncbi.nlm.nih.gov . 



25 comprise one of the proteins of the present invention can readily be identified as having complete 
sequence identity to one of the GPCR proteins of the present invention as well as being encoded by 
the same genetic locus as the GPCR protein provided herein. 

Allelic variants of a GPCR protein can readily be identified as having a high degree 
(significant) of sequence homology/identity to at least a portion of the GPCR protein as well as 

30 being encoded by the same genetic locus as the GPCR protein provided herein. As used herein, two 
proteins (or a region of the proteins) have significant homology when the amino acid sequences 
are typically at least about 70-75%, 80-85%, and more typically at least about 90-95% or more 
homologous. A significantly homologous amino acid sequence, according to the present 



Full-length pre-processed forms, as well as mature processed forms, of proteins that 
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invention, will be encoded by a nucleic acid sequence that will hybridize to a GPCR protein 
encoding nucleic acid molecule under stringent conditions as more fully described below. 

Paralogs of a GPCR protein can readily be identified as having some degree of significant 
sequence homology/identity to at least a portion of the GPCR protein, as being encoded by a gene 
5 from Drosophila, and as having similar activity or function. Two proteins will typically be 

considered paralogs when the amino acid sequences are typically at least about 70-75%, 80-85%, 
and more typically at least about 90-95% or more homologous through a given region or domain. 
Such paralogs will be encoded by a nucleic acid sequence that will hybridize to a GPCR protein 
encoding nucleic acid molecule under stringent conditions as more fully described below. 

10 Orthologs of a GPCR protein can readily be identified as having some degree of significant 

sequence homology/identity to at least a portion of the GPCR protein as well as being encoded by a 
gene from another organism. Preferred orthologs will be isolated from mammals, preferably 
human, for the development of human therapeutic targets and agents, or other invertebrates, 
particularly insects of economical/agriculture importance, e.g. members of the Lepidopteran and 

15 Coleopteran orders, for the development of insecticides and insecticidal targets. Such orthologs will 
be encoded by a nucleic acid sequence that will hybridize to a GPCR protein encoding nucleic 
acid molecule under moderate to stringent conditions, as more fully described below, depending 
on the degree of relatedness of the two organisms yielding the proteins. 

Non-naturally occurring variants of the GPCR proteins of the present invention can readily 

20 be generated using recombinant techniques. Such variants include, but are not limited to deletions, 
additions and substitutions in the amino acid sequence of the GPCR protein. For example, one class 
of substitutions are conserved amino acid substitution. Such substitutions are those that substitute a 
given amino acid in a GPCR protein by another amino acid of like characteristics. Typically seen as 
conservative substitutions are the replacements, one for another, among the aliphatic amino acids 

25 Ala, Val, Leu, and He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic 
residues Asp and Glu, substitution between the amide residues Asn and Gin, exchange of the basic 
residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. Guidance 
concerning which amino acid changes are likely to be phenotypically silent are found in Bowie et 
al., Science 247: 1 306-1 3 10(1 990). 

30 Variant GPCR proteins can be fully functional or can lack function in one or more activities, 

e.g. ability to bind ligand, ability to bind G-protein, ability to mediate signaling, etc. Fully 
functional variants typically contain only conservative variation or variation in non-critical residues 
or in non-critical regions. Functional variants can also contain substitution of similar amino acids 
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that result in no change or an insignificant change in function. Alternatively, such substitutions may 
positively or negatively affect function to some degree. 

Non-functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or 
deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods known in the art, 
such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al. , Science 
244:1081-1085 (1989)). The latter procedure introduces single alanine mutations at every residue in 
the molecule. The resulting mutant molecules are then tested for biological activity such as receptor 
binding or in assays such as an in vitro proliferative activity. Sites that are critical for ligand- 
receptor binding can also be determined by structural analysis such as crystallization, nuclear 
magnetic resonance or photoaffinity labeling (Smith et al, J. Mol. Biol. 224:899-904 (1992); de 
Vos et al. Science 255:306-312 (1992)). 

Polypeptides often contain amino acids other than the 20 amino acids commonly referred to 
as the 20 naturally-occurring amino acids. Further, many amino acids, including the terminal amino 
acids, may be modified by natural processes, such as processing and other post-translational 
modifications, or by chemical modification techniques well known in the art. Common 
modifications that occur naturally in polypeptides are described in basic texts, detailed monographs, 
and the research literature, and they are well known to those of skill in the art. 

Accordingly, the polypeptides also encompass derivatives or analogs in which a substituted 
amino acid residue is not one encoded by the genetic code, in which a substituent group is included, 
in which the mature polypeptide is fused with another compound, such as a compound to increase 
the half-life of the polypeptide (for example, polyethylene glycol), or in which the additional amino 
acids are fused to the mature polypeptide, such as a leader or secretory sequence or a sequence for 
purification of the mature polypeptide or a pro-protein sequence. 

The present invention further provides fragments of the GPCR proteins, in addition to 
proteins and peptides that comprise and consist of such fragments. The fragments to which the 
invention pertains, however, are not to be construed as encompassing fragments that may be 
disclosed publicly prior to the present invention. 

As used herein, a fragment comprises at least 8 or more contiguous amino acid residues 
from a GPCR protein. Such fragments can be chosen based on the ability to retain one or more of 
the biological activities of the GPCR protein or could be chosen for the ability to perform a 
function, e.g. act as an immunogen. Particularly important fragments are biologically active 
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fragments, peptides which are, for example, about 8 or more amino acids in length. Such fragments 
will typically comprise a domain or motif of the GPCR protein, e.g., active site, a G-protein binding 
site, a transmembrane domain or a ligand binding domain. Further, possible fragments include, but 
are not limited to, domain or motif containing fragments, soluble peptide fragments, and fragments 
5 containing immunogenic structures. Predicted domains and functional sites are readily identifiable 
by computer programs well-known and readily available to those of skill in the art (e.g., PROSITE 
analysis). 

Polypeptides often contain amino acids other than the 20 amino acids commonly referred to 
as the 20 naturally-occurring amino acids. Further, many amino acids, including the terminal amino 
10 acids, may be modified by natural processes, such as processing and other post-translational 
modifications, or by chemical modification techniques well known in the art. Common 
modifications that occur naturally in GPCR proteins are described in basic texts, detailed 
monographs, and the research literature, and they are well known to those of skill in the art. 



15 analogs in which a substituted amino acid residue is not one encoded by the genetic code, in which 
a substituent group is included, in which the mature GPCR protein is fused with another compound, 
such as a compound to increase the half-life of the GPCR protein (for example, polyethylene 
glycol), or in which the additional amino acids are fused to the mature GPCR protein, such as a 
leader or secretory sequence or a sequence for purification of the mature GPCR protein or a pro- 

20 protein sequence. 

Known modifications include, but are not limited to, acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, 
covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 

25 formation, demethylation, formation of covalent crosslinks, formation of cystine, formation of 
pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated 
addition of amino acids to proteins such as arginylation, and ubiquitination. 

30 Such modifications are well-known to those of skill in the art and have been described in 

great detail in the scientific literature. Several particularly common modifications, glycosylation, 
lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and 
ADP-ribosylation, for instance, are described in most basic texts, such as Proteins - Structure and 



Accordingly, the GPCR proteins of the present invention also encompass derivatives or 
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Molecular Properties, 2nd Ed., T.E. Creighton, W. H. Freeman and Company, New York (1993). 
Many detailed reviews are available on this subject, such as by Wold, F., Posttranslational Covalent 
Modification of Proteins, B.C. Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter et al. 
{Melh. Enzymol. 182: 626-646 (1990)) and Rattan etal. {Ann. N.Y.Acad. Sci. 66SA%-62 (1992)). 

5 

Protein/Peptide Uses 

The proteins of the present invention can be used in assays to identify modulators as 
potential insecticides, to determine the biological activity of the protein, including in a panel of 
multiple proteins for high-throughput screening; to raise antibodies or to elicit another immune 

10 response; as a reagent (including the labeled reagent) in assays designed to quantitatively 
determine levels of the protein (or its binding partner or receptor) in biological fluids; and as 
markers for tissues in which the corresponding protein is preferentially expressed. Where the 
protein binds or potentially binds to another protein (such as, for example, in a receptor-ligand 
interaction), the protein can be used to identify the binding partner so as to develop a system to 

15 identify inhibitors of the binding interaction. Any or all of these research utilities are capable of 
being developed into reagent grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art. 
References disclosing such methods include "Molecular Cloning: A Laboratory Manual", 2d ed., 
Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis eds., 1989, 

20 and "Methods in Enzymology: Guide to Molecular Cloning Techniques", Academic Press, 

Berger, S. L. and A. R. Kimmel eds., 1987 and can readily be determined using the information 
provided in Table 1 . 

The potential uses of the proteins of the present invention are based primarily on the 
source of the protein as well as the class/action of the protein. For example, GPCRs isolated 

25 from Drosophila and other invertebrates serve as a target for identifying anti-invertebrate 

compounds, e.g. insecticides. GPCRs isolated from Drosophila and their human/mammalian 
orthologs serve as targets for identifying agents for use in mammalian therapeutic applications, 
e.g. a human drug. Approximately 70% of all pharmaceutical agents modulate the activity of a 
GPCR. A combination of the invertebrate and mammalian ortholog can be used in selective 

30 screening methods to find agents specific for invertebrates. Table 1 provides specific context of 
use for each of the presently disclosed GPCRs. Particularly useful are GPCRs involved in 
neurotransmission for use as insecticide targets. 
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The receptor polypeptides (including variants and fragments which may have been disclosed 
prior to the present invention) are useful for biological assays related to GPCRs. Such assays 
involve any of the known GPCR functions or activities or properties useful for diagnosis and 
treatment of GPCR-related conditions. 
5 The receptor polypeptides are also useful in drug/insecticide screening assays, in cell-based 

or cell-free systems. Cell-based systems can be native, i.e., cells that normally express the receptor 
protein, as a biopsy or expanded in cell culture. In one embodiment, however, cell-based assays 
involve recombinant host cells expressing the receptor protein. 

The polypeptides can be used to identify compounds that modulate receptor activity. Both 

10 the GPCRs of the present invention and appropriate variants and fragments can be used in high- 
throughput screens to assay candidate compounds for the ability to bind to the receptor. These 
compounds can be further screened against a functional receptor to determine the effect of the 
compound on the receptor activity. Further, these compounds can be tested in animal or 
invertebrate systems to determine activity/effectiveness. Compounds can be identified that activate 

1 5 (agonist) or inactivate (antagonist) the receptor to a desired degree. 

Further, the receptor polypeptides can be used to screen a compound for the ability to 
stimulate or inhibit interaction between the receptor protein and a target molecule that normally 
interacts with the receptor protein. The target can be ligand or a component of the signal pathway 
that the receptor protein normally interacts (for example, a G-protein or other interactor involved in 

20 cAJVIP or phosphatidylinositol turnover and/or adenylate cyclase, or phospholipase C activation). 
Such assays typically include the steps of combining the receptor protein with a candidate 
compound under conditions that allow the receptor protein, or fragment, to interact with the target 
molecule, and to detect the formation of a complex between the protein and the target or to detect 
the biochemical consequence of the interaction with the receptor protein and the target, such as any 

25 of the associated effects of signal transduction such as G-protein phosphorylation, cAMP or 
phosphatidylinositol turnover, and adenylate cyclase or phospholipase C activation. 

Candidate compounds include, for example, 1) peptides such as soluble peptides, including 
Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et al., Nature 
354-.Z2-S4 (1991); Houghten et al., Nature 554:84-86 (1991)) and combinatorial chemistry-derived 

30 molecular libraries made of D- and/or L- configuration amino acids; 2) phosphopeptides (e.g., 

members of random and partially degenerate, directed phosphopeptide libraries, see, e.g., Songyang 
et al., Cell 72:767-778 (1993)); 3) antibodies (e.g., polyclonal, monoclonal, humanized, anti- 
idiotypic, chimeric, and single chain antibodies as well as Fab, F(ab')2, Fab expression library 
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fragments, and epitope-binding fragments of antibodies); and 4) small organic and inorganic 
molecules (e.g., molecules obtained from combinatorial and natural product libraries). 

One candidate compound is a soluble fragment of the receptor that competes for ligand 
binding. Other candidate compounds include mutant receptors or appropriate fragments containing 
5 mutations that affect receptor function and thus compete for ligand. Accordingly, a fragment that 
competes for ligand, for example with a higher affinity, or a fragment that binds ligand but does not 
allow release, is encompassed by the invention. 

The invention further includes other end point assays to identify compounds that modulate 
(stimulate or inhibit) receptor activity. The assays typically involve an assay of events in the signal 

10 transduction pathway that indicate receptor activity. Thus, the expression of genes that are up- or 
down-regulated in response to the receptor protein dependent signal cascade can be assayed. In one 
embodiment, the regulatory region of such genes can be operably linked to a marker that is easily 
detectable, such as luciferase. Alternatively, phosphorylation of the receptor protein, or a receptor 
protein target, could also be measured. 

1 5 Any of the biological or biochemical functions mediated by the receptor can be used as an 

endpoint assay. These include all of the biochemical or biochemical/biological events described 
herein, in the references cited herein, incorporated by reference for these endpoint assay targets, and 
other functions known to those of ordinary skill in the art. 

Binding and/or activating compounds can also be screened by using chimeric receptor 

20 proteins in which the amino terminal extracellular domain, or parts thereof, the entire 

transmembrane domain or subregions, such as any of the seven transmembrane segments or any of 
the intracellular or extracellular loops and the carboxy terminal intracellular domain, or parts 
thereof, can be replaced by heterologous domains or subregions. For example, a G-protein-binding 
region can be used that interacts with a different G-protein then that which is recognized by the 

25 native receptor. Accordingly, a different set of signal transduction components is available as an 
end-point assay for activation. Alternatively, the entire transmembrane portion or subregions (such 
as transmembrane segments or intracellular or extracellular loops) can be replaced with the entire 
transmembrane portion or subregions specific to a host cell that is different from the host cell from 
which the amino terminal extracellular domain and/or the G-protein-binding region are derived. 

30 This allows for assays to be performed in other than the specific host cell from which the receptor is 
derived. Alternatively, the amino terminal extracellular domain (and/or other ligand-binding 
regions) could be replaced by a domain (and/or other binding region) binding a different ligand, 
thus, providing an assay for test compounds that interact with the heterologous amino terminal 
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extracellular domain (or region) but still cause signal transduction. Finally, activation can be 
detected by a reporter gene containing an easily detectable coding region operably linked to a 
transcriptional regulatory sequence that is part of the native signal transduction pathway. 

The receptor polypeptides are also useful in competition binding assays in methods designed 
5 to discover compounds that interact with the receptor. Thus, a compound is exposed to a receptor 
polypeptide under conditions that allow the compound to bind or to otherwise interact with the 
polypeptide. Soluble receptor polypeptide is also added to the mixture. If the test compound 
interacts with the soluble receptor polypeptide, it decreases the amount of complex formed or 
activity from the receptor target. This type of assay is particularly useful in cases in which 

10 compounds are sought that interact with specific regions of the receptor. Thus, the soluble 

polypeptide that competes with the target receptor region is designed to contain peptide sequences 
corresponding to the region of interest. 

To perform cell free drug/insecticide screening assays, it is sometimes desirable to 
immobilize either the receptor protein, or fragment, or its target molecule to facilitate separation of 

1 5 complexes from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. 

Techniques for immobilizing proteins on matrices can be used in the drug/insecticide 
screening assays. In one embodiment, a fusion protein can be provided which adds a domain that 
allows the protein to be bound to a matrix. For example, glutathione-S-transferase/15625 fusion 

20 proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then combined with the cell lysates (e.g., 35 S- 
labeled) and the candidate compound, and the mixture incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the 
beads are washed to remove any unbound label, and the matrix immobilized and radiolabel 

25 determined directly, or in the supernatant after the complexes are dissociated. Alternatively, the 
complexes can be dissociated from the matrix, separated by SDS-PAGE, and the level of receptor- 
binding protein found in the bead fraction quantitated from the gel using standard electrophoretic 
techniques. For example, either the polypeptide or its target molecule can be immobilized utilizing 
conjugation of biotin and streptavidin using techniques well known in the art. Alternatively, 

30 antibodies reactive with the protein but which do not interfere with binding of the protein to its 
target molecule can be derivatized to the wells of the plate, and the protein trapped in the wells by 
antibody conjugation. Preparations of a receptor-binding protein and a candidate compound are 
incubated in the receptor protein-presenting wells and the amount of complex trapped in the well 
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can be quantitated. Methods for detecting such complexes, in addition to those described above for 
the GST-immobilized complexes, include immunodetection of complexes using antibodies reactive 
with the receptor protein target molecule, or which are reactive with receptor protein and compete 
with the target molecule, as well as enzyme-linked assays which rely on detecting an enzymatic 
activity associated with the target molecule. 

Agents that modulate one of the GPCRs of the present invention can be identified using one 
or more of the above assays, alone or in combination. It is generally preferable to use a cell-based 
or cell free system first and then confirm activity in an animal/insect model system. Such model 
systems are well known in the art and can readily be employed in this context. 

Modulators of receptor protein activity identified according to these drug/insecticide 
screening assays can be used to treat a subject with a disorder mediated by the receptor pathway, by 
treating cells that express the GPCR. These methods of treatment include the steps of administering 
the modulators of protein activity in a pharmaceutical composition as described herein, to a subject 
in need of such treatment. 

In yet another aspect of the invention, the GPCR proteins can be used as "bait proteins" 
in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,3 1 7; Zervos et al. 
(1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. 
(1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; and Brent 
WO94/10300), to identify other proteins, which bind to or interact with the GPCR and are 
involved in GPCR activity. Such GPCR-binding proteins are also likely to be involved in the 
propagation of signals by the GPCR proteins or GPCR targets as, for example, downstream 
elements of a GPCR-mediated signaling pathway, e.g., a signaling pathway. Alternatively, such 
GPCR-binding proteins are likely to be GPCR inhibitors. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
different DNA constructs. In one construct, the gene that codes for a GPCR protein is fused to a 
gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the 
other construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified 
protein ("prey" or "sample") is fused to a gene that codes for the activation domain of the known 
transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming a 
GPCR-dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene (e.g., 
LacZ) which is operably linked to a transcriptional regulatory site responsive to the transcription 
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factor. Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene which encodes 
the protein which interacts with the GPCR protein. 

This invention further pertains to novel agents identified by the above-described 
5 screening assays. Accordingly, it is within the scope of this invention to further use an agent 
identified as described herein in an appropriate animal model. For example, an agent identified 
as described herein (e.g., a GPCR modulating agent, an antisense GPCR nucleic acid molecule, a 
GPCR-specific antibody, or a GPCR-binding partner) can be used in an animal or insect model 
to determine the efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, 
1 0 an agent identified as described herein can be used in an animal or insect model to determine the 
mechanism of action of such an agent. Furthermore, this invention pertains to uses of novel 
agents identified by the above-described screening assays for insecticidal activity as described 
herein. 

15 Antibodies 

The invention also provides antibodies that selectively bind to one of the proteins of the 
present invention, a protein comprising such a peptide, as well as variants and fragments thereof. 
As used herein, an antibody selectively binds a target peptide when it binds the target peptide and 
does not significantly bind to unrelated proteins. An antibody is still considered to selectively bind 

20 a peptide even if it also binds to other proteins that are not substantially homologous with the target 
peptide so long as such proteins share homology with a fragment or domain of the peptide target of 
the antibody. In this case, it would be understood that antibody binding to the peptide is still 
selective despite some degree of cross-reactivity. 

As used herein, an antibody is defined in terms consistent with that recognized within the 

25 art: they are multi-subunit proteins produced by a mammalian organism in response to an antigen 
challenge. The antibodies of the present invention include polyclonal antibodies and monoclonal 
antibodies, as well as fragments of such antibodies, including, but not limited to, Fab or F(ab')2, and 
Fv fragments. 

Many methods are known for generating and/or identifying antibodies to a given target 
30 peptide. Several such methods are described by Harlow, Antibodies, Cold Spring Harbor Press, 
(1989). 
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In general, to generate antibodies, an isolated peptide is used as an immunogen and is 
administered to a mammalian organism, such as a rat, rabbit or mouse. Either the full-length 
protein, an antigenic peptide fragment or a fusion protein can be used. 

Antibodies are preferably prepared from regions or discrete fragments of the GPCR 
5 proteins. Antibodies can be prepared from any region of the protein as described herein. 
However, preferred regions will include those involved in function/activity and/or 
receptor/binding partner interaction. 

An antigenic fragment will typically comprise at least 10 contiguous amino acid residues. 
The antigenic peptide can comprise, however, at least 12, 14, 20 or more amino acid residues. Such 
10 fragments can be selected on a physical property, such as fragments correspond to regions that are 
located on the surface of the protein, e.g., hydrophilic regions or can be selected based on sequence 
uniqueness. 

Detection on an antibody of the present invention can be facilitated by coupling (i.e., 
physically linking) the antibody to a detectable substance. Examples of detectable substances 

1 5 include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. Examples of suitable enzymes include 
horseradish peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; examples of 
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 

20 rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include luciferase, 
luciferin, and aequorin, and examples of suitable radioactive material include 125 I, l3l I, 35 S or 3 H. 

Antibody Uses 

25 The antibodies can be used to isolate one of the proteins of the present invention by standard 

techniques, such as affinity chromatography or immunoprecipitation. The antibodies can facilitate 
the purification of the natural protein from cells and recombinantly produced protein expressed in 
host cells. In addition, such antibodies are useful to detect the presence of one of the proteins of the 
present invention in cells or tissues to determine the pattern of expression of the protein among 

30 various tissues in an organism and over the course of normal development. Further, such antibodies 
can be used to detect protein in situ, in vitro, or in a cell lysate or supernatant in order to evaluate the 
abundance and pattern of expression. Also, such antibodies can be used to assess abnormal tissue 
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distribution or abnormal expression during development. Antibody detection of circulating 
fragments of the full length protein can be used to identify turnover. 

The antibodies can also be used to assess normal and aberrant subcellular localization of 
cells in the various tissues in an organism. The diagnostic uses can be applied, not only in genetic 
testing, but also in monitoring a treatment modality. Accordingly, where treatment is ultimately 
aimed at correcting expression level or the presence of aberrant sequence and aberrant tissue 
distribution or developmental expression, antibodies directed against the or relevant fragments can 
be used to monitor therapeutic efficacy. 

The antibodies are also useful for inhibiting protein function, for example, blocking the 
binding of the GPCR protein to a binding partner such as a ligand. An antibody can be used, for 
example, to block binding, thus modulating (agonizing or antagonizing) the peptides activity. 
Antibodies can be prepared against specific fragments containing sites required for function or 
against intact protein that is associated with a cell or cell membrane. 

The invention also encompasses kits for using antibodies to detect the presence of a protein 
in a biological sample. The kit can comprise antibodies such as a labeled or labelable antibody and 
a compound or agent for detecting protein in a biological sample; means for determining the 
amount of protein in the sample; means for comparing the amount of protein in the sample with a 
standard; and instructions for use. 

Nucleic Acid Molecules 

The present invention further provides isolated nucleic acid molecules that encode a GPCR 
protein or protein of the present invention (SEQ ID NO: 3, 6, 9, ... . , 192, 195, 198, encoded by 
nucleic acid molecules provided in SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 (genomic 
sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)). Such nucleic 
acid molecules will consist of, consist essentially of, or comprise a nucleotide sequence that encodes 
one of the GPCR proteins of the present invention, an allelic variant thereof, or an ortholog or 
paralog thereof. 

As used herein, an "isolated" nucleic acid molecule is one that is separated from other 
nucleic acid present in the natural source of the nucleic acid. Preferably, an "isolated" nucleic acid 
is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' 
ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is 
derived. However, there can be some flanking nucleotide sequences, for example up to about 5KB, 
4KB, 3KB, 2KB, or 1KB or less, particularly contiguous protein encoding sequences and protein 
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encoding sequences within the same gene but separated by introns in the genomic sequence. The 
important point is that the nucleic acid is isolated from remote and unimportant flanking sequences 
such that it can be subjected to the specific manipulations described herein such as recombinant 
expression, preparation of probes and primers, and other uses specific to the nucleic acid sequences. 

Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture medium when produced by recombinant 
techniques, or chemical precursors or other chemicals when chemically synthesized. However, the 
nucleic acid molecule can be fused to other coding or regulatory sequences and still be considered 
isolated. 

For example, recombinant DNA molecules contained in a vector are considered isolated. 
Further examples of isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated 
RNA molecules include in vivo or in vitro RNA transcripts of the isolated DNA molecules of the 
present invention. Isolated nucleic acid molecules according to the present invention further include 
such molecules produced synthetically. 

Accordingly, the present invention provides nucleic acid molecules that consist of one of the 
nucleotide sequences shown in the Figure sheets (SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 
(genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)). A 
nucleic acid molecule consists of a nucleotide sequence when the nucleotide sequence is the 
complete nucleotide sequence of the nucleic acid molecule. 

The present invention further provides nucleic acid molecules that consist essentially of one 
of the nucleotide sequences shown in the Figure sheets (SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 
(genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)). A 
nucleic acid molecule consists essentially of a nucleotide sequence when such a nucleotide 
sequence is present with only a few additional nucleic acid residues in the final nucleic acid 
molecule. 

The present invention further provides nucleic acid molecules that are comprised of one of 
the nucleotide sequences shown in the Figure sheets (SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 
(genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)). A 
nucleic acid molecule is comprised of a nucleotide sequence when the nucleotide sequence is at 
least part of the final nucleotide sequence of the nucleic acid molecule. In such a fashion, the 
nucleic acid molecule can be only the nucleotide sequence or have additional nucleic acid residues, 
such as nucleic acid residues that are naturally associated with it or heterologous nucleotide 
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sequences. Such a nucleic acid molecule can have a few additional nucleotides or can comprises 
several hundred or more additional nucleotides. The preferred classes of nucleic acid molecules that 
are comprised of the nucleotide sequences of the present are the naturally occurring full-length 
cDNA molecules and genes and genomic clones since some of the nucleic acid molecules provided 
in the Figure sheets are fragments of the complete gene that exists in nature. A brief description of 
how various types of these nucleic acid molecules can be readily made/isolated is provided below. 

In the Figures, both coding and non-coding sequences are provided for each protein 
encoding nucleic acid sequence. Because of the source of the present invention, Drosophila 
genomic sequences, the nucleic acid molecules in the figures will contain genomic intronic 
sequences, 5' and 3' non-coding sequences, gene regulatory regions and non-coding intergenic 
sequences. In general such sequence features are either noted or can readily be identified using 
computational tools known in the art. As discussed below, some of the non-coding regions, 
particularly gene regulatory elements such as promoters, are useful for a variety of purposes, e.g. 
control of heterologous gene expression, target for identifying gene activity modulating 
compounds. 

Full-length genes may be cloned from known sequence using any one of a number of 
methods known in the art. For example, a method which employs XL-PCR (Perkin-EImer, 
Foster City, Calif.) to amplify long pieces of DNA may be used. Other methods for obtaining 
full-length sequences are well known in the art. 

The isolated nucleic acid molecules can encode the mature protein plus additional amino or 
carboxyl-terminal amino acids, or amino acids interior to the mature protein (when the mature form 
has more than one protein chain, for instance). Such sequences may play a role in processing of a 
protein from precursor to a mature form, facilitate protein trafficking, prolong or shorten protein 
half-life or facilitate manipulation of a protein for assay or production, among other things. As 
generally is the case in situ, the additional amino acids may be processed away from the mature 
protein by cellular enzymes. 

As mentioned above, the isolated nucleic acid molecules include, but are not limited to, the 
sequence encoding the GPCR protein alone, the sequence encoding the mature protein and 
additional coding sequences, such as a leader or secretory sequence (e.g., a pre-pro or pro-protein 
sequence), the sequence encoding the mature protein, with or without the additional coding 
sequences, plus additional non-coding sequences, for example introns and non-coding 5' and 3' 
sequences such as transcribed but non-translated sequences that play a role in transcription, mRNA 
processing (including splicing and polyadenylation signals), ribosome binding and stability of 
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mRNA. In addition, the nucleic acid molecule may be fused to a marker sequence encoding, for 
example, a protein that facilitates purification. 

Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or in the form 
DNA, including cDNA and genomic DNA obtained by cloning or produced by chemical synthetic 
5 techniques or by a combination thereof. The nucleic acid, especially DNA, can be double-stranded 
or single-stranded. Single-stranded nucleic acid can be the coding strand (sense strand) or the non- 
coding strand (anti-sense strand). 

The invention further provides nucleic acid molecules that encode fragments of the proteins 
of the present invention as well as nucleic acid molecules that encode obvious variants of the GPCR 

l o proteins of the present invention that are described above. Such nucleic acid molecules may be 
naturally occurring, such as allelic variants (same locus), paralogs (different locus), and orthologs 
(different organism), or may be constructed by recombinant DNA methods or by chemical 
synthesis. Such non-naturally occurring variants may be made by mutagenesis techniques, 
including those applied to nucleic acid molecules, cells, or organisms. Accordingly, as discussed 

1 5 above, the variants can contain nucleotide substitutions, deletions, inversions and insertions. 
Variation can occur in either or both the coding and non-coding regions. The variations can 
produce both conservative and non-conservative amino acid substitutions. 

The present invention further provides non-coding fragments of the nucleic acid molecules 
provided in the Figures. Preferred non-coding fragments include, but are not limited to, promoter 

20 sequences, enhancer sequences, gene modulating sequences and gene termination sequences. Such 
fragments are useful in controlling heterologous gene expression and in developing screens to 
identify gene modulating agents. Particularly useful are fragments from about 1 00 to about 1 ,000 
nucleotides taken 5' from the start ATG in the genomic sequences provided in the Figures (SEQ ID 
NOs: 1,4,7, ...,190,193,196,). 

25 A fragment comprises a contiguous nucleotide sequence greater than 12 or more 

nucleotides. Further, a fragment could at least 30, 40, 50, 100, 250 or 500 nucleotides in length. 
The length of the fragment will be based on its intended use. For example, the fragment can encode 
epitope bearing regions of the protein, or can be useful as DNA probes and primers. Such 
fragments can be isolated using the known nucleotide sequence to synthesize an oligonucleotide 

30 probe. A labeled probe can then be used to screen a cDNA library, genomic DNA library, or 

mRNA to isolate nucleic acid corresponding to the coding region. Further, primers can be used in 
PCR reactions to clone specific regions of gene. 
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A probe/primer typically comprises substantially a purified oligonucleotide or 
oligonucleotide pair. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 20, 25, 40, 50 or more consecutive 
nucleotides. 

5 Orthologs, homologs, and allelic variants can be identified using methods well known in the 

art. As described in the protein Section, these variants comprise a nucleotide sequence encoding a 
protein that is typically 60-65%, 70-75%, 80-85%, and more typically at least about 90-95% or 
more homologous to the nucleotide sequence shown in the Figure sheets or a fragment of this 
sequence. Such nucleic acid molecules can readily be identified as being able to hybridize under 

10 moderate to stringent conditions, to the nucleotide sequence shown in the Figure sheets or a 
fragment of the sequence. 

As used herein, the term "hybridizes under stringent conditions" is intended to describe 
conditions for hybridization and washing under which nucleotide sequences encoding a protein at 
least 50-55% homologous to each other typically remain hybridized to each other. The conditions 

1 5 can be such that sequences at least about 65%, at least about 70%, or at least about 75% or more 
homologous to each other typically remain hybridized to each other. Such stringent conditions are 
known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John 
Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45C, followed by one or more 

20 washes in 0.2 X SSC, 0. 1% SDS at 50-65C. 



Nucleic Acid Molecule Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, chemical 
intermediates, and in biological assays. The nucleic acid molecules are useful as a hybridization 
25 probe for cDNA and genomic DNA to isolate full-length cDNA and genomic clones encoding the 
protein described in the Figures and to isolate cDNA and genomic clones that correspond to variants 
(alleles, orthologs, etc.) producing the same or related proteins shown in the Figures. 

The probe can correspond to any sequence along the entire length of the nucleic acid 
molecules provided in the Figures. Accordingly, it could be derived from 5' noncoding regions, the 
30 coding region, and 3' noncoding regions. However, as discussed, fragments are not to be construed 
as encompassing fragments disclosed prior to the present invention. 
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The nucleic acid molecules are also useful as primers for PCR to amplify any given region 
of a nucleic acid molecule and are useful to synthesize antisense molecules of desired length and 
sequence. 

The nucleic acid molecules are also useful for constructing recombinant vectors. Such 
5 vectors include expression vectors that express a portion of, or all of, the protein sequences. 
Vectors also include insertion vectors, used to integrate into another nucleic acid molecule 
sequence, such as into the cellular genome, to alter in situ expression of a gene and/or gene product. 
For example, an endogenous coding sequence can be replaced via homologous recombination with 
all or part of the coding region containing one or more specifically introduced mutations. 
I o The nucleic acid molecules are also useful for expressing antigenic portions of the proteins. 

The nucleic acid molecules are also useful in making vectors containing the gene regulatory 
regions of the nucleic acid molecules of the present invention. 

The nucleic acid molecules are also useful for designing ribozymes corresponding to all, or 
a part, of the mRNA produced from the nucleic acid molecules described herein. 
1 5 The nucleic acid molecules are also useful for constructing host cells expressing a part, or 

all, of the nucleic acid molecules and proteins. 

The nucleic acid molecules are also useful for constructing transgenic animals expressing 
all, or a part, of the nucleic acid molecules and proteins. 

The nucleic acid molecules are also useful for making vectors that express part, or all, of the 
20 proteins. 

The nucleic acid molecules are also useful as hybridization probes for determining the 
presence, level, form and distribution of nucleic acid expression. Accordingly, the probes can be 
used to detect the presence of, or to determine levels of, a specific nucleic acid molecule in cells, 
tissues, and in organisms. The nucleic acid whose level is determined can be DNA or RNA. 

25 Accordingly, probes corresponding to the proteins described herein can be used to assess expression 
and/or gene copy number in a given cell, tissue, or organism. These uses are relevant for efficacy of 
insecticides involving an increase or decrease in GPCR protein expression relative to normal results. 

In vitro techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detecting DNA includes Southern hybridizations and in situ 

30 hybridization. 

Probes can be used as a part of a diagnostic test kit for identifying cells or tissues that 
express a GPCR protein, such as by measuring a level of a receptor-encoding nucleic acid in a 



28 



SUBSTITUTE SHEET (RULE 26) 




WO 01/070980 PCT/US01/09341 

sample of cells from a subject e.g., mRNA or genomic DNA, or determining if a receptor gene has 
been mutated. 

Nucleic acid expression assays are useful for drug/insecticide screening to identify 
compounds that modulate GPCR nucleic acid expression. 
5 The invention thus provides a method for identifying a compound that can be used to treat a 

block the activity of the GPCR by modulating the expression of the GPCR gene. The method 
typically includes assaying the ability of the compound to modulate the expression of the GPCR 
nucleic acid and thus identifying a compound that can be used to kill insects by altering GPCR 
nucleic acid expression. The assays can be performed in cell-based and cell-free systems. Cell- 

i o based assays include cells naturally expressing the GPCR nucleic acid or recombinant cells 
genetically engineered to express specific nucleic acid sequences. 

The assay for GPCR nucleic acid expression can involve direct assay of nucleic acid levels, 
such as mRNA levels, or on collateral compounds involved in the signal pathway. Further, the 
expression of genes that are up- or down-regulated in response to the GPCR protein signal pathway 

1 5 can also be assayed. In this embodiment the regulatory regions of these genes can be operably 
linked to a reporter gene such as luciferase. 

Thus, modulators of GPCR gene expression can be identified in a method wherein a cell is 
contacted with a candidate compound and the expression of mRNA determined. The level of 
expression of GPCR mRNA in the presence of the candidate compound is compared to the level of 

20 expression of GPCR mRNA in the absence of the candidate compound. The candidate compound 
can then be identified as a modulator of nucleic acid expression based on this comparison and be 
used, for example to treat a disorder characterized by aberrant nucleic acid expression. When 
expression of mRNA is statistically significantly greater in the presence of the candidate compound 
than in its absence, the candidate compound is identified as a stimulator of nucleic acid expression. 

25 When nucleic acid expression is statistically significantly less in the presence of the candidate 
compound than in its absence, the candidate compound is identified as an inhibitor of nucleic acid 
expression. 

The nucleic acid molecules are also useful in diagnostic assays for qualitative changes in 
GPCR nucleic acid, and particularly in qualitative changes that lead to insecticidal 
30 activity/tolerance. The nucleic acid molecules can be used to detect mutations in GPCR genes and 
gene expression products such as mRNA. 

The nucleic acid molecules are thus useful as antisense constructs to control GPCR gene 
expression in cells, tissues, and organisms. A DNA antisense nucleic acid molecule is designed to 
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be complementary to a region of the gene involved in transcription, preventing transcription and 
hence production of GPCR protein. An antisense RNA or DNA nucleic acid molecule would 
hybridize to the mRNA and thus block translation of mRNA into GPCR protein. 

The invention also encompasses kits for detecting the presence of a GPCR nucleic acid in a 
biological sample. For example, the kit can comprise reagents such as a labeled or labelable nucleic 
acid or agent capable of detecting GPCR nucleic acid in a biological sample; means for determining 
the amount of GPCR nucleic acid in the sample; and means for comparing the amount of GPCR 
nucleic acid in the sample with a standard. The compound or agent can be packaged in a suitable 
container. The kit can further comprise instructions for using the kit to detect GPCR protein mRNA 
or DNA. 

Vectors/host cells 

The invention also provides vectors containing the nucleic acid molecules described herein. 
The term "vector" refers to a vehicle, preferably a nucleic acid molecule, that can transport the 
nucleic acid molecules. When the vector is a nucleic acid molecule, the nucleic acid molecules are 
covalently linked to the vector nucleic acid. With this aspect of the invention, the vector includes a 
plasmid, single or double stranded phage, a single or double stranded RNA or DNA viral vector, or 
artificial chromosome, such as a BAC, PAC, YAC, OR MAC. 

A vector can be maintained in the host cell as an extrachromosomal element where it 
replicates and produces additional copies of the nucleic acid molecules. Alternatively, the vector 
may integrate into the host cell genome and produce additional copies of the nucleic acid molecules 
when the host cell replicates. 

The invention provides vectors for the maintenance (cloning vectors) or vectors for 
expression (expression vectors) of the nucleic acid molecules. The vectors can function in 
procaryotic or eukaryotic cells or in both (shuttle vectors). 

Expression vectors contain cis-acting regulatory regions that are operably linked in the 
vector to the nucleic acid molecules such that transcription of the nucleic acid molecules is allowed 
in a host cell. The nucleic acid molecules can be introduced into the host cell with a separate 
nucleic acid molecule capable of affecting transcription. Thus, the second nucleic acid molecule 
may provide a trans-acting factor interacting with the cis-regulatory control region to allow 
transcription of the nucleic acid molecules from the vector. Alternatively, a trans-acting factor may 
be supplied by the host cell. Finally, a trans-acting factor can be produced from the vector itself. It 
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is understood, however, that in some embodiments, transcription and/or translation of the nucleic 
acid molecules can occur in a cell-free system. 

The regulatory sequence to which the nucleic acid molecules described herein can be 
operably linked include promoters for directing mRNA transcription. These include, but are not 
limited to, the left promoter from bacteriophage X, the lac, TRP, and TAC promoters from E. coli, 
the early and late promoters from SV40, the CMV immediate early promoter, the adenovirus early 
and late promoters, and retrovirus long-terminal repeats. 

In addition to control regions that promote transcription, expression vectors may also 
include regions that modulate transcription, such as repressor binding sites and enhancers. 
Examples include the SV40 enhancer, the cytomegalovirus immediate early enhancer, polyoma 
enhancer, adenovirus enhancers, and retrovirus LTR enhancers. 

In addition to containing sites for transcription initiation and control, expression vectors can 
also contain sequences necessary for transcription termination and, in the transcribed region a 
ribosome binding site for translation. Other regulatory control elements for expression include 
initiation and termination codons as well as polyadenylation signals. The person of ordinary skill in 
the art would be aware of the numerous regulatory sequences that are useful in expression vectors. 
Such regulatory sequences are described, for example, in Sambrook et ah, Molecular Cloning: A 
Laboratory Manual. 2nd. ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
(1989). 

A variety of expression vectors can be used to express a nucleic acid molecule. Such 
vectors include chromosomal, episomal, and virus-derived vectors, for example vectors derived 
from bacterial plasmids, from bacteriophage, from yeast episomes, from yeast chromosomal 
elements, including yeast artificial chromosomes, from viruses such as baculoviruses, 
papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, pseudorabies viruses, and 
retroviruses. Vectors may also be derived from combinations of these sources such as those derived 
from plasmid and bacteriophage genetic elements, e.g. cosmids and phagemids. Appropriate 
cloning and expression vectors for prokaryotic and eukaryotic hosts are described in Sambrook et 
al., Molecular Cloning: A Laboratory Manual. 2nd. ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, (1989). 

The regulatory sequence may provide constitutive expression in one or more host cells (i.e. 
tissue specific) or may provide for inducible expression in one or more cell types such as by 
temperature, nutrient additive, or exogenous factor such as a hormone or other ligand. A variety of 
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vectors providing for constitutive and inducible expression in prokaryotic and eukaryotic hosts are 
well known to those of ordinary skill in the art. 

The nucleic acid molecules can be inserted into the vector nucleic acid by well-known 
methodology. Generally, the DNA sequence that will ultimately be expressed is joined to an 
5 expression vector by cleaving the DNA sequence and the expression vector with one or more 
restriction enzymes and then ligating the fragments together. Procedures for restriction enzyme 
digestion and ligation are well known to those of ordinary skill in the art. 

The vector containing the appropriate nucleic acid molecule can be introduced into an 
appropriate host cell for propagation or expression using well-known techniques. Bacterial cells 

10 include, but are not limited to, E. coli, Streptomyces, and Salmonella typhimurium. Eukaryotic cells 
include, but are not limited to, yeast, insect cells such as Drosophila, animal cells such as COS and 
CHO cells, and plant cells. 

As described herein, it may be desirable to express the protein as a fusion protein. 
Accordingly, the invention provides fusion vectors that allow for the production of the proteins. 

1 5 Fusion vectors can increase the expression of a recombinant protein, increase the solubility of the 
recombinant protein, and aid in the purification of the protein by acting for example as a ligand for 
affinity purification. A proteolytic cleavage site may be introduced at the junction of the fusion 
moiety so that the desired peptide can ultimately be separated from the fusion moiety. Proteolytic 
enzymes include, but are not limited to, factor Xa, thrombin, and enterokinase. Typical fusion 

20 expression vectors include pGEX (Smith et ai, Gene 67:2 1 -40 (1 988)), pMAL (New England 
Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant 
protein. Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amann 
et ai, Gene (59:301-315 (1988)) and pET 1 Id (Studier et al., Gene Expression Technology: Methods 

25 in Enzymology 755:60-89 ( 1 990)). 

Recombinant protein expression can be maximized in a host bacteria by providing a genetic 
background wherein the host cell has an impaired capacity to proteolytically cleave the recombinant 
protein. (Gottesman, S., Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, California (1990) 1 19-128). Alternatively, the sequence of the nucleic acid 

30 molecule of interest can be altered to provide preferential codon usage for a specific host cell, for 
example E. coli. (Wada et ai, Nucleic Acids Res. 20:21 11-2118 (1 992)). 

The nucleic acid molecules can also be expressed by expression vectors that are operative in 
yeast. Examples of vectors for expression in yeast e.g., S. cerevisiae include pYepSecl (Baldari, et 
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al., EM BO J. 6:229-234 (1987)), pMFa (Kurjan et al., Cell 30:933-943(1982)), pJRY88 (Schultz el 
al, Gene 54:\ 13-123 (1987)), andpYES2 (Invitrogen Corporation, San Diego, CA). 

The nucleic acid molecules can also be expressed in insect cells using, for example, 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured 
5 insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al., Mol. Cell Biol. 5:2156-2165 
( 1 983)) and the pVL series (Lucklow et al. , Virology 1 70:3 1-39(1 989)). 

In certain embodiments of the invention, the nucleic acid molecules described herein are 
expressed in mammalian cells using mammalian expression vectors. Examples of mammalian 
expression vectors include pCDM8 (Seed, B. Nature 329:840(1987)) and pMT2PC (Kaufman et al., 

10 EMBOJ. (5:187-195 (1987)). 

The expression vectors listed herein are provided by way of example only of the well- 
known vectors available to those of ordinary skill in the art that would be useful to express the 
nucleic acid molecules. The person of ordinary skill in the art would be aware of other vectors 
suitable for maintenance propagation or expression of the nucleic acid molecules described herein. 

1 5 These are found for example in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A 
Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989. 

The invention also encompasses vectors in which the nucleic acid sequences described 
herein are cloned into the vector in reverse orientation, but operably linked to a regulatory sequence 

20 that permits transcription of antisense RNA. Thus, an antisense transcript can be produced to all, or 
to a portion, of the nucleic acid molecule sequences described herein, including both coding and 
non-coding regions. Expression of this antisense RNA is subject to each of the parameters 
described above in relation to expression of the sense RNA (regulatory sequences, constitutive or 
inducible expression, tissue-specific expression). 

25 The invention also relates to recombinant host cells containing the vectors described herein. 

Host cells therefore include prokaryotic cells, lower eukaryotic cells such as yeast, other eukaryotic 
cells such as insect cells, and higher eukaryotic cells such as mammalian cells. 

The recombinant host cells are prepared by introducing the vector constructs described 
herein into the cells by techniques readily available to the person of ordinary skill in the art. These 

30 include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated 
transfection, cationic lipid-mediated transfection, electroporation, transduction, infection, 
lipofection, and other techniques such as those found in Sambrook, et al. {Molecular Cloning: A 
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Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989). 

Host cells can contain more than one vector. Thus, different nucleotide sequences can be 
introduced on different vectors of the same cell. Similarly, the nucleic acid molecules can be 
introduced either alone or with other nucleic acid molecules that are not related to the nucleic acid 
molecules such as those providing trans-acting factors for expression vectors. When more than one 
vector is introduced into a cell, the vectors can be introduced independently, co-introduced or joined 
to the nucleic acid molecule vector. 

In the case of bacteriophage and viral vectors, these can be introduced into cells as packaged 
or encapsulated virus by standard procedures for infection and transduction. Viral vectors can be 
replication-competent or replication-defective. In the case in which viral replication is defective, 
replication will occur in host cells providing functions that complement the defects. 

Vectors generally include selectable markers that enable the selection of the subpopulation 
of cells that contain the recombinant vector constructs. The marker can be contained in the same 
vector that contains the nucleic acid molecules described herein or may be on a separate vector. 
Markers include tetracycline or ampicillin-resistance genes for prokaryotic host cells and 
dihydrofolate reductase or neomycin resistance for eukaryotic host cells. However, any marker that 
provides selection for a phenotypic trait will be effective. 

While the mature proteins can be produced in bacteria, yeast, mammalian cells, and other 
cells under the control of the appropriate regulatory sequences, cell- free transcription and 
translation systems can also be used to produce these proteins using RNA derived from the DNA 
constructs described herein. 

Where secretion of the protein is desired, which is difficult to achieve with multi- 
transmembrane domain containing proteins such as GPCRs, appropriate secretion signals are 
incorporated into the vector. The signal sequence can be endogenous to the proteins or 
heterologous to these proteins. 

Where the protein is not secreted into the medium, which is typically the case with GPCRs, 
the protein can be isolated from the host cell by standard disruption procedures, including freeze 
thaw, sonication, mechanical disruption, use of lysing agents and the like. The protein can then be 
recovered and purified by well-known purification methods including ammonium sulfate 
precipitation, acid extraction, anion or cationic exchange chromatography, phosphocellulose 
chromatography, hydrophobic-interaction chromatography, affinity chromatography, 
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hydroxylapatite chromatography, lectin chromatography, or high performance liquid 
chromatography. 

It is also understood that depending upon the host cell in recombinant production of the 
proteins described herein, the proteins can have various glycosylation patterns, depending upon the 
5 cell, or maybe non-glycosylated as when produced in bacteria. In addition, the proteins may include 
an initial modified methionine in some cases as a result of a host-mediated process. 

Nucleic Acid Arrays 

The present invention further provides arrays or microarrays of nucleic acid molecules 
10 that are based on the sequence information provided in the Figure Sheets SEQ ID NOs: 1, 4, 7, 
. . . , 190, 193, 196 (genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript 
sequences)). 

As used herein "Arrays" or "Microarrays" refers to an array of distinct polynucleotides or 
oligonucleotides synthesized on a substrate, such as paper, nylon or other type of membrane, 

15 filter, chip, glass slide, or any other suitable solid support. In one embodiment, the microarray is 
prepared and used according to the methods described in US Patent 5,837,832, Chee et al., PCT 
application W095/1 1995 (Chee et al.), Lockhart, D. J. et al. (1996; Nat. Biotech. 14: 1675-1680) 
and Schena, M. et al. (1996; Proc. Natl. Acad. Sci. 93: 10614-10619), all of which are 
incorporated herein in their entirety by reference. In other embodiments, such arrays are 

20 produced by the methods described by Brown et. al., US Patent No. 5,807,522. 

The microarray is preferably composed of a large number of unique, single-stranded 
nucleic acid sequences, usually either synthetic antisense oligonucleotides or fragments of 
cDNAs, fixed to a solid support. The oligonucleotides are preferably about 6-60 nucleotides in 
length, more preferably 1 5-30 nucleotides in length, and most preferably about 20-25 nucleotides 

25 in length. For a certain type of microarray, it may be preferable to use oligonucleotides that are 
only 7-20 nucleotides in length. The microarray may contain oligonucleotides that cover the 
known 5', or 3', sequence, sequential oligonucleotides which cover the full length sequence; or 
unique oligonucleotides selected from particular areas along the length of the sequence. 
Polynucleotides used in the microarray may be oligonucleotides that are specific to a gene or 

30 genes of interest. 

In order to produce oligonucleotides to a known sequence for a microarray, the gene(s) of 
interest (or an ORF identified from the contigs of the present invention) is typically examined 
using a computer algorithm which starts at the 5' or at the 3' end of the nucleotide sequence. 
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Typical algorithms will then identify oligomers of defined length that are unique to the gene, 
have a GC content within a range suitable for hybridization, and lack predicted secondary 
structure that may interfere with hybridization. In certain situations it may be appropriate to use 
pairs of oligonucleotides on a microarray. The "pairs" will be identical, except for one 
5 nucleotide that preferably is located in the center of the sequence. The second oligonucleotide in 
the pair (mismatched by one) serves as a control. The number of oligonucleotide pairs may 
range from two to one million. The oligomers are synthesized at designated areas on a substrate 
using a light-directed chemical process. The substrate may be paper, nylon or other type of 
membrane, filter, chip, glass slide or any other suitable solid support. 

10 In another aspect, an oligonucleotide may be synthesized on the surface of the substrate 

by using a chemical coupling procedure and an ink jet application apparatus, as described in PCT 
application W095/25 1116 (Baldeschweiler et al.) which is incorporated herein in its entirety by 
reference. In another aspect, a "gridded" array analogous to a dot (or slot) blot may be used to 
arrange and link cDNA fragments or oligonucleotides to the surface of a substrate using a 

1 5 vacuum system, thermal, UV, mechanical or chemical bonding procedures. An array, such as 
those described above, may be produced by hand or by using available devices (slot blot or dot 
blot apparatus), materials (any suitable solid support), and machines (including robotic 
instruments), and may contain 8, 24, 96, 384, 1536, 6144 or more oligonucleotides, or any other 
number between two and one million which lends itself to the efficient use of commercially 

20 available instrumentation. 

In order to conduct sample analysis using a microarray, the RNA or DNA from a 
biological sample is made into hybridization probes. The mRNA is isolated, and cDNA is 
produced and used as a template to make anti sense RNA (aRNA). The aRNA is amplified in the 
presence of fluorescent nucleotides, and labeled probes are incubated with the microarray so that 

25 the probe sequences hybridize to complementary oligonucleotides of the microarray. Incubation 
conditions are adjusted so that hybridization occurs with precise complementary matches or with 
various degrees of less complementarity. After removal of nonhybridized probes, a scanner is 
used to determine the levels and patterns of fluorescence. The scanned images are examined to 
determine degree of complementarity and the relative abundance of each oligonucleotide 

30 sequence on the microarray. The biological samples may be obtained from any bodily fluids 
(such as blood, urine, saliva, phlegm, gastric juices, etc.), cultured cells, biopsies, or other tissue 
preparations. A detection system may be used to measure the absence, presence, and amount of 
hybridization for all of the distinct sequences simultaneously. This data may be used for large 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 



PCT/US01/09341 



scale correlation studies on the sequences, expression patterns, mutations, variants, or 
polymorphisms among samples. 

Using such arrays, the present invention provides methods to identify the expression of 
one or more of the proteins/peptides of the present invention. In detail, such methods comprise 
incubating a test sample with one or more nucleic acid molecules and assaying for binding of the 
nucleic acid molecule with components within the test sample. Such assays will typically 
involve arrays comprising many genes, at least one of which is a gene of the present invention. 

Conditions for incubating a nucleic acid molecule with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, and the 
type and nature of the nucleic acid molecule used in the assay. One skilled in the art will 
recognize that any one of the commonly available hybridization, amplification or array assay 
formats can readily be adapted to employ the novel fragments of the Drosophila genome 
disclosed herein. Examples of such assays can be found in Chard, T, An Introduction to 
Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The 
Netherlands ( 1 986); Bullock, G. R. etal., Techniques in Immunocytochemistry, Academic 
Press, Orlando, FL Vol. 1 (1 982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include cells, protein or membrane extracts of 
cells. The test sample used in the above-described method will vary based on the assay format, 
nature of the detection method and the tissues, cells or extracts used as the sample to be assayed. 
Methods for preparing nucleic acid extracts or of cells are well known in the art and can be 
readily be adapted in order to obtain a sample that is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one or more containers which comprises: (a) a first container comprising one of the 
nucleic acid molecules that can bind to a fragment of the Drosophila genome disclosed herein; 
and (b) one or more other containers comprising one or more of the following: wash reagents, 
reagents capable of detecting presence of a bound nucleic acid. Preferred kits will include chips 
that are capable of detecting the expression of 1 0 or more, or 50 or more, or all of the GPCR 
genes expressed in Drosophila. 
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In detail, a compartmentalized kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers, strips of 
plastic, glass or paper, or arraying material such as silica. Such containers allows one to 
efficiently transfer reagents from one compartment to another compartment such that the 
5 samples and reagents are not cross-contaminated, and the agents or solutions of each container 
can be added in a quantitative fashion from one compartment to another. Such containers will 
include a container which will accept the test sample, a container which contains the nucleic acid 
probe, containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, 
etc.), and containers which contain the reagents used to detect the bound probe. One skilled in 
10 the art will readily recognize that the previously unidentified GPCR genes of the present 
invention can be routinely identified using the sequence information disclosed herein can be 
readily incorporated into one of the established kit formats which are well known in the art, 
particularly expression arrays. 

15 Uses of vectors and host cells 

The recombinant host cells expressing the proteins described herein have a variety of uses. 

First, the cells are useful for producing a GPCR protein or protein that can be further purified to 

produce desired amounts of GPCR protein or fragments. Thus, host cells containing expression 

vectors are useful for protein production. 
20 Host cells are also useful for conducting cell-based assays involving the GPCR protein or 

GPCR protein fragments, such as those described above as well as other formats known in the art. 

Thus, a recombinant host cell expressing a native GPCR protein is useful for assaying compounds 

that stimulate or inhibit GPCR protein function. 

Host cells are also useful for identifying GPCR protein mutants in which these functions are 
25 affected. If the mutants naturally occur and give rise to a pathology, host cells containing the 

mutations are useful to assay compounds that have a desired effect on the mutant GPCR protein (for 

example, stimulating or inhibiting function) which may not be indicated by their effect on the native 

GPCR protein. 

Genetically engineered host cells can be further used to produce non-human transgenic 
30 animals. A transgenic animal is preferably a mammal, for example a rodent, such as a rat or mouse, 
in which one or more of the cells of the animal include a transgene. A transgene is exogenous DNA 
which is integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal in one or more cell types or tissues of the transgenic 
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animal. These animals are useful for studying the function of a GPCR protein and identifying and 
evaluating modulators of GPCR protein activity. Other examples of transgenic animals include 
non-human primates, sheep, dogs, cows, goats, chickens, and amphibians. 

A transgenic animal can be produced by introducing nucleic acid into the male pronuclei of 

5 a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to develop 
in a pseudopregnant female foster animal. Any of the GPCR protein nucleotide sequences can be 
introduced as a transgene into the genome of a non-human animal, such as a mouse. 

Any of the regulatory or other sequences useful in expression vectors can form part of the 
transgenic sequence. This includes intronic sequences and polyadenylation signals, if not already 

10 included. A tissue-specific regulatory sequence(s) can be operably linked to the transgene to direct 
expression of the GPCR protein to particular cells. 

Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S. Patent No. 

1 5 4,873, 1 9 1 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified based upon 
the presence of the transgene in its genome and/or expression of transgenic mRNA in tissues or 
cells of the animals. A transgenic founder animal can then be used to breed additional animals 

20 carrying the transgene. Moreover, transgenic animals carrying a transgene can further be bred to 
other transgenic animals carrying other transgenes. A transgenic animal also includes animals in 
which the entire animal or tissues in the animal have been produced using the homologously 
recombinant host cells described herein. 

In another embodiment, transgenic non-human animals can be produced which contain 

25 selected systems which allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI . For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al. PNAS 59:6232-6236 (1992). Another example of a 
recombinase system is the FLP recombinase system of S. cerevisiae (O'Gorman et al. Science 
257:1351-1355 (1991). If a cre/loxP recombinase system is used to regulate expression of the 

30 transgene, animals containing transgenes encoding both the Cre recombinase and a selected protein 
is required. Such animals can be provided through the construction of "double" transgenic animals, 
e.g., by mating two transgenic animals, one containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 
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Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, I. et al. Nature 555:810-813 (1997) and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a somatic cell, 
from the transgenic animal can be isolated and induced to exit the growth cycle and enter G 0 phase. 
The quiescent cell can then be fused, e.g., through the use of electrical pulses, to an enucleated 
oocyte from an animal of the same species from which the quiescent cell is isolated. The 
reconstructed oocyte is then cultured such that it develops to morula or blastocyst and then 
transferred to pseudopregnant female foster animal. The offspring born of this female foster animal 
will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated. 

Transgenic animals containing recombinant cells that express the proteins described herein 
are useful to conduct the assays described herein in an in vivo context. Accordingly, the various 
physiological factors that are present in vivo and that could effect ligand binding, GPCR protein 
activation, and signal transduction, may not be evident from in vitro cell-free or cell-based assays. 
Accordingly, it is useful to provide non-human transgenic animals to assay in vivo GPCR protein 
function, including ligand interaction, the effect of specific mutant GPCR proteins on GPCR protein 
function and ligand interaction, and the effect of chimeric GPCR proteins. It is also possible to 
assess the effect of null mutations, that is mutations that substantially or completely eliminate one or 
more GPCR protein functions. 

All publications and patents mentioned in the above specification are herein incorporated 
by reference. Various modifications and variations of the described method and system of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit 
of the invention. Although the invention has been described in connection with specific 
preferred embodiments, it should be understood that the invention as claimed should not be 
unduly limited to such specific embodiments. Indeed, various modifications of the above- 
described modes for carrying out the invention which are obvious to those skilled in the field of 
molecular biology or related fields are intended to be within the scope of the following claims. 
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Table 1 



Gene Transcript F W V 

Similar to E Value 
A. Rhodopsin-like receptor family 

Opsin-like 

CGS638 CTI7820 Rh3 c-57 e-14 e-44 

Receptors for biogenic amines and related compounds 
5-HT receptor-like 



CG8007 CT24060 


5-HT2 


e-21 


e-35 


e-27 


Muscarinic Acetylcholine receptor-like 








CG7918 CT23924 


mAcR-60C 


e-32 


e-49 


e-41 


Unclassifiable biogenic amine receptor-like 






CG17004 CT37739 


5-HT7 


e-18 


e-20 


e-28 


CG7431 CT22855 


CGI 6766 


e-70 


e-45 


c-37 


CGI 6766 CT37292 


CG743I 


e-53 


e-31 


e-22 


CGI 2796 CT38338 


CG6919 


e-23 


e-20 


e-30 


CG69I9 CT2I432 


CG6989 


e-82 


e-46 


e-57 


CG6989 CT2I650 


CG69I9 


e-82 


e-39 


e-46 


CG7078 CT2I843 


CG6919 


e-73 


e-46 


e-38 


CGI 83 14 CT4I076 


CG6919 


e-21 


e-48 


e-25 


CG7994 CT24036 




>e-IO 


>e-10 


e-10 


Purine receptors 










Adenosine receptor-like 










CG9753 CT27563 


CG6989 


e-19 


e-29 


e-42 


Peptide receptors 










Allatostatin receptor-like 










CGI 0001 CT28I87 


EG:121E7.2 


e-62 


e-35 


e-34 


FSH/TSHyLH receptor-like 










CG4I87 CT 13764 


CG5042 


e-48 


e-23 


e-24 


CG5042 CTI6185 


CG4I87 


e-44 


c-25 


e-23 


Gastrin/CCK receptor-like 










CG6857 CT2I155 


CG688I 


e-96 


e-33 


e-19 


CG6881 CT21314 


CG6857 


e-96 


e-23 


e-33 


Gonadotropin releasing hormone receptor-like 






CGI 0698 CT29989 


GRHR 


e-43 


e-34 


e-39 


Growth hormone secretagoge receptor-like 






CG8784 CT25324 


CG8795 


c-165 


e-38 


e-35 


CG8795 CT25350 


CG8784 


e-165 


e-40 


e-37 


CG99I8 CT27924 


CG8795 


e-83 


e-53 


e-33 


Tachykinin receptor-like 










CGI 147 CTI960 


NepYr 


e-31 


c-43 


e40 


CGI 0626 CT29768 


Takr99D 


e-53 


e-41 


e-44 


Somatostatin receptor-like 










CG7285 CT22465 


CGI 3702 


e-96 


e-24 


e-51 


CGI 3702 CT33I59 


CG7285 


e-96 


e-26 


e-46 


Vasopressin receptor-like 










CG6III CT19I9I 


GRHR 


e-36 


e-26 


e-45 


Unclnssifiablc peptide receptors 








CGI 3575 CT32957 


CGI 0626 


e-08 


e-08 


c-8# 


CGI 4003 CT33559 


Takr99D 


e-14 


e-12 


e-14« 


CG59II CT 18539 


Takr86C 


c-18 


e-19 


e-26fl 


CG10823 CTI89I6 


CG6857 


e-10 


e-10 


e-12 



Gene Transcript F 

Similar to 

Orphan receptors 

Related to C. elcgans orphan receptors 



W V 
E Value 



CG2II4 


CT2366 




>e-10 


e-27 


e-10 


CG3171 


CT 10621 


EG:22E5.10 


e-50 


e-16 


e-26 


CG5936 


CT18637 




>e-IO 


e-18 


>e-!0 


CG8985 


CT25824 


CGI 3803 


e-162 


e-31 


>e-IO 


CGI 3803 


CT33298 


CG8985 


e-162 


e-36 


>e-IO 


CGI 3229 


CT32473 


CGI 3803 


e-58 


e-35 


c-10 


Other orphan receptors 










CG9569 


CT 17758 




>e-10 


>e-10 


>e-10 


CGI 2290 


CT 19320 




>e-10 


>e-IO 


>e-10 


CG6986 


CT2I642 


CGI 6726 


e-10 


>e-10 


>e-IO 


CGI 3579 


CT32961 




>e-IO 


e-18 


>e-IO 


CGI 3995 


CT33551 




>e-10 


e-13 


e-ll 


CG7497 


CT23019 




>e-10 


>e-10 


>e-10 


Calcitonin receptor-like 










CG4395 


CT4I2I 


CGI7415 


e-16 


e-I6 


e-41 


CG174I5 


CT38445 


CG4395 


e-42 


e-20 


e-61 


CG13758 


CT33238 


CG8422 


e-40# 


e-65 


e-66 


! Diuretic hormone receptor-like 










CG8422 


CT245I3 


CGI 2370 


e-122 


e-25 


e-65 


CGI 2370 


CT24959 


CG8422 


e-122 


e-30 


e-101 


HE6 receptor-like 












CGI 1318 


CT3I591 


CGI 5556 


e-104 


e-ll 


c-20 


CGI 5556 


CT35672 


CGI 1318 


e-103 


>e-ll 


e-6 


Latrophilin-like 












CG8639 


CT875S 




>e-10 


e-47 


e-45 


Methuselah-like 












CG452I 


CTI4539 


CG6965 


e-30 


e-13 




CGI 7795 


CT 16507 


mth 


e-132 


>e-IO 


>e-IO 


CG6530 


CT20339 


CG6536 


e-104 


>c-IO 


>e-10 


CG6536 


CT2035I 


CG6530 


e-115 


>e-IO 




CG6965 


CT2I585 


CGI 7795 


e-12 


>e-IO 


e-10 


CG7476 


CT22963 


mth 


e-66 


>e-IO 


>e-10 


CGI 3406 


CT32762 


CGI 7084 


e-47 


>e-10 


>e-I0 


CGI 7084 


CT334I4 


CGI 3406 


e-47 


>e-IO 


>e-IO 


CG1706I 


CT334I5 


mth 


e-80 


>e-IO 


>e-10 


CGI 6992 


CT377I5 


mth 


e-32 


>e-10 


>e-IO 



C. Metabotropic glutamate receptor family 

CABA-B receptor-like 

CG3022 CT9836 CG6706 e-96 e-56 e-85 

CG6706 CT20836 CG3022 e-120 e-61 e-169 

CGI 5274 CT35221 CG3022 e-49 e-156 e-129 



Metabotropic glutamate receptor-like 

CG8692 CT5032 Glu-RA 



Other 

CG7155 
CGI 1923 



CT22II7 

CT35779 CGI 72 15 



D. Atypical 7 TM proteins 
Frizzled-like 

CG4626 CT9057 



e-152 e-I15 e-157 



>e-IO >e-10 >e-!0 
e-17 >e-10 e-17 



e-36 e-54 e-76 
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1 . An isolated protein consisting of an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence shown in SEQ IDNOs: 3, 6, 9, ... , 192, 195, 198; 

(b) the amino acid sequence of an allelic variant of the amino acid sequence 
shown in SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said allelic variant is encoded by a 
nucleic acid molecule that hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1, 4, 7, . 
. . , 795, 798, 801 (genomic sequences); SEQ ID NOs: 2, 5, 8, . . . , 796, 799, 802 (transcript 
sequences), under stringent conditions; 

(c) the amino acid sequence of an ortholog of an amino acid sequence shown in 
SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said ortholog is encoded by a nucleic acid 
molecule that hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1, 4, 7, . . . , 795, 798, 
801 (genomic sequences); SEQ ID NOs: 2, 5, 8, . . . , 796, 799, 802 (transcript sequences), under 
stringent condition; 

(d) a fragment of the amino acid sequence shown in SEQ ID NOs: 3, 6, 9, . . . , 
192, 195, 198, wherein the fragment comprises at least 10 contiguous amino acids. 

2. An isolated protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence shown in SEQ ID NOs: 3, 6, 9, ... , 192, 195, 198; 

(b) the amino acid sequence of an allelic variant of the amino acid sequence 
shown in SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said allelic variant is encoded by a 
nucleic acid molecule that hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1, 4, 7, . 
. . , 795, 798, 801 (genomic sequences); SEQ ID NOs: 2, 5, 8, . . . , 796, 799, 802 (transcript 
sequences), under stringent conditions; 

(c) the amino acid sequence of an ortholog of an amino acid sequence shown in 
the SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said ortholog is encoded by a nucleic acid 
molecule that hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1 , 4, 7, . . . , 795, 798, 
801 (genomic sequences); SEQ ID NOs: 2, 5, 8, . . . , 796, 799, 802 (transcript sequences), under 
stringent condition; 
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(d) a fragment of the amino acid sequence shown in the SEQ ID NOs: 3, 6, 9, 
. . . , 192, 195, 198, wherein the fragment comprises at least 10 contiguous amino acids. 

3. An isolated antibody that selectively binds to a protein of claim 1 . 

4. An isolated nucleic acid molecule consisting of a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence that encodes the amino acid sequence shown in SEQ 
ID NOs: 3, 6, 9, . . . , 192, 195, 198; 

(b) a nucleotide sequence that encodes of an allelic variant of the amino acid 
sequence shown in SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said nucleic acid molecule 
hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1 , 4, 7, . . . , 1 90, 1 93, 1 96 
(genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)), under 
stringent conditions; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence 
shown in SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said nucleic acid molecule hybridizes 
to the nucleic acid molecule shown in SEQ ID NOs: 1,4,7, . . . , 1 90, 1 93, 1 96 (genomic 
sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)), under stringent 
condition; 

(d) a nucleotide sequence that encodes a fragment of the amino acid sequence 
shown in the SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said the fragment comprises at 
least 10 contiguous amino acids; and 

(e) A nucleic acid molecule that is the complement of a nucleic acid molecule 

of(a)-(d). 

5. An isolated nucleic acid molecule comprising a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence that encodes the amino acid sequence shown in SEQ 
ID NOs: 3, 6, 9, . . . , 192, 195, 198; 

(b) a nucleotide sequence that encodes of an allelic variant of the amino acid 
sequence shown in SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said nucleic acid molecule 
hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 
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(genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)), under 
stringent conditions; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence 
shown in the SEQ ID NOs: 3, 6, 9, . . . , 192, 195, 198, wherein said nucleic acid molecule 
hybridizes to the nucleic acid molecule shown in SEQ ID NOs: 1, 4, 7, . . . , 190, 193, 196 
(genomic sequences) or SEQ ID NOs: 2, 5, 8, . . . , 191, 194, 197 (transcript sequences)), under 
stringent condition; 

(d) a nucleotide sequence that encodes a fragment of the amino acid sequence 
shown in the SEQ ID NOs: 3,6,9, . . . , 1 92, 1 95, 1 98, wherein said the fragment comprises at 
least 10 contiguous amino acids; and 

(f) A nucleic acid molecule that is the complement of a nucleic acid molecule 

of(a)-(d). 

6. A nucleic acid vector comprising the nucleic acid sequences of claim 4. 

7. A nucleic acid vector comprising the nucleic acid sequences of claim 5. 

8. A host cell containing the vector of claim 6. 

9. A host cell containing the vector of claim 7. 

10. A method for producing any of the proteins of claim 1 comprising introducing a 
nucleotide sequence encoding any of the protein sequences in (a)-(d) into a host cell, and culturing 
the host cell under conditions in which the proteins are expressed from the nucleic acid. 

11. A method for producing any of the proteins of claim 2 comprising introducing a 
nucleotide sequence encoding any of the protein sequences in (a)-(d) into a host cell, and culturing 
the host cell under conditions in which the proteins are expressed from the nucleic acid. 

12. A method for detecting the presence of any of the proteins of claim 1 in a sample, 
said method comprising contacting said sample with an agent that specifically allows detection of 
the presence of the protein in the sample and then detecting the presence of the protein. 
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13. A method for detecting the presence of any of the proteins of claim 2 in a sample, 
said method comprising contacting said sample with an agent that specifically allows detection of 
the presence of the protein in the sample and then detecting the presence of the protein. 

14. A kit comprising reagents used for the method of claim 12, wherein the reagents 
comprise an agent that specifically binds to said protein. 

15. A method for detecting the presence of a nucleic acid sequence of claim 4 in a 
sample, the method comprising contacting the sample with an oligonucleotide that hybridizes to the 
nucleic acid sequences under stringent conditions and determining whether the oligonucleotide 
binds to the nucleic acid sequence in the sample. 

16. A method for detecting the presence of a nucleic acid sequence of claim 5 in a 
sample, the method comprising contacting the sample with an oligonucleotide that hybridizes to the 
nucleic acid sequences under stringent conditions and determining whether the oligonucleotide 
binds to the nucleic acid sequence in the sample. 

17. A kit comprising reagents used for the method of claim 1 5, wherein the reagents 
comprise a compound that hybridizes under stringent conditions to any of the nucleic acid 
molecules. 

18. A kit comprising reagents used for the method of claim 16, wherein the reagents 
comprise a compound that hybridizes under stringent conditions to any of the nucleic acid 
molecules. 

1 9. A method for identifying an agent that binds to any of the proteins of claim 1 , said 
method comprising contacting the protein with an agent and assaying the contacted mixture to 
determine whether a complex is formed with the agent bound to the protein. 

20. A method for identifying an agent that binds to any of the proteins of claim 2, said 
method comprising contacting the protein with an agent and assaying the contacted mixture to 
determine whether a complex is formed with the agent bound to the protein. 
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SEQUENCE LISTING 

<110> CravcmHEi: Anibal 

<120> ISOLATED G- PROTEIN COUPLED RECEPTORS, 

NUCLEIC ACID MOLECULES ENCODING GPCR PROTEINS, AND USES 
THEREOF AS INSECTICIDAL TARGETS 

<130> CL000733PCT 

<150> 60/191,638 
<151> 2000-03-23 

<160> 198 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 5236 
<212> DNA 
<213> Drosophila 

<400> 1 

caaaaactaa aacaacaaaa agccggcaag caatggtata caaaataaat aaaaaaaaat 60 
atgacaagag aaaatccaca aaaaaatgaa ataaaattga atgaaaaatt taccaaaaac 120 
caagttgttt ttattccaag tgtatgtagt tggcaatcaa acgattcaac tgcagttaac 180 
atgatttatt cgctacgtaa gtgtgcttgg ctaagttgaa taaatatcat taatatctgt 240 
cagctacatt aaaggttcaa acattacata tatatatatc tatatattta tcttaacatt 300 
attaaatgtt taagggctta aaaaaaaata tccttaggta cacgggtacg tatgtattat 360 
tctgcgtctc tttctatatc ctctcaggat ttgaatatat tttgtattat aaaatattac 420 
gcataaaatc atcgggggga acctgttaat tcttggcatt tgattcccgc attttctctg 480 
gatatggcat agatattttt ctagggttag ctgtgtgaaa cctaaatatt tttaagttaa 540 
atttaaacaa cgatttatac ggctgacaca ttttaaattt ttctgggttc aagcttagat 600 
tttgtttaca tgtttttttc ttaatgctta acactatgct ttatatactt tttcaaaata 660 
tattttaaaa cttgttcagt aaaaagtttt aaagtgataa ttcttaacag agttttttta 720 
gaaacttttt ttttagaaag atcaagcata caattatttg tttttatcaa taattcatat 780 
gactagaacc tttcttttct aaatgcattt cgggcatagt atcccaggat aaaaaatata 840 
acaggaacat tttaaataat tgaaatcagt atgcatttta acaaaattaa gttataaaaa 900 
aaaatatttt gctatcatga aataagtgtg ccggactggt cttcaaaatc tttgctcggc 960 
gatgcggtca atgttacgat cgaccctcgc gcggtacgcc ttaaacgccc aactctcggc 1020 
tgggcgtggt cgtggccatc tgtccagtat tactcccggc cggcggcggg gtcactaaac 1080 
tgggcagcaa cttgttcagc agcgtgcgaa tggcaaaggt gacatcgtga ttggcgaagc 1140 
agaacaggaa ggagaccaca aagccctgca ggctgaccaa caccaccgat agcatctgat 1200 
agaagtggtc cagctgggtg ccggcatccg gacggtaggg cagcaggaag tgttgcagtc 1260 
cgaacagcgg taccagaata atggtggccc gaaccgcctt ccggatggcc aacggtgccg 1320 
gctgggcgga ctggggatgt aactttcgca cgatcactcg cagtacgttg atcagaaaaa 1380 
ctgttgggta aaatgcatat agctgagcaa tattgtactt ttaaagaaaa ttatgcatat 1440 
tactaaagct ggccagcagc gaaagagtta tgggcaccga gaagatccac aagtagagac 1500 
tgtcagttat ccagcagctg caataatgtg aaaagaaatg ttaccatgga tatagcgtag 1560 
aatgtaatct cttaataaat gcaatgcaaa aaccaaatca atcaggatta tgtcttgtgt 1620 
aaaattgtga aaataactta ctgcttattg tccggactac tgaaatgcct ggccagaccg 1680 
tagacgatgg ctatgggtat gggtgaaaac cagctgatga caataaacca gcgcatcaca 1740 
atcgtgtcct tgacaaagac ctggcgaatg gaaagaagaa ggggcgtcag gatctgaacg 1800 
gacattcctg acatctcaac ttgacgcaga cgtaaagcag ttggccaacg ttgcgatttg 1860 
ctctcagaca aattgctggg ggtgaaccat tgccatgtca gcggctgcag gaaatcacat 1920 
ctgccatgca cagaagtcgc ccataaagaa tgcagacttg tgcaggcaat tgccagattg 1980 
ccagacgagt aaggataact caccacaacg agcaccaggt gcaggtgcag gccctcgcaa 2040 
aacatccaga aatagttgac caacataaag tagtgcacca ccagatgcag cccaatgcac 2100 
cacagctgaa agttagttat gtttatggtt ttgtataatt gcctagctac ttgtagggct 2160 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 PCTAJS01/09341 



atgttatttt catagttact tttctcagtg catcttaccg gattttcggc tatggtttcg 2220 

gaacgctcca ccaccagacg ataccataat atccaggcca cacaggtgca ggccagcgag 2280 

gcgaacagat gtacatggat gcggatgcgg gtgcagcgca gggatctgca aacggaatgg 2340 

aacggaacgg aaagaaacga aaagggatat ccggccgtca gttaccggag ggaaaggaac 2400 

cgtgtgggtc gtagacaaac acttacttga agcccaagaa aataataatc gatatgagca 2460 

gggccagcag ggagagggca taacccttca cgtacagctc gttgatgaac tggcgaaact 2520 

gcagtggcaa ctttcttgat taatgcccat gtaccacccg cttgatgccg ccgaagcact 2580 

tacctccagg tcctcatagt ccacgcagtt ggtgtaattg gaccaggtct gattgctaac 2640 

cggatgacgg taccaggagc cgttctccag gcagctatga atgcgaaatg cttacgataa 2700 

ttgtgactgg ctaagtgcgg actgatgggt aaagggggta tttctaacaa agcgcctgac 2760 

aatgcacaag ttacccaact ttaccaggag ccagctgagc tcagttggcg tgtaaagtga 2820 

gatgaacgct atccattaaa ttggagatac atacatgtag tgcaagtttt agagtgctgc 2880 

tacagtacac ttagggagat tatctctgca agagttgaag aaggttttat ttttctttca 2940 

ttttccatca tctttttctg agatatgctg cgagtttatg taatagacac gaatgtaaac 3000 

tctataggat tttccctgtt tgtagcgcac tcatacataa acattgtatt ttatgacaca 3060 

cagaacacat taaagcaaat gccgaatatg gcaagatatt ttcgaattta tgctcaaacg 3120 

aatattacat attgcagagc ttatttgacc atgcgaaatc ctgaacggaa tgaaaatcta 3180 

tcctattcgt ggtggcgtaa tatacaggca tattgggcat ttaactcacg tcttgtgggc 3240 

cagaaatttt ctgttgaatc cttcgacgaa atcgggacaa tattgactca gcacagtgcc 3300 

ggccggagtc cttggccagc agagatatcc gtcaaagttc agggggcaaa agagtcggtt 3360 

ctccttctga tttgtggcca tcgtggcact gacgacggtg gccatcagct cctccttctc 3420 

cgccagctcc tcgtccacgt ttcgcatgtc aataacgccc tcgttcatcg cggcattctc 3480 

cagatatcgc ggcactgggc taaagttctc cggcaaatcg gaatcgtagt ctgtgcacgg 3540 

cgacaagtgc cattaaaatg ggtttacttt ttaatgtata aaaatatagc atttttaata 3600 

atagtatcta tgggctgcat aactgaaagt gatgtaattc agttttcaag ttcaattcaa 3660 

ttgcatgctt gctttacata tgtaaaataa agtggtattt caaatgtaca taacttgctt 3720 

aagatagtta atagatcatc gttctttata gcagggtata attcataaat tctggttgaa 3780 

aatatatgta catttggtaa aggtgttgag tgctatttaa atttataaat ttgtcctgag 3840 

atgccaatga tttgcattga aacgaaaacc catgtttaaa tttgcatccc caagaagtaa 3900 

gtcaatgatt ttgtatatat ttaaatacat tactttgaaa tgcattaaat gaaatcagat 3960 

gggctgtaat ttaatttaat tctcttctct ttcttatctg tcttatgcct ctgctatttt 4020 

ttcgcggcaa atcacgtata cgcctgttgg ccgccactca cctgttgctg ggccaagtga 4080 

aaatgatggg tcgtccgtat cgtatgtgac attctggtat cggaacacgc actccgcata 4140 

caaatgcttt aaaaatatgc gcagattatc ctgcgtttgg gcctgcgatg ccacatccac 4 200 

attttgcgac tccgaatctg aagaggttgt cgccattgtg ggttattcaa cgtgactaac 4260 

tcttgtcaac ggctaatgca ctaaaaaaaa atataataat ataaatataa ttcaaagaat 4320 

tacgggataa ataggataat caactgcaat aatttatcag aggcaatttg aaaggtttta 4380 

aacaagtgtt gcttacccat agaaacttgt atgagctagt ttttaaaatt tattctgatt 4440 

taaatatatt ttcttatatt taacattagt tctcactgta tttcgccata gcgaagcgaa 4500 

cttgagtccc ttgtggccgt ccattcgtgt gtgcatttct aatttggtta ggcttcaaag 4560 

agttgcctga gttgggttga tatttgttgt ggggtcctgt ctgcggggaa atggtgccaa 4 620 

aggaaatgga atatccacac ggggcacaca cacacactca aaatatcgaa gcacctgagt 4 680 

gtcgtgtgag ccgagtcacc tactcgtctc tcatatgttg taaatcagtt tattgtctaa 4740 

ttaatggcgg tgcttaactg tgtcgctatt tctgctaccg tggttgggcg aacttaattg 4800 

catttcgtcg cacagagaga caaagcaaat gccacgggtt aaaaactcaa atgaaatgca 4 860 

ctttgatcgt gggggcacac aaacacacaa acacataaac acactcacac acccgcaccg 4 920 

aacacgtggc ccaaaaggcg ggatgagctg caaagttctt gcctaacttg gttatatttt 4 980 

taatcaaaac tcatttcact tgattgagtt tcgacagaag cctgcaccag gcactggaaa 5040 

atctagcatg cacgatacat acaaagtaag atgattttcc cgctcccaag ctaaacttta 5100 

atttgcggca ttaatttatt acaccgaact gggacagttt cccatgatgg atatgacgcc 5160 

tggctaaaca ttcttgcatg ttatatatat ttttgcaagt gtctcggcca ggcaattagc 5220 

cgcaggtaag cggagc 5236 



<210> 2 

<211> 1686 

<212> DNA 

<213> Drosophila 
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<400> 2 

atggcgacaa 

acgcaggata 

cagaatgtca 

gattccgatt 

aacgagggcg 

atggccaccg 

cccctgaact 

caatattgtc 

ctggagaacg 

aactgcgtgg 

ggttatgccc 

tccctgcgct 

tgtgtggcct 

aatccggtaa 

aggcaattat 

ctggtggtgc 

ctgcacctgg 

tgcacaagtc 

ccgctgacat 

ccaactgctt 

ccccttcttc 

attgtcatca 

gtactgcgag 

atccggaagg 

ctgctgccct 

gtggtgttgg 

gatgtcacct 

ccgccggccg 

gtttaa 



cctcttcaga 
atctgcgcat 
catacgatac 
tgccggagaa 
ttattgacat 
tcgtcagtgc 
ttgacggata 
ccgatttcgt 
gctcctggta 
actatgagga 
tctccctgct 
gcacccgcat 
ggatattatg 
gatgcactga 
acaaaaccat 
actactttat 
tgctcgttgt 
tgcattcttt 
ggcaatggtt 
tacgtctgcg 
tttccattcg 
gctggttttc 
tgatcgtgcg 
cggttcgggc 
accgtccgga 
tcagcctgca 
ttgccattcg 
ggagtaatac 



ttcggagtcg 
atttttaaag 
ggacgaccca 
ctttagccca 
gcgaaacgtg 
cacgatggcc 
tctctgctgg 
cgaaggattc 
ccgtcatccg 
cctggagttt 
ggccctgctc 
ccgcatccat 
gtatcgtctg 
gaaaagtaac 
aaacataact 
gttggtcaac 
ggtgagttat 
atgggcgact 
cacccccagc 
tcaagttgag 
ccaggtcttt 
acccataccc 
aaagttacat 
caccattatt 
tgccggcacc 
gggctttgtg 
cacgctgctg 
tggacagatg 



caaaatgtgg 
catttgtatg 
tcatt ttcac 
gtgccgcgat 
gacgaggagc 
acaaatcaga 
ccaaggactc 
aacagaaaat 
gttagcaatc 
cgccagttca 
atatcgatta 
gtacatctgt 
gtggtggagc 
tatgaaaata 
aactttcagc 
tatttctgga 
ccttactcgt 
tctgtgcatg 
aatttgtctg 
atgtcaggaa 
gtcaaggaca 
atagccatcg 
ccccagtccg 
ctggtaccgc 
cagctggacc 
gtctccttcc 
aacaagttgc 
gccacgacca 



atgtggcatc 
cggagtgcgt 
ttggcccagc 
atctggagaa 
tggcggagaa 
aggagaaccg 
cggccggcac 
ttctggccca 
agacctggtc 
tcaacgagct 
ttattttctt 
tcgcctcgct 
gttccgaaac 
acatagccct 
tgtggtgcat 
tgttttgcga 
ctggcaatct 
gcagatgtga 
agagcaaatc 
tgtccgttca 
cgattgtgat 
tctacgtttt 
cccagccggc 
tgttcggact 
acttctatca 
tgttctgctt 
tgcccagttt 
cgcccagccg 



gcaggcccaa 
gttccgatac 
aacagactac 
tgccgcgatg 
ggaggagctg 
actcttttgc 
tgtgctgagt 
caagacctgc 
caattacacc 
gtacgtgaag 
gggcttcaaa 
ggcctgcacc 
catagccgaa 
acaagtagct 
tgggctgcat 
gggcctgcac 
ggcaattgcc 
tttcctgcag 
gcaacgttgg 
gatcctgacg 
gcgctggttt 
tctgatcaac 
accgttggcc 
gcaacacttc 
gatgctatcg 
cgccaatcac 
agtgaccccg 
agagttgggc 



60 

120 

180 

210 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1686 



<210> 3 
<211> 561 
<212> PRT 
<213> Drosophila 



<400> 3 



20 



35 



50 



65 



100 



115 



130 



145 



Ser 


Ser 


Asp 


Ser 


Glu 


Ser 


Gin 


Asn 


Val 


Asp 


Val 


Ala 


5 










10 










15 




Thr 


Gin 


Asp 


Asn 


Leu 


Arg 


lie 


Phe 


Leu 


Lys 


His 


Leu 










25 










30 






Val 


Phe 


Arg 


Tyr 


Gin 


Asn 


Val 


Thr 


Tyr 


Asp 


Thr 


Asp 








40 










45 








Ser 


Leu 


Gly 


Pro 


Ala 


Thr 


Asp 


Tyr 


Asp 


Ser 


Asp 


Leu 






55 










60 










Ser 


Pro 


Val 


Pro 


Arg 


Tyr 


Leu 


Glu 


Asn 


Ala 


Ala 


Met 




70 










75 










80 


lie 


Asp 


Met 


Arg 


Asn 


Val 


Asp 


Glu 


Glu 


Leu 


Ala 


Glu 


85 










90 










95 




Met 


Ala 


Thr 


Val 


Val 


Ser 


Ala 


Thr 


Met 


Ala 


Thr 


Asn 










105 










110 






Arg 


Leu 


Phe 


Cys 


Pro 


Leu 


Asn 


Phe 


Asp 


Gly 


Tyr 


Leu 








120 










125 








Thr 


Pro 


Ala 


Gly 


Thr 


Val 


Leu 


Ser 


Gin 


Tyr 


Cys 


Pro 






135 










140 










Gly 


Phe 


Asn 


Arg 


Lys 


Phe 


Leu 


Ala 


His 


Lys 


Thr 


Cys 




150 










155 










160 


Ser 


Trp 


Tyr 


Arg 


His 


Pro 


Val 


Ser 


Asn 


Gin 


Thr 


Trp 


165 










170 










175 




Asn 


Cys 


Val 


Asp 


Tyr 


Glu 


Asp 


Leu 


Glu 


Phe 


Arg 


Gin 
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180 



185 



190 



Phe 


T 1 • 

i le 


Asn 


GlU 


Leu 


Tyr 


va i 


Lys 


Gly 


Tyr 


TV 1 _ 

Ala 


Leu 


ber 


Leu 


Leu 


Ala 






195 










2uu 










one 

205 








Leu 


Leu 


lie 


Ser 


lie 


lie 


He 


Phe 


Leu 


Gly 


Phe 


Lys 


Caw 

ber 


Leu 


Arg 


Cys 




210 










215 










220 










Thr 


Arg 


lie 


Arg 


lie 


His 


Val 


His 


Leu 


Phe 


Ala 


Ser 


Leu 


Ala 


Cys 


Thr 


225 










o o t~\ 

230 










IOC 










240 


Cys 


val 


Ala 


Trp 


lie 


Leu 


Trp 


Tyr 


Arg 


Leu 


Val 


Val 


Glu 


Arg 


Ser 


Glu 










245 










250 










255 




Thr 


lie 


Ala 


Glu 


Asn 


Pro 


Val 


Arg 


Cys 


Thr 


Glu 


Lys 


Ser 


Asn 


Tyr 


Glu 








260 










265 










270 






Asn 


Asn 


lie 


Ala 


Leu 


Gin 


Val 


Ala 


Arg 


Gin 


Leu 


Tyr 


Lys 


Thr 


He 


Asn 






275 










280 










285 








lie 


Thr 


Asn 


Phe 


Gin 


Leu 


Trp 


Cys 


He 


Gly 


Leu 


His 


Leu 


Val 


Val 


His 




290 










295 










300 










Tyr 


Phe 


Met 


Leu 


Val 


Asn 


Tyr 


Phe 


Trp 


Met 


Phe 


Cys 


Glu 


Gly 


Leu 


His 


305 










310 










315 










320 


Leu 


His 


Leu 


Val 


Leu 


Val 


Val 


Val 


Ser 


Tyr 


Pro 


Tyr 


Ser 


Ser 


Gly 


Asn 










325 










330 










335 




Leu 


Ala 


lie 


Ala 


Cys 


Thr 


Ser 


Leu 


His 


Ser 


Leu 


Trp 


Ala 


Thr 


Ser 


Val 








340 










345 










350 






His 


Gly 


Arg 


Cys 


Asp 


Phe 


Leu 


Gin 


Pro 


Leu 


Thr 


Trp 


Gin 


Trp 


Phe 


Thr 






355 










360 










365 








Pro 


Ser 


Asn 


Leu 


Ser 


Glu 


Ser 


Lys 


Ser 


Gin 


Arg 


Trp 


Pro 


Thr 


Ala 


Leu 




370 










375 










380 










Arg 


Leu 


Arg 


Gin 


Val 


Glu 


Met 


Ser 


Gly 


Met 


Ser 


Val 


Gin 


He 


Leu 


Thr 


385 










390 










395 










400 


Pro 


Leu 


Leu 


Leu 


Ser 


He 


Arg 


Gin 


Val 


Phe 


Val 


Lys 


Asp 


Thr 


lie 


Val 










405 










410 










415 




Met 


Arg 


Trp 


Phe 


lie 


Val 


He 


Ser 


Trp 


Phe 


Ser 


Pro 


He 


Pro 


He 


Ala 








420 










425 










430 






lie 


Val 


Tyr 


Val 


Phe 


Leu 


He 


Asn 


Val 


Leu 


Arg 


Val 


He 


Val 


Arg 


Lys 






435 










440 










445 








Leu 


His 


Pro 


Gin 


Ser 


Ala 


Gin 


Pro 


Ala 


Pro 


Leu 


Ala 


He 


Arg 


Lys 


Ala 




4 50 










455 










4 60 










Val 


Arg 


Ala 


Thr 


lie 


He 


Leu 


Val 


Pro 


Leu 


Phe 


Gly 


Leu 


Gin 


His 


Phe 


465 










470 










475 










480 


Leu 


Leu 


Pro 


Tyr 


Arg 


Pro 


Asp 


Ala 


Gly 


Thr 


Gin 


Leu 


Asp 


His 


Phe 


Tyr 










485 










490 










495 




Gin 


Met 


Leu 


Ser 


Val 


Val 


Leu 


Val 


Ser 


Leu 


Gin 


Gly 


Phe 


Val 


Val 


Ser 








500 










505 










510 






Phe 


Leu 


Phe 


Cys 


Phe 


Ala 


Asn 


His 


Asp 


Val 


Thr 


Phe 


Ala 


He 


Arg 


Thr 






515 










520 










525 








Leu 


Leu 


Asn 


Lys 


Leu 


Leu 


Pro 


Ser 


Leu 


Val 


Thr 


Pro 


Pro 


Pro 


Ala 


Gly 




530 










535 










540 










Ser 


Asn 


Thr 


Gly 


Gin 


Met 


Ala 


Thr 


Thr 


Thr 


Pro 


Ser 


Arg 


Glu 


Leu 


Gly 


545 










550 










555 










560 



Val 



<210> 4 
<211> 15572 
<212> DNA 
<213> Drosophila 

<400> 4 

geegtaaact atggcgatgc cccactgtgc ccgcacacaa attccgccac ctcgctccga 60 

aatcgtgtgt atttggttaa taataatcta gttgggcatc gattgttcgg ccaaaatgaa 120 

gccactatcc gtggcggagt gttaattaat ttgcacttaa gcgcagctgc tggaagcact 180 
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tgatagaagt gggcatgccg gttctcttgg gaagttggct tactatagtt tagcgttaag 240 
gcgttaactt gtggccttta aagttcttag agaagacttt ctgcagctgc tcaggctgtt 300 
cattgcggca tctagcataa aatttagagt ttgaaagtaa gtttctatga aaaaaaaata 360 
cacgtatata taagtataac aagaaaagga ccaaatcaaa gtttatatac cattgctgta 420 
cttatatatt tgaatattaa atatacacaa ataaatatat accgaataag aacactttat 480 
gattaaaagc aacaattatt gattaatcta cttctaccta aactggtgtt gaaactggta 540 
taatgaaaaa caactaaggt acatttactt ccatttaatt ttctattggc agctatataa 600 
taaacagttc cgatccaacg tgaaccgacc tatcaccatt tttgataaag ttttacgcag 660 
aaacgtaaaa atatctaaaa gcatgctatt ttgtatttgt atttttgtat tttgtagaga 720 
aaagttgtag atctatattt aattttgcac acatgagcat tttaatcagt tttgtataca 780 
ttatttcgaa ctcttcacca acagaataaa atgaaattaa aatggtccga tgtcgggggc 840 
caaacaaaat aaaggcgtat tgagttgtgc catgtttcag gtggaaaaac cagctgcaat 900 
ataattgctt ttgaataatg cgaatttcgg ggggaagttt cttttaccag cacaatataa 960 
acaaaactgc tcaactattt ctcgatattt acattttcag ggtccatcag caacatagat 1020 
gacgcggagt atcactgcaa taagacgcag gtgcgcaagg tcatctcggg ggtggtgggt 1080 
gccgcctcct cggtcacatc catacaagtt gccaatttat tgcggttatt ccgaattccc 1140 
caggtgagtt tccattgtgt gatcactaac tgttcgactc agtgcgtgcg aagtccaaag 1200 
gtcatggcat aaaagggcag tcctttttgt ttttagttca gattttgctt tatgctcgca 1260 
actacatgta tatgtgctag tttggccaac aaaagtttct cctatttcgt aattaaaatt 1320 
tccaggagcc acatggtaac tgttgcaatg gcttgcaggt ctttcggttg gctttgccga 1380 
ttcctaagtg tctgcacgtg ccactaaatt agtcgagtcc cgcaggccaa ctagaggaac 1440 
atagtgctgc agttgccaag aagtgtacag tagggactct cagcaatcat aacaattaat 1500 
gtaattattt tactatatgt tcaatttatc tttaatatac aattccaaaa tgatgattct 1560 
tagctgctaa agaactgact tacatattaa tttcctacca aaaccaataa caaactggac 1620 
agatcgcttc agttgagtac ttcgactatc agatacccgt cacttgcatt accgcttaaa 1680 
aagcttgtat ctagcattgc tcgtatttca actgcaaacg acattgcagt tttttgggtg 1740 
ttcctcatgt agtgactaga gcgtgaaaaa cgccacaaac taatatccat ttctgattca 1800 
ggaaggaata taacattttc cgagtgccgg catgaatcac gtgcaggaag ttgaaggaga 1860 
ttgagtgccc ggttcggctg ctccagctac gtgagctgga gccaaaagtc aacaacgcga 1920 
caataactaa gtagccaact gccagatccc ttctcgatat atatatatag tatgtactca 1980 
ctactacttg gatgccatta aacgcacacg aagtgccagt cgaaggagca gcagccgttc 2040 
catgcaggcc cagctatcgt cccgaaagga cctcataaat cagcaataaa catttgattg 2100 
catttggttt gcacgcgtgg attctgcgct ggcttcgcat aaacagcact ttcgagcaaa 2160 
ttttgtgtgt tctgcaattc aaatttctgg tcgatagcga cttagtgagt ttcgtttagg 2220 
cttagcggaa attggctgca atgcaaaatg ttcgcaagga cacaattcaa atgacaatgc 2280 
agttaaaaat tccgtaaatg caatatttta tggcttatcg tatttggata acaatgtttt 2340 
aaatgcttga acaaagcatt gctgtctttt ttgtttaaag taaaattttt gccgtcagaa 2400 
gtttgtgcat tgaattaaat ggttattcaa acattacatt ttatttatct tgaaaaaagt 2460 
acaaaaatat gtattaaact cgtagtatta tatataaatt aaatatattt tattattttt 2520 
tgtttttatg gactgagtaa tagcacttga attttccaac tagtttttca caatttttcg 2580 
tactttaata aaagtaaata tttgcatctc tagtgtcaat tatcaaaagt caacaaagcc 2640 
aaagctacac aataattttg ctataactcg acgctttaaa cgctcatgaa aatcaaaagt 2700 
aattctgatt aaataaaagt gagaatgcca taaaccatag gtaaattaga ctcagaaata 2760 
aggtctggta tgagttgcaa tgcctgtctt gccatgagca tttgacaagt aatcaagcac 2820 
ttacgcagaa gctgttgact aatggtaatg acacttaatt agaagccaag tagttcatta 2880 
aacacttgca gttgacttgc cacgccatcg acgatttgcc ccaggatgca cctacttcct 2940 
gccccaagtg tttgtgggcg gatccttctc taaccaaacg gtttatgtgt gcgtgaaatt 3000 
ggtttcccgg cttgggccag gtcaccacag ttggcctttt gctgcagttg ccacagcttt 3060 
gggtttcgca aggtaatgcc agctgatgtt ttcttcgaca gcagcattgt ttgtggtttt 3120 
tggaaaacag cttaagcgag aaaatatgag cgaaaaactg ggaaggagct tacgaaaggt 3180 
tctgcagaag cgatagccca gaacgagtgc gaaaaaatat gaaaattaaa tggcgacctt 3240 
aaagtgaaag gggttcttac ataaaatatt gtagcacact tgccactttg tgccacccaa 3300 
gcaatagcgt ttccaactaa aatgttatcc catcatattg taacccgcct caccacattt 3360 
tttccatcca ggtgtcgtac ttctccacca gccccgagct gagcaacaaa cagcgattcg 3420 
agtacttctc ccgcactatt ccctcggatc actaccaggt gaaggccatg gtggagatag 3480 
tgaagcggat gggatggagc tacgtgtcga ttatctacga ggagagcaat tacggaatca 3540 
aggtgagtgt ttatgtatgg aactttgatt aaaatggaag aaaggagtgc atctatggct 3600 
gagaaatgga gttaaagaat gagaagctct atgttttaaa agagcttgtt aaaattcaaa 3660 
tttaatgtat gttgcttcat tttactttgg atacttcttg gataaaatct attttggata 3720 
accattaaac attgaaaagt caataataag taagttcaat aagtttcagc cacttaaatt 3780 
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tcctcttttt caaaggacta cataaatttc acctaatagg gcaaatcatt taactagtaa 3840 

tccttaaacg gtggaacttt aaaataatgc caagtgccgt ttaaaagcgt tagcagcact 3900 

ccattatcat atgtttatat ccttagccgc ttgcagtgta aacgatatac attttaagtg 3960 

aacgccttgt atcctatcca acttgtacct gtgtaaacat tttacccatc ccaccacatg 4020 

cgtttttatt gctcacccac actgcacttg ttttcaggct ttcgaggaac tggaggagct 4080 

cctcgcacgc cataacattt gtattgccat caaggagaag ctggtgaagg attcgggagt 4140 

ggccgaggac atcgcctacg acaatatagt gcagaagctg ctgacgaagc cgcgggctcg 4200 

aggtatgtaa ctggagagtg ggaagttggg attgcagaga gggatcggag gcgatgtggc 4260 

ggaggaaaag tgcatttcca accggaacag cgactgcagt ctgccagtca aaatgcaatt 4320 

taaaatgaaa atgtcaaagc gaaggcgcag tgcaaatgcg aaaggaaagc ccttcgcaac 4380 

gcccgcacaa tgaatgcgat ccggagtcga gaactatgct aggatttttg gctgggcgac 4440 

ggaaattgcg taaattaaat gaaaatattt aaaaactccc tcgccagcgc gccgttgtgc 4500 

catctgtgcc agcgggaaat taaaaagttc acgacaaaac aaacggaaaa agttatttgc 4560 

aaataaattg agagatagtc gcccagaagt gggcaaactg atacgtccaa tactgtagtt 4 620 

gtgcaaaaaa gtgtgtattt gttatggcca aaaatgaagg tcctgtctat tgtccgtgga 4680 

caaaaagcca ggagtacttg gcgtccctga ggtctgatga atgacaccac ctattcgaca 4740 

tggcaagcaa ttaaaaattc cagtcagggt tggccaagtt caaatttatc ggaacgattt 4800 

ttcatcctaa atgaaaacga tttttccaca tggcattgcc ccaatctgcg ccaatcatgc 4860 

gattcagcag cgagataaac aaaaaataaa tgtcaatcaa tttgactttt cacattttgt 4920 

gttcccctgt cgaatatcag ctgccattcg cagacgccga aaagaaatat aaattaaaaa 4980 

taattaccgc ccacagcttt gcgcatatca ttagtcagtg cctcgaaagg gtgaagaata 5040 

tcagggtcgt gcccgccaat aaaaagcaca cgggtcgacc aataacaatt gacttttgtc 5100 

tggcaaacaa tttttgattt acataaatgc gccatcagga gtcgtaaagc agtcggtcgc 5160 

ccaagaaaag ggttcctccg ccaccattat ttattgtgat ttgtgatttg acgccgctta 5220 

cagggcgcta gccgcctcgc aacgattaat caatttgaca aagtattttc cctggcatca 5280 

gttgtttaat tttgcatatc gagggttctt tcactgctcc tggtcctggt cgtggttctg 5340 

gtgggggctc cagttgggtc cctgaaaaat gatgcccttc cggatccgta actgcggctg 54 00 

gtaactgccg ccaggggtga actttacgct tgttcggatc cccacacatg tcactccgga 54 60 

gctaaatgga atgtgggcgg ggggccgaca tgtttttcct ttttatcgcc aatgtggcaa 5520 

gctgatgcga tttcagccag gacatgaccg gcccccaccc gctttagatc actttctggc 5580 

ccggctttaa agtttccccc taatctgcat aaatatcctg ccgagtgggc tcttaaatta 5640 

aattttatat ccctgctaag aataaccgtg gttctcgact ggtataaaca ggctactgtg 5700 

tacgagaaag ccgctcaagt gcatgtctaa ggcaaagtga aaatccaagg aaaatatcct 5760 

taacttcaca aaagtatcga caagttttcc cgttgtcgcc gccgcttcct ttgaataaga 5820 

ttccctcctc ttacttcgag tggcggagcg gcagtggcaa atcgataccg ggacatgttg 5880 

aaggggcaaa aaattatggc taagacaaag gacctcttgg gcggaccgtt aagaagtgac 5940 

aactgcagct cacgtatccc atacaagttt gtccagagcg aaacaataag ggctgaaagc 6000 

gttggaaaat ttggtaagca cctaaccaaa ggatagtttt agaagctcct gatttccatc 6060 

cgtagaacac aagcgactta ggtgtcggct ttaccaattg gccattggga acgttgtttg 6120 

cttggagttt ctaatgatat cttcacactt tttgtggact ttctgttcgc ttcttgggac 6180 

caattaaaat attgttttcc ttgcaaattg aagaacaata ccttggcaga aaatctaaat 6240 

atagtaaaga tacgtcgcca aatggcgaac atctgaaaaa gtggttgcat tacatcaata 6300 

tatacccttc gtcatgtgta ccccttgtac cccccatacc cttgttttat agacttgttt 6360 

tacgcctaac ggctatagta atccaactta atttatatag catttgcgta attaatattc 6420 

aaagcgaaat aatgggctgt catttaatac tataaaaaca ctttaatccg ccctcggaaa 6480 

ggagaacaca agagcagttg gcagctgcag attgttttct tttgcgcaca aaagccgctc 6540 

attcactatt gattgcaatt aaaattaaat tctttcttcc tcctttcgca tccttttaat 6600 

acgccgctgc ccaggggcca ttatcttcgg ttcggatcaa gaggtgcgcc aagtgatgcg 6660 

ggcggtgcgc cgggcgaatg caactggttc cttctcctgg atcggatccg atggctggag 6720 

tgcccggaat ttggtgtccg acgactacga gccggaggta ggtgaactat tccgtcagac 6780 

cggatgacgg gccagttacc gtcagccgga aattatgagc ctaaatgggc ggaacaactc 6840 

ggtttgatta atgactcgga tttcggtcac atactcacgc acgcacatac gagggcccaa 6900 

ccccattaca tgataaccaa aaaattataa atgatggcgg ccgaaagaac actttcacac 6960 

gccctttccc atgatgaatt atggttcaat ttggattaat cgtgcctttg tctgtgcgtg 7020 

gctaattaat ttgtctaagc ccttcaagcc cagaaatggt cctcctctct gatcaagctc 7080 

attaagtata cgccgcgtgt gtcttggtgt catcaacagg tggaaggcac gctgtctgtg 7140 

cagccgcagg cgaatccggt gcgcgggttc gaggagtact tcctcagtct gacggtggag 7200 

aacaatcaac gcaatccctg gtttgtgggt aagtgaatta ccgattaaaa ttaagctggt 7260 

gttcattaat gctctgatat aagtagtcct ttattagtat cccctttcgc taaatggtta 7320 

tcttgatttc agaattctgg gaggatcact ttcagtgccg atatcccggc agtacgagca 7380 
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ctccgtacaa 
ctgacttcga 
taaggtacga 
ataatagttc 
aaatgtaatg 
ttcgctctgc 
agtggagttt 
caagttaaat 
tcagatccct 
gtggaaattt 
cacacacaat 
gagtggaaaa 
acagacagac 
tctgataggg 
ccataaaccc 
tccgcttcga 
cccaggcggg 
ttaatatgac 
acccttcaac 
tgtattcgtg 
agaggtgaag 
tcttgtgggt 
ttgcacaacc 
tcgaatagat 
atgggaaaaa 
ggcaaacatg 
cgtccattgc 
actcttcccg 
gagtccgctt 
ttcgtaatgg 
gaaattactt 
attcagtctt 
caggatgtta 
cataaaacct 
actttaacaa 
tcgttcttgt 
tcaagctgat 
tgactgtatt 
tttcaccctt 
cattaattgg 
cttcaccttc 
ctgagctaca 
ctgaagccga 
tgcgtgtgaa 
ggtgtcggct 
gcacggatct 
tcccagctgg 
gtggcgctcc 
ccgtgtcatt 
attgcgccat 
gactcgtaca 
atttgcacgg 
catattggtg 
attgtattgg 
ttcattatag 
atgaatgaac 
aaacaataca 
ctagtgcaga 
gattacttaa 
cttttgtacc 



taactacacc 
ggaccagttg 
ttaagcttat 
ttttgatcaa 
tgatttcctc 
gaggccatga 
gagggtgagt 
taacggagta 
ttgctctcgc 
cctgccaacc 
ggcatatctc 
ttgacttgct 
ggacgggaag 
cgagcgggca 
atttaactcc 
cgggaacggc 
acagtaccac 
gggtaagata 
attaagttta 
ctcactgctc 
tttaagcggc 
caggccaaga 
tatcaagtaa 
tgaaatgtag 
tggggcaaaa 
ccgcctttgt 
agatccgcca 
acgcccacaa 
gggccatcgg 
gcgtatttgt 
ttaataatat 
aagatcgaac 
tattcctttt 
tgttaatgta 
taattaaata 
gtttataagt 
ctgagtgcca 
tattggcaca 
tccgggcaaa 
cgcaagaaaa 
cacccaacag 
tcctgctggc 
ccaacatagt 
ttttcctccg 
tctgcttcac 
tcaaagcggg 
taatctgtgc 
tggcggtgac 
ctttccattc 
cacatgccat 
tcgacgcctc 
tctacgcggt 
agtaattgag 
aagggtctgt 
cagcaatgct 
tccttataat 
cgcatgaaat 
aatgttttat 
ccctattgtt 
gctctacttc 



aaacagtgca 
cagtttgtca 
tttggaagca 
cccttttagt 
tccttaggga 
agccaacgaa 
gagtaataga 
actattcccg 
ttagttcctt 
cgtccttgca 
agcctgcgag 
tgtcatcgtc 
gagccaaagg 
agcggcaaag 
tattcgatca 
gacggtcccg 
tgggtcaagg 
aagcccccag 
cctgactgaa 
ctcatcaata 
tgagtcccaa 
agtacgtgga 
gtgaagccca 
cgccaagtga 
acaattaagc 
gctcgcccca 
cccggatgac 
gcagtactgc 
tgctatggcg 
caggtgagca 
ttaatagtta 
atttaacaat 
ctattaagac 
tatattttct 
atttgaagat 
atagttattg 
aatatttcat 
caattgtccg 
cttcgccaca 
cgtcgtcgca 
gcacaacgac 
ggggattttc 
gtgtgccatt 
aaactccctc 
ggtcgtttat 
caaacaatcc 
gtgcctggtt 
cgacggggcg 
actccattcg 
gcaccattat 
ctacatgatt 
gctcacgcga 
ctttttattt 
acaaaaagcc 
attaactggt 
ttagttatgt 
atttataagt 
aattatgaag 
acaggattca 
ggaaccgcca 



ccacaaagga 
gcgatgcggt 
acccactatc 
ttttatcaac 
tatgcaccgc 
gggtgcagac 
agccgtgtat 
ggaattttca 
cagtcggcaa 
gttgggccaa 
agtcctacaa 
attgaactcg 
ccctgactga 
tcggcgaagt 
atccctccgc 
cccggtacaa 
tgggcgagta 
taggaaatac 
ttcgtgcatc 
ctaattttga 
gccgccggag 
aggcgagagc 
tctgaggcca 
attattccgg 
catcaggcct 
tccatttatc 
gagacccatt 
cgccccatcc 
ttcagtgcga 
gcgcatacac 
ttaatggctt 
gaaatttgct 
catttttaat 
caacttcaaa 
aaggtaacgg 
ttatactagt 
gagggcctgt 
cttgcatgga 
aagttaaccg 
agtttgaaac 
acgcccatag 
atgtgctacg 
cagcggtgag 
ttttccgctt 
gcagcactgt 
gccaagaggc 
agtgtgcagg 
gatgtggaat 
cagatactca 
cccacacgcg 
gcgttctcct 
aagattccgg 
ccagaagagc 
atgcatgggt 
ttatttagtg 
gttttgtatt 
ctatatttta 
ccataatgcc 
ccatgtacac 
accacgttcc 



gcgactttca 
gatggcattt 
attttaaacg 
ctatggagta 
gacctgtgcg 
ttgctgaaat 
ccacttttgc 
aaacaggatg 
gtgatgcgca 
aagccccgca 
ggcgcattgg 
attgaaattc 
ctagcgcaga 
cgtctctggc 
aggcctcagc 
catcatccac 
caccgagggc 
ttaaaagtat 
cttagtctcc 
tctcattatc 
tctgtctgca 
tgctgttggc 
gtaatttatt 
aactaacgcg 
gaaaggggct 
aagtgttcct 
gcaagctgtg 
cggagatata 
ccgggatctt 
cgtgataaaa 
ttataaaacc 
ttctaaatcc 
aatacatagt 
taatttagac 
gtatattata 
gacaagctaa 
tttctagcta 
tatggaaagc 
gcagacacat 
tgaagtaacg 
tgcgtgcctc 
gtgtcacctt 
tgtgcaaatt 
ttccgcttcc 
tgaccaaaac 
cctccttcat 
tgagtgtgtg 
cgagatgccc 
tcaacggcgt 
aggataatct 
acccaatctt 
aggctttcaa 
tataattcgg 
ttgcattgtc 
cataaaatag 
tgggtaattt 
tgcattgcaa 
cataattggc 
gacttgtgtg 
gctgcgcatc 



cgccagaaca 
gcatacgccc 
tattaagaaa 
gcaataaata 
gcggaggtcc 
atttgcgaaa 
cggataaaag 
tccgactgac 
catagcttaa 
cacacactta 
ccatgctttc 
aataaattac 
ttgtcaattt 
aggcgtgaaa 
ggcgacgagt 
t tcaagcagt 
gagctgcggc 
aataccagga 
tgtaacctaa 
ccgctaaccc 
gcctgccatg 
actgcttcaa 
tgttggccct 
ttgacgggag 
gcaagtgaat 
ttgtgcggtg 
caagctgggc 
tttgcgtccg 
agtaacgctc 
acgggtatag 
atttgaaaca 
aaaaacccat 
ccaatattat 
aatatataat 
gtcaatgcat 
aggatctaga 
tgtttttctc 
aattaaaaag 
aaaaatgatg 
cctttccctc 
tggtcgggag 
tgccctggtc 
gccgagatga 
acacaggttt 
gaatcgcatt 
tagtcccaag 
gagtggtctg 
tgcgatgact 
ttggatggta 
gctcgtctgc 
cctgattgtt 
cgaatcgaag 
cgtaattggc 
tataacetgg 
cacaacatgt 
ggttatttcg 
cctcagagtt 
cactttgatt 
atttggctgg 
accagcatgt 



7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 
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cggtgaccat 
gtgagtccgg 
ggtcctctgt 
gaccgttgaa 
tcattccgtt 
atgaggacag 
gtgttgcttc 
gtccttgccc 
ctcccacgga 
atactcggtc 
aagtaaactg 
cagcatggca 
agcactgata 
gcaggaaagg 
agtattccaa 
tctcttaatt 
catataacac 
aggaggtaga 
accgcacacg 
gcatattccg 
caaaaggcta 
gggccaactg 
acactgtact 
cttgagccat 
ggctgccatg 
aactaaggcc 
gcaaccgagg 
ccggctctgc 
aaggggcgtc 
aggctacgta 
gcgctaaatt 
tctgaaccag 
ttagaaagtc 
atcttattag 
tgaaaagtat 
aatagcagat 
gagataaagc 
cgttccggca 
taaattaatc 
acattgttca 
gcacttctgc 
tggccgtaga 
tggtgggcgt 
gggaacaacg 
taatgcctcg 
ctacgcttgt 
agagtatgat 
ccatgatggc 
agcatatcac 
gtcagtaagc 
tatgtaattt 
cattatcgat 
ttaatcgtat 
caccagctgt 
tgagaaaagt 
aagctgaaca 
aatttaattg 
acccataact 
caatttccag 
atcaatatgt 



cagcttatcc 
atagctggga 
aaattgattg 
tgatttttgg 
tattcaggac 
gcggaagcca 
caaccatatt 
atttcatgat 
gctgaactgg 
cccatgccaa 
ccagacaagc 
aagaagctgt 
tcttcattac 
aattttataa 
gcatgaaagt 
gtatggctat 
ttaaagatgc 
aatagtaaat 
tcatagttct 
ggtgcaaagt 
ttattaaatg 
tacgctatca 
gcgcacacgg 
aaaaatgcaa 
aatgcggcat 
ctggcgcaaa 
tggcaagcca 
cgcagtttgt 
caaatggcag 
acagtacgca 
ttcactaagc 
gaactcgctc 
aagacacctc 
cggtattttt 
gtaacaggta 
acgatctgac 
atactgattc 
gatctttcta 
aaacaaaaat 
ttgttatatt 
tccctgtaag 
agcaggaaat 
ggcaggtcac 
aaattggata 
aacgggaaat 
gtccttgcag 
gccgcctcgc 
cgccgccgtg 
tcccacaaac 
gtagcgctca 
gctgttgaat 
caattaatca 
tttgcaatga 
acctatctct 
ttatcataaa 
gttttatcaa 
attttaaaat 
tgcagaaaac 
tataatcacg 
gccaaacaat 



gccagcgtga 
gtgggttggg 
gtaattgcgt 
tgccacagta 
gcgtggacga 
agggtttggc 
gtttttaatg 
gtttccccga 
tcatgatctc 
catatttttt 
caaatatagc 
ttaccccgat 
acggaaacac 
gaaaatgtcg 
tagatcagaa 
taaaggggaa 
cagaaaggac 
cagccagaaa 
tggtcccgga 
agaacaaaac 
tcggagtgga 
gatgtcacct 
caaatacagt 
tttacaactg 
gccacacaac 
ggtgccggca 
ttccgacttc 
tttcgaactt 
ccaaacaatg 
gcgtatcgca 
gttttataga 
acgtggccaa 
aagaaattgg 
tacacctcac 
aaaacaagca 
catgtccgtc 
cagagacgta 
tagtcgggga 
gaacgaatgc 
attcttaaag 
tggcccaagt 
ggctaatgac 
gcactgactt 
tcgcctccaa 
taacctaatt 
ctgtacataa 
tacggcaaca 
gtgaccgccg 
acaggtcagt 
aaatacacac 
agtgcaatta 
aattattgaa 
atttatatca 
cgtattcatt 
ttgggtaata 
agtaatagtt 
gactaggata 
tcgctaaaga 
tcgctggaca 
gccacaacgg 



ccatcgcctg 
cttcggcttg 
tatgcattaa 
tgcagaagcc 
gtgtttcatt 
aattcaacaa 
cctacccttc 
gatgaagctt 
ctcccaaagg 
aatacgccct 
cagccagttg 
attaaacgtc 
gacctttgac 
aatagcttgt 
aatccaagtt 
acacttttct 
cttcgccagt 
cttgattagc 
gcggagcaag 
ccggggaggc 
gcatacttct 
aatgcacggt 
gtgcacaagc 
gttgtgtctg 
aactggggat 
gccacgcctc 
cgactgccac 
gccgcgtttg 
tcaaagcaaa 
agttgaaacc 
tggtgccatt 
aaacaaattc 
tgggaaagtg 
tagtagagtg 
tttaaagtat 
cattcgtatt 
gacgcagcac 
actcgactat 
gtgaacgtgt 
tgtaagtatg 
cgttggcctt 
acttgaattt 
ctaatcaaag 
atagctatta 
ggaaagacgc 
tactcatacg 
tgcaccgcac 
ccacctgtgc 
aattaaggga 
acttcgtgtc 
agaggtataa 
agtgagtcga 
attaaattat 
tttaattttt 
tagcaagtct 
aggctgacaa 
aagaataaaa 
cgaagaaatt 
tcaatgcgta 
ccacgaccac 



tctcttttcg 
gggggattca 
ttattgcgca 
atcggcgatt 
aaaatgcaat 
ccggctgcat 
ctcttggtcc 
aaatgtcaac 
cgcgcgtcct 
gctattagct 
agttcatgga 
aaggaggaag 
agcaatgtgc 
ttgattattt 
atattggcaa 
agttagtttc 
gtctatgcca 
aacggagcgg 
ctacagttta 
gaagggccgt 
tggcccagtg 
tggtttgcga 
acacgaaccg 
gcaaaaggaa 
caccggacca 
cttccgcagg 
gcgccctttc 
tttgcggtga 
gcggaaatga 
tggagcactg 
ctagtaagga 
gatttggctt 
ttttcatgta 
aaagggtaga 
aaagattatt 
aacgtcgaaa 
aagtttccgt 
agcgttccct 
tatgatttaa 
gtaatatggg 
tgctttgacc 
ggattaggaa 
ccgtaagata 
cacacaatgg 
actgcttttc 
ccccgagcgc 
cgctggcaca 
ccaggaggag 
tatcggttaa 
gggcaaatgt 
cagttgcttg 
ttttatctcc 
attactttat 
gtggaaatgc 
taaaataact 
tcaaaaggta 
ctgaattttt 
gtgcgaaatg 
caatcaaata 
gcccacagat 



cccaaggtga 
tatccacacg 
tttatctggc 
ttattccatt 
aagacggaag 
cgattgcgat 
atattccttg 
aagccatgtt 
ttatgttggc 
attaagatgc 
ctagatgatg 
cttacattta 
aatgtgaaac 
tttttgtata 
ataatgaaag 
caaaggcttc 
ttgaagagca 
agctcccaca 
tggtctgtct 
aaacactcag 
gctgctaacc 
cgaggaagga 
caccgtacac 
aacaataaaa 
agaggcaaca 
acaaaggctg 
tgctgtccgt 
aaacaataac 
cccaccgtgc 
ctcgcaattt 
gtatgtgtgc 
taaatatata 
ttgaatagta 
ctagtttcgt 
gattggaatc 
ctaataagtt 
attcaatatc 
aaattatatt 
tccgtctatt 
gctcaggcgt 
acattactgc 
tgggacttgg 
agccatcttg 
gtcgttcgat 
tgactcgacc 
aacgtgcgcc 
ggaccctcgt 
aagatccaga 
cgagtcccgg 
attattgaca 
cctcgttaat 
tgacaatgta 
taacccattg 
ttcaattaat 
caagctgtaa 
tatccttctt 
tccattcact 
gccactcaga 
ccatatgccg 
aagggcagtg 



11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
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gcaacggcaa 
gcaatggcga 
agcgacaaca 
gcggcccagc 
tggtggctac 
ctcccaacaa 
tatagctaaa 
ccgagctgca 
ccgccagcac 
ccccggagat 
agcgattccc 
gtgctaggac 
aggcgaaata 
accttacccc 
gtacaacaat 
tttaacatgc 
ctcttcccaa 



cagcaacagc 
tggcgatgtg 
tggccagccg 
agcaaccata 
aataacaact 
tgggcatgga 
cgcgcaacaa 
aacgcaacat 
gggcagcggc 
tcgagattct 
ggcatcagtc 
gggactaaat 
attcatttac 
catagacaca 
gtatctcaat 
acgcaagagt 
aaacaaatgt 



aatagcaata 
gcaacagagc 
gccgtggcat 
acaacaacag 
ccattggcaa 
aacaacggga 
t tgcagcaac 
ccgcagcagc 
agtgcggcga 
cgcaaccata 
tagggatgcg 
tgggagcagc 
cacttttgtt 
aattatctta 
aatgaagata 
cataagtaag 
gtttctccat 



cgaatggtag 
aagtcgtggc 
ttgccatcac 
cggcaactgg 
cggtggatgg 
gccaacgtcc 
agctgctgaa 
aacatggaac 
agaagcgggc 
cgtaagtaga 
gaaaagcaga 
tgggccacag 
cgcttattaa 
acgttctttt 
ttcctgtggc 
taagagtcgg 
tt 



tagcaatagc 
caacaataat 
atcgactaac 
cgaaacaaca 
cagcgaaacc 
gccggtcctg 
ggagcagcag 
acaaagctct 
cgcaattccc 
ggaaagtgtc 
caagaggagg 
aaatatcttc 
tttgagtgaa 
atataagttt 
acttaaggct 
ctgatcaaaa 



aacagcaaca 
aaaatcaatc 
aatcacataa 
actgaggcgc 
atgtccgtgg 
caaacaaact 
cgacaacaag 
tccagcgaca 
gtttagctgc 
tgttgaccca 
gcaggaggtc 
agatcttaga 
ttctggttat 
atcgatagct 
ttcttcagat 
accttcgcaa 



14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15572 



<210> 5 
<211> 2456 
<212> DNA 
<213> Drosophila 



<400> 5 

ggtccatcag 

tcatctcggg 

tgcggttatt 

aacagcgatt 

tggtggagat 

attacggaat 

ttgccatcaa 

atatagtgca 

atcaagaggt 

cctggatcgg 

aggtggaagg 

acttcctcag 

aggatcactt 

aacagtgcac 

agtttgtcag 

tgtgcggcgg 

tgaaatattt 

ggaacggcga 

agtaccactg 

aggtgaagtt 

ttgtgggtca 

gcacaaccta 

gcactcttcc 

cggagtccgc 

tcttcgtaat 

gggagctgag 

tggtcctgaa 

tcacggtcgt 

cgggcaaaca 

gtgcgtgcct 

cacatgccat 

tcgacgcctc 

tctacgcggt 

tcaccatgta 



caacatagat 
ggtggtgggt 
ccgaattccc 
cgagtacttc 
agtgaagcgg 
caaggctttc 
ggagaagctg 
gaagctgctg 
gcgccaagtg 
atccgatggc 
cacgctgtct 
tctgacggtg 
tcagtgccga 
cacaaaggag 
cgatgcggtg 
aggtccttcg 
gcgaaaagtg 
cggtcccgcc 
ggtcaaggtg 
taagcggctg 
ggccaagaag 
tcaaatccgc 
cgacgcccac 
ttgggccatc 
gggcgtattt 
ctacatcctg 
gccgaccaac 
ttatgcagca 
atccgccaag 
ggttagtgtg 
gcaccattat 
ctacatgatt 
gctcacgcga 
cacgacttgt 



gacgcggagt 
gccgcctcct 
caggtgtcgt 
tcccgcacta 
atgggatgga 
gaggaactgg 
gtgaaggatt 
acgaagccgc 
atgcgggcgg 
tggagtgccc 
gtgcagccgc 
gagaacaatc 
tatcccggca 
cgactttcac 
atggcatttg 
ctctgcgagg 
gagtttgagg 
cggtacaaca 
ggcgagtaca 
agtcccaagc 
tacgtggaag 
cacccggatg 
aagcagtact 
ggtgctatgg 
gtcaggcaca 
ctggcgggga 
atagtgtgtg 
ctgttgacca 
aggccctcct 
cagatactca 
cccacacgcg 
gcgttctcct 
aagattccgg 
gtgatttggc 



atcactgcaa 
cggtcacatc 
acttctccac 
ttccctcgga 
gctacgtgtc 
aggagctcct 
cgggagtggc 
gggctcgagg 
tgcgccgggc 
ggaatttggt 
aggcgaatcc 
aacgcaatcc 
gtacgagcac 
gccagaacac 
catacgccct 
ccatgaagcc 
gcctcagcgg 
tcatccactt 
ccgagggcga 
cgccggagtc 
gcgagagctg 
acgagaccca 
gccgccccat 
cgttcagtgc 
acgacacgcc 
ttttcatgtg 
ccattcagcg 
aaacgaatcg 
tcattagtcc 
tcaacggcgt 
aggataatct 
acccaatctt 
aggctttcaa 
tggcttttgt 



taagacgcag 
catacaagtt 
cagccccgag 
tcactaccag 
gattatctac 
cgcacgccat 
cgaggacatc 
ggccattatc 
gaatgcaact 
gtccgacgac 
ggtgcgcggg 
ctggtttgtg 
tccgtacaat 
tgacttcgag 
aagggatatg 
aacgaagggt 
cgacgagttc 
caagcagtcc 
gctgcggctt 
tgtctgcagc 
ctgttggcac 
ttgcaagctg 
cccggagata 
gaccgggatc 
catagtgcgt 
ctacggtgtc 
gtttggtgtc 
cattgcacgg 
caagtcccag 
ttggatggta 
gctcgtctgc 
cctgattgtt 
cgaatcgaag 
accgctctac 



gtgcgcaagg 
gccaatttat 
ctgagcaaca 
gtgaaggcca 
gaggagagca 
aacatttgta 
gcctacgaca 
ttcggttcgg 
ggttccttct 
tacgagccgg 
ttcgaggagt 
gaattctggg 
aactacacca 
gaccagttgc 
caccgcgacc 
gcagacttgc 
cgcttcgacg 
caggcgggac 
aatatgacgg 
ctgccatgtc 
tgcttcaatt 
tgcaagctgg 
tatttgcgtc 
ttagtaacgc 
gcctctggtc 
acctttgccc 
ggcttctgct 
atcttcaaag 
ctggtaatct 
attgcgccat 
gactcgtaca 
atttgcacgg 
catattggat 
ttcggaaccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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ccaaccacgt tccgctgcgc atcaccagca tgtcggtgac catcagctta tccgccagcg 2100 

tgaccatcgc ctgtctcttt tcgcccaagc tgtacataat actcatacgc cccgagcgca 2160 

acgtgcgcca gagtatgatg ccgcctcgct acggcaacat gcaccgcacc gctggcacag 2220 

gaccctcgtc catgatggcc gccgccgtgg tgaccgccgc cacctgtgcc caggaggaga 2280 

agatccagaa gcatatcact cccacaaaca cagacgaaga aattgtgcga aatggccact 2340 

cagacaattt ccagtataat cacgtcgctg gacatcaatg cgtacaatca aataccatat 2400 

gccgatcaat atgtgccaaa caatgccaca acggccacga ccacgcccac agataa 2456 



<210> 6 

<211> 738 

<212> PRT 

<213> Drosophila 



<400> 6 



Met 


Val 


Glu 


He 


Val 


Lys 


Arg 


Met 


1 








5 








Tyr 


Glu 


Glu 


Ser 


Asn 


Tyr 


Gly 


He 








20 










Leu 


Leu 


Ala 


Arg 


His 


Asn 


He 


Cys 






35 










40 


Lys 


Asp 


Ser 


Gly 


Val 


Ala 


Glu 


Asp 




50 










55 




Lys 


Leu 


Leu 


Thr 


Lys 


Pro 


Arg 


Ala 


65 










70 






Asp 


Gin 


Glu 


Val 


Arg 
85 


Gin 


Val 


Met 


Thr 


Gly 


Ser 


Phe 


Ser 


Trp 


He 


Gly 








100 










Leu 


Val 


Ser 


Asp 


Asp 


Tyr 


Glu 


Pro 






115 










120 


Gin 


Pro 


Gin 


Ala 


Asn 


Pro 


Val 


Arg 




130 










135 




Leu 


Thr 


Val 


Glu 


Asn 


Asn 


Gin 


Arg 


145 










150 






Glu 


Asp 


His 


Phe 


Gin 


Cys 


Arg 


Tyr 










165 








Asn 


Asn 


Tyr 


Thr 


Lys 


Gin 


Cys 


Thr 








180 










Asn 


Thr 


Asp 


Phe 


Glu 


Asp 


Gin 


Leu 






195 










200 


Ala 


Phe 


Ala 


Tyr 


Ala 


Leu 


Arg 


Asp 




210 










215 




Gly 


Pro 


Ser 


Leu 


Cys 


Glu 


Ala 


Met 


225 










230 






Leu 


Lys 


Tyr 


Leu 


Arg 


Lys 


Val 


Glu 










245 








Phe 


Arg 


Phe 


Asp 


Gly 


Asn 


Gly 


Asp 








260 










His 


Phe 


Lys 


Gin 


Ser 


Gin 


Ala 


Gly 






275 










280 


Glu 


Tyr 


Thr 


Glu 


Gly 


Glu 


Leu 


Arg 




290 










295 




Lys 


Arg 


Leu 


Ser 


Pro 


Lys 


Pro 


Pro 


305 










310 






Leu 


Val 


Gly 


Gin 


Ala 


Lys 


Lys 


Tyr 










325 








His 


Cys 


Phe 


Asn 


Cys 


Thr 


Thr 


Tyr 








340 










Thr 


His 


Cys 


Lys 


Leu 


Cys 


Lys 


Leu 






355 










360 



Gly 


Trp 


Ser 


Tyr 


Val 


Ser 


He 


lie 




10 










15 




Lys 


Ala 


Phe 


Glu 


Glu 


Leu 


Glu 


Glu 


25 










30 






He 


Ala 


He 


Lys 


Glu 


Lys 


Leu 


Val 










45 








He 


Ala 


Tyr 


Asp 


Asn 


He 


Val 


Gin 








60 










Arg 


Gly 


Ala 


He 


He 


Phe 


Gly 


Ser 






75 










80 


Arg 


Ala 


Val 


Arg 


Arg 


Ala 


Asn 


Ala 




90 










95 




Ser 


Asp 


Gly 


Trp 


Ser 


Ala 


Arg 


Asn 


105 










110 






Glu 


Val 


Glu 


Gly 


Thr 


Leu 


Ser 


Val 










125 








Gly 


Phe 


Glu 


Glu 


Tyr 


Phe 


Leu 


Ser 








140 










Asn 


Pro 


Trp 


Phe 


Val 


Glu 


Phe 


Trp 






155 










160 


Pro 


Gly 


Ser 


Thr 


Ser 


Thr 


Pro 


Tyr 




170 










175 




Thr 


Lys 


Glu 


Arg 


Leu 


Ser 


Arg 


Gin 


185 










190 






Gin 


Phe 


Val 


Ser 


Asp 


Ala 


Val 


Met 










205 








Met 


His 


Arg 


Asp 


Leu 


Cys 


Gly 


Gly 








220 










Lys 


Pro 


Thr 


Lys 


Gly 


Ala 


Asp 


Leu 






235 










240 


Phe 


Glu 


Gly 


Leu 


Ser 


Gly 


Asp 


Glu 




250 










255 




Gly 


Pro 


Ala 


Arg 


Tyr 


Asn 


He 


He 


265 










270 






Gin 


Tyr 


His 


Trp 


Val 


Lys 


Val 


Gly 










285 








Leu 


Asn 


Met 


Thr 


Glu 


Val 


Lys 


Phe 








300 










Glu 


Ser 


Val 


Cys 


Ser 


Leu 


Pro 


Cys 






315 










320 


Val 


Glu 


Gly 


Glu 


Ser 


Cys 


Cys 


Trp 




330 










335 




Gin 


He 


Arg 


His 


Pro 


Asp 


Asp 


Glu 


345 










350 






Gly 


Thr 


Leu 


Pro 


Asp 


Ala 


His 


Lys 



365 
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Gin 


Tyr 


Cys 


Arg 


Pro 


He 


Pro 


Glu 


He 


Tyr 


Leu 


Arg 


Pro 


Glu 


Ser 


Ala 




370 










375 










380 










Trp 


Ala 


He 


Gly 


Ala 


Met 


Ala 


Phe 


Ser 


Ala 


Thr 


Gly 


He 


Leu 


Val 


Thr 


385 










390 










395 










400 


Leu 


Phe 


Val 


Met 


Gly 


Val 


Phe 


Val 


Arg 


His 


Asn 


Asp 


Thr 


Pro 


He 


Val 










405 










410 










415 




Arg 


Ala 


Ser 


Gly 


Arg 


Glu 


Leu 


Ser 


Tyr 


He 


Leu 


Leu 


Ala 


Gly 


lie 


Phe 








420 










425 










430 






Met 


Cys 


Tyr 


Gly 


Val 


Thr 


Phe 


Ala 


Leu 


Val 


Leu 


Lys 


Pro 


Thr 


Asn 


He 






435 










440 










445 








Val 


Cys 


Ala 


He 


Gin 


Arg 


Phe 


Gly 


Val 


Gly 


Phe 


Cys 


Phe 


Thr 


Val 


Val 




450 










455 










460 










Tyr 


Ala 


Ala 


Leu 


Leu 


Thr 


Lys 


Thr 


Asn 


Arg 


He 


Ala 


Arg 


He 


Phe 


Lys 


465 










470 










475 










480 


Ala 


Gly 


Lys 


Gin 


Ser 


Ala 


Lys 


Arg 


Pro 


Ser 


Phe 


He 


Ser 


Pro 


Lys 


Ser 










485 










490 










495 




Gin 


Leu 


Val 


He 


Cys 


Ala 


Cys 


Leu 


Val 


Ser 


Val 


Gin 


He 


Leu 


He 


Asn 








500 










505 










510 






Gly 


Val 


Trp 


Met 


Val 


He 


Ala 


Pro 


Ser 


His 


Ala 


Met 


His 


His 


Tyr 


Pro 






515 










520 










525 








Thr 


Arg 


Glu 


Asp 


Asn 


Leu 


Leu 


Val 


Cys 


Asp 


Ser 


Tyr 


He 


Asp 


Ala 


Ser 




530 










535 










540 










Tyr 


Met 


He 


Ala 


Phe 


Ser 


Tyr 


Pro 


He 


Phe 


Leu 


He 


Val 


He 


Cys 


Thr 


545 










550 










555 










560 


Val 


Tyr 


Ala 


Val 


Leu 


Thr 


Arg 


Lys 


He 


Pro 


Glu 


Ala 


Phe 


Asn 


Glu 


Ser 










565 










570 










575 




Lys 


His 


He 


Gly 


Phe 


Thr 


Met 


Tyr 


Thr 


Thr 


Cys 


Val 


He 


Trp 


Leu 


Ala 








580 










585 










590 






Phe 


Val 


Pro 


Leu 


Tyr 


Phe 


Gly 


Thr 


Ala 


Asn 


His 


Val 


Pro 


Leu 


Arg 


lie 






595 










600 










605 








Thr 


Ser 


Met 


Ser 


Val 


Thr 


He 


Ser 


Leu 


Ser 


Ala 


Ser 


Val 


Thr 


He 


Ala 




610 










615 










620 










Cys 


Leu 


Phe 


Ser 


Pro 


Lys 


Leu 


Tyr 


He 


He 


Leu 


He 


Arg 


Pro 


Glu 


Arg 


625 










630 










635 










640 


Asn 


Val 


Arg 


Gin 


Ser 


Met 


Met 


Pro 


Pro 


Arg 


Tyr 


Gly 


Asn 


Met 


His 


Arg 










645 










650 










655 




Thr 


Ala 


Gly 


Thr 


Gly 


Pro 


Ser 


Ser 


Met 


Met 


Ala 


Ala 


Ala 


Val 


Val 


Thr 








660 










665 










670 






Ala 


Ala 


Thr 


Cys 


Ala 


Gin 


Glu 


Glu 


Lys 


He 


Gin 


Lys 


His 


He 


Thr 


Pro 






675 










680 










685 








Thr 


Asn 


Thr 


Asp 


Glu 


Glu 


He 


Val 


Arg 


Asn 


Gly 


His 


Ser 


Asp 


Asn 


Phe 




690 










695 










700 










Gin 


Tyr 


Asn 


His 


Val 


Ala 


Gly 


His 


Gin 


Cys 


Val 


Gin 


Ser 


Asn 


Thr 


He 


705 










710 










715 










720 


Cys 


Arg 


Ser 


He 


Cys 


Ala 


Lys 


Gin 


Cys 


His 


Asn 


Gly 


His 


Asp 


His 


Ala 










725 










730 










735 





His Arg 



<210> 7 
<211> 9400 
<212> DNA 
<213> Drosophila 

<400> 7 

cagttagttc ctagttaaca agtgaggttt ggttgaaatc tttctttgct atataggtat 60 

ttcatttccg tatgctgtat cggttttgta aaataataaa gcatttcgcg agctctctct 120 

ctctctgtat ctcaaaaata taaagtaaat gcccctatgt gtgtgcttaa agcatctaga 180 

ttgatgaaaa tgtacacttc aaatgcttcg ctctggtttg cttacaacgt agctgaaatg 240 



1 1 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 



PCT/TJS01/09341 



cagcaacaaa 
ttggaaggaa 
ctttcaaaac 
tgtatatata 
acaaaataca 
atcttataca 
tttttaagta 
gtacatacat 
gggcccgatt 
ggcaccgcag 
ataaagtttt 
aatgctccat 
tttttgccta 
gataacatcg 
gcggaggatg 
tgtgtgatgt 
ggcgcttctg 
cgaatagctc 
gctgctgccg 
ccgcaccgtt 
gcgctccatg 
ttgatatagt 
atcgccatca 
tccaagtcgc 
gcggccgttg 
cgtcgagccg 
tgttggctcc 
ccggcggcca 
acctcgtcca 
aggatctcct 
gcatcgccca 
tcctgctggt 
cgtcgttgca 
tgctgctgct 
tgctggtagt 
tgcggatagc 
tgggaggcgg 
gcgcccctct 
tcgtgcgcct 
gttttcaatc 
tagtaagact 
atcaggtgtg 
ctacctcttt 
gtcattaggc 
gtggctccca 
tcgtcatgcg 
ctgcaaaggg 
ttcgagcagc 
cgccggatct 
ttgaagcaga 
cgcaaataga 
cagcgtatgt 
atggattttc 
ccgtgcgact 
gtaaaagtga 
aacttaccac 
cctccatcag 
tggccaccac 
tctggtccac 
cagaggcaaa 



atggcaatgt 
ttttagttgt 
gtaggatatc 
tatatatctt 
taaaacataa 
tatacctaca 
tttaaaaatg 
ttatatttaa 
ccaattccaa 
gggtatggat 
gggattttcg 
atttcgacta 
agactgttcc 
caccgtgttg 
ctgatgggga 
atgcgacgac 
cgcgggcgtc 
atttaccgcc 
gaatgcgcca 
gtgctcgggt 
tttgcgcagc 
caaacattag 
gacgacgcat 
tctcgtcctc 
cctcctcatc 
atctttgctg 
tggacgagac 
ccgccgctct 
catcgaagtc 
cgtagaccgg 
tgcccagatc 
actgcgccat 
ggtgcaacag 
gctgctgttg 
aggcagccgc 
ctgttggctg 
cggccgtgcg 
tctttttgcc 
gttgcggcga 
ctcctgttcc 
cgcccaaagg 
tggacccggc 
cgagcacaac 
cactttttag 
taatggcatc 
cctcttcgct 
ttccggctgt 
gcaagcactt 
tttcgttttg 
tgaagcagta 
agtaggccga 
cgaacctagg 
aatatcttac 
tttgcggcac 
attaatttaa 
aaagaagaca 
cacgcaataa 
gctcagcgac 
ctccagcagg 
gggttagggc 



attatttccc 
ataggtggaa 
tttttggttt 
acgagcataa 
ttcgcctgaa 
tccatatata 
gcatgtggtg 
tatcgatttc 
ttccaacccc 
gtgccctatg 
aatcactgca 
acgactaata 
gggggagttg 
ttctcatcga 
tgcggatggg 
gatgatgtgg 
tgattcgccg 
aaacggccgc 
cgcagggagc 
tgcggtcctg 
tgacgtctgt 
aagtacttac 
ccgctcatcg 
ctcctcatcc 
ctcctcctcg 
agccagcgac 
actgccgcca 
catgttgttg 
acccacctcg 
cccaccatcc 
acagttggaa 
gtgtggcggt 
gtgagcatgt 
ctgctgctga 
attgccagtg 
gccaggcttg 
actgtgcaga 
attgtggccg 
tgatgcagca 
agttgatggt 
tccctgttgc 
agagctggtg 
atggccacca 
ggttcttctt 
ctctgtgatg 
cactcctaac 
gggtccattg 
gggcagccaa 
aatgcaatgg 
tccatacaca 
gcaccaaaca 
atttgaggta 
cgcacgctag 
attataatcc 
aatggtaact 
agcactggta 
tcgttttggg 
agtccgtagg 
agctcgtccg 
tgcaaagtgt 



tttacttctt 
aacggcagct 
atgaaacaca 
tcagcgtaaa 
ttctatatat 
tatacttgat 
acatgtatta 
caagttggta 
ccatcgctcg 
gtctaggttt 
cgtatagcat 
ctaggtatag 
cgttaactga 
gcatggccga 
ttagctggtg 
tggactgtgg 
gcggcagcgg 
ctggcgactt 
cctcaccaac 
ctcctggtgg 
aaaggggata 
tgaaaccgcc 
ctctgcggtg 
tcatcacaga 
tcttcgtcgt 
tgttgttgct 
ctcaggcttc 
aagacgccag 
taatgctgca 
gcgccgccga 
tgcttatttc 
ggagcaggga 
tgctcggcct 
tgtagctgct 
gctagtggtg 
gtcttcttgt 
gatccactag 
ctgttcggct 
gcacccgcat 
tgcttccagc 
tgctgttgtt 
ggtgaggcat 
agggtctgcg 
tgctgggcct 
ggcactggtg 
ggcagcttgg 
ccggtaacaa 
gcgtgctgtc 
aacacgaaga 
tcagccgtgt 
gcgctgagca 
catgggcatc 
cgaggcgagt 
aggagaggaa 
gcgcttttga 
ttacaaaggt 
tataggtgct 
acaagaggta 
aggtgagcgt 
agatgaaatt 



atattcctct 
ggatagcaag 
aaacaaatct 
tcaaaaacaa 
catgtacata 
tttctggttg 
caatacatca 
taaaatgcat 
agttttggtt 
tagttatttc 
gacagcatgt 
ctagagatcc 
tcacttagcc 
caggtggcgc 
gggatgcgga 
tgacagtttc 
tggtcccgag 
ggcattcagt 
gccgccgcta 
caaagccccg 
aatgtaattt 
gcttaaaatc 
gcaatccatg 
cgctgtcgtg 
catcgtcatc 
gctgcttctt 
cgccgccgta 
tgccaacgct 
catccgagtt 
cgctgccttg 
tatactggcg 
gctgctgctg 
gctcctcgtc 
ggcgacgcaa 
ggtccaacgc 
acgatggata 
gcgaggcggg 
tttgagtcca 
tattgtagta 
tgtagtagtc 
gttgttgctt 
gaccagtgga 
gtggagcctg 
cgttcagttc 
tctgctgttg 
gcttcttgga 
tgcccgagga 
tcacatactt 
tatagagccc 
catcgtccat 
gcaagaggaa 
gaaggatctt 
atgttccttg 
tgtgtaacca 
agcttcatct 
ggcataaaac 
gggatcgatg 
gtagcagttc 
tgagtacgaa 
cggccttctt 



tgattcgcag 
aaatgacaat 
taagcctgtg 
ttgatttctt 
cagttcctag 
tttaaattag 
tacatatatg 
gcaagtcgaa 
aagcttgcac 
atatacattt 
tgtttcacac 
taagctacaa 
agtggttcca 
tgctgatggt 
ttgggattgc 
tgtagctgct 
ttgccgtgac 
gcttgctgct 
accccaaaaa 
tagtcaagag 
tttcgtaacg 
ctcgtcctgc 
agcatcatcc 
caattgctcc 
cgccgtgcag 
gagctgctgc 
ccgactaccg 
attgttgtac 
gcggttgctg 
gttgtagtag 
ctgcataaac 
gctaccagct 
cgaggagagt 
ctgttgctgc 
ctccgcgtat 
gaaaagcacc 
aactcctccg 
gtagaatacc 
ctcccgcggt 
cggctgatgg 
gtgcgcaaaa 
gcggagagtg 
tagtgtgccc 
acaatcagag 
ctgctgggga 
cgtggagacg 
aattgatgtt 
ccgatactcc 
cagcaatgtg 
tttggctcct 
cacaaaggag 
taaagtgtat 
gtctttagac 
atggcagcct 
catattatgt 
aaggcattcg 
accagcgaga 
accttgggcg 
tggaaggctg 
accttcgtaa 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 
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cagaaccagg 
aaaatgcttg 
caaaggtaaa 
agacagagtg 
caccagcaga 
ttgataatct 
aacttgattt 
gaatacgaat 
acaaactaaa 
atattttatt 
acgatgtcta 
aaaaatgatt 
atgagtctat 
caaattacta 
gcaacagtga 
gagaggttca 
aaatacatat 
accaccacca 
tatccaatcc 
gctggccacg 
tgatgttgag 
ctattgaggc 
gattcaagtt 
cagcagcagc 
agttcagatt 
catgactggg 
gggtctggtc 
gattggagcc 
cgataactat 
catcgaacat 
agtttgtcag 
atccgttggc 
tgagcatatg 
ttctcggtct 
cccttgccca 
tcccccgcgt 
tttagattat 
ctcttgagat 
gcgaatacaa 
tcaatacgat 
atagtcttgg 
tttgaaatgc 
cgctgcgctc 
gcaaggatgt 
gcgattcgtc 
tagcctcgcg 
ctgtttggat 
tcacttcgga 
tttccagatt 
aggtgggctc 
cgcagctgct 
tgtcatcatc 
acttggcgat 
tcatgttcca 
cgcctccgga 
cgttcccggg 
ttgtggtcag 
tgttggatgg 
aggcagtgtc 
catcatggcg 



cggtgcccga 
tttcggtctg 
tgctggtata 
aaaagttaat 
aggaccctaa 
taaacgtttt 
acaataattt 
aaaacaatga 
tgaacaaaca 
aattaatgaa 
tgttaaattt 
aaacgattaa 
catatgaaga 
cgatacataa 
tcatcgctgc 
ttaacaattt 
gtctagaaag 
cttatattgt 
attgccgtcc 
ctgctgctgc 
gttattgctg 
agcagcagct 
gttcatggca 
tgccgcaaca 
gttgcggtgt 
cggcggcaca 
gcgaatattg 
gtttcttacc 
gaagaccacg 
ggtgaacaac 
gtggttgcaa 
agaccacgca 
agtcttactg 
tcagatgttt 
agctggccga 
tgacatcgct 
actattctaa 
cgaagtgatc 
ttcgcaccag 
ccgaactcaa 
tttccagaac 
aaacaatgca 
cctgatgatc 
ggctacacgc 
cagcaggttg 
ctgtcgctga 
aatcttcgtg 
catatcgttg 
gttgagccaa 
aaagttgcgc 
gctgttgtgc 
gtactcatcg 
gccggctgct 
gtacaaattg 
catcccggct 
gacaacggat 
tagggttccg 
tgaagtggca 
ttcaccgtgg 
gccagccgga 



taggatggca 
ttcaatgccc 
gatcgaggtg 
ataagttcga 
agattcgatc 
gaagaatagg 
tggtgggtgg 
acagaacacg 
caaaatgcaa 
atacaaaaga 
gttaaacaat 
gtaaaacatg 
tatttcatta 
atataaatgg 
ccagcacagc 
aagttactta 
cctagcctag 
aagtcgtcca 
acattaaggt 
tgctgctgct 
gcgttgctgg 
gcagcagcag 
tgttgcatat 
acttgttgtt 
gagttgcgga 
ttggactgac 
ttttgctggc 
ttcacaaaga 
cagatgccga 
gaatgctggt 
ctgcagacgc 
ctaaaacaaa 
gtcaatatag 
cagtgtcagg 
gatcaccttg 
gtcgttgctc 
cgtttttcag 
atagctgggc 
accgccttca 
gggccactga 
tcgaacggag 
aacatttaga 
gtatccgcca 
atttggtcct 
gagccggttt 
tccgggaagg 
gtgaccaaca 
gcgatggaga 
aggctgcggc 
agacgctggt 
agacagtcgc 
tagccggagt 
ccgccaggac 
cgggcatgag 
gcgttcaaat 
ccactggccg 
tcatcagttc 
ttaccgccag 
tggttgtggt 
tgcttggtgg 



cagtgtagga 
aggagaaaga 
cgcgcggact 
gaagtcagtg 
atcagggagc 
tgtcagattt 
tggtgaaaat 
aaacaagcac 
tatttacagg 
aagtcagaag 
caatgcacat 
ccagagtgat 
ttggttttgt 
gagtcttcgc 
ttttgagtct 
agcattatat 
cctaaaattg 
tgttggcctg 
tgtgattgcg 
ggacatgctg 
cctgaactgc 
ccgtctgctg 
ttcgctgcgg 
gctgttgctg 
tggaattgtg 
tggttggcgt 
gtctgctgcc 
ccccattgaa 
tgcttatgta 
gctcatccac 
tgtgcgtgcg 
aacattaatt 
ttccagaaca 
gttatgggct 
ctgttcagga 
aggatggctg 
ttctctgtgg 
tttagaatgg 
ctattatcta 
tccgaatctg 
agcactatgg 
actcacatat 
gcagaaaggc 
ccggattgag 
tgaccacaca 
tctccttgtc 
tgtcgccgcc 
tcaaggagga 
attgcgtcag 
ttaccttgtg 
caccgctggg 
cgggtatcct 
agggctgaac 
tggggccgca 
taatattgtt 
cagtgcccgg 
cggaacctcc 
ccggcagctt 
ggtggtgcag 
tcgtcgtgga 



aaatagtaat 
gcagctcgat 
ggagtggaca 
caaatttttt 
ttttagacat 
gatagtcaaa 
ccaaagacta 
aacacatagg 
acaaaattag 
tgttctaggt 
aaaaggggtt 
gatttgtaac 
gtgctaatct 
gtgcaagctg 
tatctttgaa 
cacatgtgtc 
tgtttatctg 
cataatgatg 
gtgcggatgt 
ctgctgcaga 
atgttgctgc 
accgtgctga 
ctgattggac 
ctgctgctgt 
ttggataaag 
ggtcagggtc 
cgaccgctga 
cagcttcagc 
gatgaatatg 
aacgtccatt 
gttagtggac 
cgtataaatt 
cgcaggtggg 
gggagagttg 
tgcgtaggcg 
tattgcgaac 
acttaccgta 
attccaggcg 
ttagagcagc 
ggaagaccac 
tgatatacag 
atttttgact 
attgtattcc 
atccaaccac 
atgcaacaac 
gtgcatcatc 
gtagagcgtt 
gtcccgctgg 
atcaggtgtc 
ggccatgctc 
tgccggagtt 
cttcattaga 
attcacatca 
aaacaacttg 
gatggtgcgc 
tgggcggttg 
gacgcccgtg 
gctgctcggc 
ttgcggcggc 
ggtggcccca 



gaagccgcag 
ggccaggagg 
gttaaagaga 
ctttaagcag 
tttaaggaaa 
cttaatttag 
agagtaatac 
taatatacat 
caacatttgt 
atttagttgt 
atggtatttc 
aaaaagaaaa 
taagctatga 
ttactgggac 
tcctgccaga 
tctcggtcat 
ttttttcacc 
ttgttcgtgt 
ggctgcggct 
tagttgtggc 
agtgcagcgg 
tgcagattca 
agcacaacag 
tggacattgt 
ttcgtgttcc 
aaggactcgt 
tgcggcaagg 
gttagcaggg 
cgcatgtttc 
agtatggcaa 
tccagactgc 
attaggcata 
attcgttaca 
tatgtgacgc 
ctgttggatc 
taagaaaaca 
actgcgggag 
atcgaaggcg 
tcgtggcagc 
gctggactgg 
ttataagtat 
ttttgcacca 
aggccagtta 
gaggacagct 
tccatgatca 
ttctcggaca 
ttgctactgg 
tttactcgca 
gcgggatgcc 
accggcggct 
gtcgaactga 
acgcaacgcc 
cccactcgag 
gactcatcat 
accacgctgc 
ggtgagetct 
agtatcctcg 
ttggtgggtg 
ggtggcagac 
cccttcaaac 



3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
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gtccaccgcc 
gcggtggtac 
gaagtctcgg 
cgttgccaaa 
tcgtcgtcga 
taccgggaca 
gggcacatct 
cacatattta 
ccagagtatg 
agatgtatta 
actccacatt 
tgaggttgat 
aggcggtttg 
agggtatggg 
ttagcggcga 
aagacactta 
gatacgcttt 
tgggtcgtta 
agatcacgaa 
tttcactcgg 
tattattatt 
gaaaggaaaa 
tgcaatatca 
gtagattcta 
aaaacactat 
ttgcatttat 
gccaagcaca 
tcatgaaagt 
ttagataatt 
atagaagata 
acacccttct 
acagaacaaa 
atgtggatgt 



cacggctgtc 
gttgaacagg 
cgggggcggc 
gatcggcgga 
agtttgggct 
gggatccccg 
ggaaaatggg 
agatgcttga 
aaatttccaa 
ccttgagttc 
cccgtggtca 
cacatcgccg 
gtacttggac 
tttatatgga 
tgattgttgt 
cgatctaacg 
tacactgcct 
gcttggtagc 
tcatcgatcg 
tttgaagtat 
attattttaa 
ggagaactat 
catcaatgtt 
ataaatttaa 
ataagcaatg 
atatctttaa 
cctatgtaca 
tggactcaaa 
gtttacatat 
agaagatggc 
gtgtacaccc 
gacccgtttt 
aaatgtagag 



gtgctgctcg 
ccgggattcc 
gcacccgctc 
ccattgctca 
gcacttatgc 
aacatgctcg 
aaaaatacgc 
ttgattgact 
atgacatgtt 
agtacgctga 
ttgcagatgg 
ggatcgcact 
actgaaaggc 
catggtatat 
agtttggttt 
atatatgctc 
gggatatact 
aagtgcaata 
tccatcattg 
tcaaactgaa 
ggcacataac 
ccttttcttt 
gagtcaatct 
gttattttaa 
caaatatcaa 
taattaaaat 
tatgtacata 
agtaccattc 
tgactccttg 
atggggcagg 
acactgtggt 
ctttgtacac 
aaaagttgct 



agggcgtgga 
cactgattcc 
caaccccggg 
gcacccatgg 
actggtagtg 
tggctgccaa 
gttcatcagt 
tgtgttgaaa 
taatgatttg 
gtgacttggg 
taatcgagaa 
cgatggtcag 
gaaggcttca 
atatatggca 
ggtttggatt 
ataggatatt 
tagtaatgct 
gtagtcataa 
gttcttatcg 
agggaagata 
taataactta 
tattgatatt 
aatgacctgg 
tattcagaac 
gaaaacataa 
ggtcttatat 
cctcttatgt 
aacttatgta 
ggcaaacacg 
ggagggggta 
tgtagttgtt 
ccgatgcaca 
gcagctgcgc 



gtgcgtgacg 
caccactccg 
cggatggatc 
cggcggactg 
tgcctccaga 
cacgccgcag 
aaacagagtt 
tagtttattg 
aagatccaat 
aaacatgcag 
gcggccatag 
tttcttacct 
taagtcacag 
tataaaaaga 
ggattgttgg 
gacaatatgg 
gaggttgatg 
ataccaaatc 
gcaatgctca 
acatttataa 
agacatgttg 
ggttccattt 
ttacagtaaa 
ctctttgtat 
ttaattgtaa 
ttatgtatga 
atttccgttt 
tctgaaaggt 
aacaagcagg 
gggtagggag 
gcatggccat 
ctattcacga 



gcggtgtgtt 
ggaagtgtgg 
agtccactgc 
ggcctgttgg 
tacttgtggg 
ctctgcttgt 
gggtaatttt 
atattcgcaa 
aaactttaaa 
ttaacactcc 
ttggcccgaa 
tcgcaggcgt 
ggtgctgggc 
aattttttgg 
tgggcctgca 
ttggtagcat 
gtggagtatg 
ctgccggccc 
ttcggcagtg 
tgtttggtat 
tatttcgtta 
aatttacttg 
tatcaaaagt 
aactcctatt 
gaattataaa 
tgatatccag 
ggagtcatat 
agagaatcca 
caaaagacaa 
ctaagaatac 
cgattcgtgg 
acatatatgc 



7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9400 



<210> 8 
<211> 5718 
<212> DNA 
<213> Drosophila 



<400> 8 

atgctaccaa 

caaaccgcct 

atcaacctca 

aatgtggagt 

tgtgcccaca 

cccggtaccc 

acgacgacca 

ggcaacggca 

agacttccca 

ccaccgcaac 

ggtggacgtt 

catgatggtc 

actgcctcac 

tccaacacga 

accacaaaga 

gggaacggca 

ggaggcgatg 

aacatgactc 



ccatattgtc 
acgcctgcga 
ttcgggccaa 
ggagtgttaa 
agcagagctg 
acaagtatct 
acaggcccag 
gtggactgat 
cacttcccgg 
acaccgccgt 
tgaagggtgg 
tgccaccgcc 
ccaccaagcc 
ggatactcac 
gctcacccaa 
gcgtggtgcg 
atgagtccaa 
gagtgggtga 



aatatcctat 
aggtaagaaa 
ctatggccgc 
ctgcatgttt 
cggcgtgttg 
ggaggcacac 
tccgccgcca 
ccatccgccc 
agtggtggga 
cacgcactcc 
ggccacctcc 
gccgcaactg 
gagcagcaag 
gggcgtcgga 
ccgcccaccg 
caccatcaac 
gttgttttgc 
tgtgaatgtt 



gagcatatat 
ctgaccatcg 
ttctcgatta 
cccaagtcac 
gcagccacga 
taccagtgca 
tgggtgctga 
ggggttggag 
atcagtggga 
acgccctcga 
acgacgacca 
caccaccacc 
ctgccggctg 
ggttccggaa 
ggcactgcgg 
aatattaatt 
ggccccactc 
cagccctgtc 



cgttagatct 
agtgcgatcc 
ccatctgcaa 
tcagcgtact 
gcatgttcgg 
taagtgcagc 
gcaatggtcc 
cgggtgcgcc 
atcccggcct 
gcagcacgac 
ccaagcatcc 
acaaccacca 
gcggtaatgc 
ctgatgacgg 
ccagtggatc 
tgaacgcagc 
atgcccgcaa 
ctggcggagc 



gtccaagtac 
cggcgatgtg 
tgaccacggg 
gaactcaaga 
ggatccctgt 
ccaaacttcg 
gccgatcttt 
gcccccgccg 
gttcaacgta 
agccgtgggc 
ggctggccgc 
cggtgaagac 
cacttcacca 
aaccctactg 
cgttgtcccc 
cgggatgtcc 
tttgtactgg 
agccggcatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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gccaagtggc 
gatgacatca 
agctgcgagc 
cccacctggc 
ctggaaatgc 
gaagtgacca 
caaacagtgt 
gaggctatga 
gaatcgcagc 
tccttgctaa 
cgcagcgtgg 
actatccagt 
attgagctgc 
ttcgccgcct 
aagagctccc 
caacagcgcc 
cacatacaac 
acgaatccca 
agtctggagt 
gccatactaa 
aacatgcgca 
ctgctaacgc 
ttgccgcatc 
gagtccttga 
aacacgaact 
aatgtccaac 
gttgtgctgt 
aatctgcatc 
gctgcactgc 
cacaactatc 
ccgcagccac 
acgaacaaca 
atttaccttt 
acaagcattt 
gcctggttct 
ctggaggtgg 
ctgagcgtgg 
tgcgtgctga 
ttctttgtgg 
cgcacgggtc 
tgctcctttg 
cgaggagcca 
aacacattgc 
cgggagtatc 
aaaacatcaa 
agcgtctcca 
gatccccagc 
acctctgatt 
acgggcacac 
agcactctcc 
atttttgcgc 
taccatcagc 
acaccgcggg 
gaggtattct 
gccggaggag 
caggtgcttt 
caatacgcgg 
cagcagcaac 
caactctcct 
cgagctggta 



gttgcgttct 
gttcgacaac 
cgccggtgag 
atcccgcgac 
gagtaaacca 
gtagcaaaac 
ccgagaagat 
tcatggagtt 
tgtcctcgtg 
ctggcctgga 
tgcaaaaagt 
ccagcgtggt 
cacgagctgc 
tcgatcgcct 
gcagttacgc 
tacgcatcct 
tctcccagcc 
cctgcgtgtt 
ccactaaccg 
tggacgttgt 
tattcatcta 
tgaagctgtt 
agcggtcggg 
ccctgaccac 
ttatccaaca 
agcagcagca 
ccaatcagcc 
agcacggtca 
agcaacatgc 
tgcagcagca 
atccgcaccg 
tcattatgca 
gcctcctggc 
tctgcggctt 
gttacgaagc 
accagacgcc 
tggccatctc 
tggaggcgaa 
ctgccattgg 
taaagaccaa 
tgttcctctt 
aaatggacga 
tggggctcta 
ggaagtatgt 
tttcctcggg 
cgtccaagaa 
agcaacagca 
gtgaactgaa 
tacaggctcc 
gctccactgg 
acaagcaaca 
cggactacta 
agtactacaa 
actggactca 
ttcccgcctc 
tctatccatc 
aggcgttgga 
agttgcgtcg 
cggacgagga 
gccagcagca 



aatgaagagg 
tccggcaccc 
catggcccac 
acctgatctg 
gcgggactcc 
gctctacggc 
gatgcacgac 
gttgcattgt 
gttggatctc 
atacaatgcc 
caaaaatata 
cttcccagat 
tctaatagat 
ggaatccatt 
catcctgagc 
gaacagcaag 
cataaccctg 
ctggaactat 
cacgcacagc 
ggatgagcac 
cataagcatc 
caatggggtc 
cagcagacgc 
gccaaccagt 
caattccatc 
gcaacagcaa 
gcagcgaaat 
gcagacggct 
agttcaggcc 
gcatgtccag 
caatcacaac 
ggccaacatg 
catcgagctg 
cattactatt 
cttccattcg 
caaggtgaac 
gctggtcatc 
tgccttgttt 
ttacacattc 
ggaacatact 
gctgctcagc 
tgacacggct 
tatcttcgtg 
gagacagcac 
cattgttacc 
gcccaagctg 
gacaccagtg 
cgaggcccag 
accgcagacc 
tcatgcctca 
acaacaacag 
cagctggaag 
taatgcgggt 
aaagccgaac 
gcctagtgga 
gtacaagaag 
cccaccacta 
ccagcagcta 
gcaggccgag 
gctccctgct 



atacccgact 
agcggtggcg 
aaggtaaacc 
acgcaatgcc 
tccttgatct 
ggcgacatgt 
aaggagacct 
gtggtcaaaa 
aatccggagg 
tttctgctgg 
ttgctctccg 
tcggatcagt 
aatagtgaag 
ctaaagccca 
aacgacagcg 
gtgatctcgg 
acactgaaac 
attgaccatg 
gtctgcagtt 
cagcattcgt 
ggcatctgcg 
tttgtgaagg 
cagcaaaaca 
cagtccaatg 
cgcaactcac 
caacaagttg 
atgcaacatg 
gctgctgcag 
agcaacgcca 
cagcagcagc 
cttaatgtgg 
gacgacttac 
ctctttctcc 
ttcctacact 
tactcaacgc 
tgctactacc 
gatcccagca 
tatgccacct 
ctctcctgga 
cgcctcgcta 
gctgtttggt 
gatgtgtatg 
ttccattgca 
gcttggctgc 
ggcaatggac 
ccgttaggag 
cccatcacag 
caaagaagaa 
cttggtggcc 
cccaccagct 
cagcaacagg 
caaccatcaa 
gctgctgcat 
agcggccaca 
tctctgcaca 
accaagcctg 
gccactggca 
catcagcagc 
caacatgctc 
ccaccgccac 



ccggctacga 
actgtctgca 
agcgtctgcg 
gcagcctttg 
ccatcgccaa 
tggtcaccac 
tcccggatca 
ccggctccaa 
accaaatgcg 
cggatacgat 
ttcgagttct 
ggcccttgag 
gcggtctggt 
gctatgatca 
atgtcaacgc 
ccagcttggg 
atctgaagac 
cgtggtctgc 
gcaaccacct 
tgttcaccat 
tggtcttcat 
taagaaacgg 
atattcgcga 
tgccgccgcc 
accgcaacaa 
ttgcggcagc 
ccatgaacaa 
ctgctgctgc 
gcaataacct 
agcagcagcg 
acggcaatgg 
aatataagtg 
tgggcattga 
gtgccatcct 
tcacctcgga 
tcttgtccta 
cctataccca 
ttgtaatacc 
ttataatgtg 
gcgtgcggtt 
gctcggccta 
gatactgctt 
ttcaaaacga 
ccaagtgctt 
ccacagccgg 
tgagcgaaga 
aggatgccat 
ccctaaaaag 
atgttgtgct 
ctgccgggtc 
gacctttggg 
ctggaacagg 
catcgccgca 
atggcaaaaa 
gtcgcacggc 
gccagccaac 
atgcggctgc 
agcaacagca 
acctgttgca 
acatggcgca 



tgagtacgat 
caacagcagc 
caactttgag 
gctcaacaat 
cgatatgtcc 
gaagattatc 
gcgacagcgc 
cctgctggac 
tgtagccaca 
catcagggag 
ggaaaccaag 
ttcggatcgt 
gcgaattgta 
cttcgatctc 
gggggagatc 
caaggggcgt 
cgagaatgta 
caacggatgc 
gacaaacttt 
gttcgatgga 
agttatcgcc 
ctccaatccc 
ccagacccac 
cagtcatggg 
tctgaactac 
tgctgctgct 
cttgaatctg 
ctcaata'gcc 
caacatcagc 
tggccagcag 
attggataac 
gtggtggtgc 
acagaccgaa 
atcgggcacc 
cgagctcctg 
cggactgtcg 
aaacgattat 
agtgcttgtc 
ccgcaaaagt 
cgacatacgc 
cttctatttg 
catctgcttc 
aaagatccgg 
gcgctgctcg 
aaccctttgc 
ggcgcatgac 
tatgggagcc 
tggcctaatg 
cgaaagaggt 
cacacacctg 
cgagtcttac 
aggattgaaa 
acaggcgcac 
gaagaggggc 
cgccgcctcc 
aggctatccg 
ctactaccag 
gcagcagcag 
cctgcaacga 
gtaccagcag 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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gagtttatgc 
gcctactaca 
ctcagcaacc 
gtgtacaaca 
ggcggtagtc 
cagcagcagc 
cgctgcacgg 
gcggagcaat 
gaggatgatg 
atgcaggacg 
cttgactacg 
tttggggtta 
cagcaagcac 
cacggcaact 
cagctacaga 
aatcccaatc 
catcagcagc 
gaaccactgg 



agcgccagta 
accaaggcag 
gcaactcgga 
atagcgttgg 
ggtacggcgg 
tcaagaagca 
cggatgacga 
tgcacgacag 
ctcatggatt 
aggattttaa 
gggctttgcc 
gcggcggcgt 
tgaatgccaa 
cgggaccacc 
aactgtcacc 
cgcatcccca 
gccacctgtc 
ctaagtga 



tagaaataag 
cgtcggcggc 
tgtgcagcat 
cactggcgtc 
cggaagcctg 
gcagcaacaa 
tgacgacgaa 
cgtctgtgat 
gccaccgcag 
gcggcggttt 
accaggagca 
tggtgagggc 
gtcgccaggc 
gctgccgccg 
acagtccacc 
ccagctaacc 
ggccatgctc 



cattccaact 
gcggatggtg 
tacgaggtgg 
ttcaacaaca 
agtggcggca 
cagtcgctgg 
gacgaggagg 
gaggatgagg 
agcgatgagc 
caacgtcagc 
ggaccgcaac 
tccctgcgtg 
ggccgtttgg 
gcgaatcaga 
acatcatcgt 
catccgcatc 
gatgagaaca 



gtgatctggg 
ggccggtcta 
gtgacttcga 
tgagagcggc 
gtgtctcgtc 
ctcagcaaag 
aggatgagga 
aggaggacga 
ggatgcgtcg 
tgcgcaaaca 
ccgagcacaa 
gcgcattccg 
cggtaaatga 
cgcccgcgca 
cgtcgcatac 
cccatcagca 
acacggtgcg 



catgggcgat 
cgaggagatc 
tgtggacgag 
ggtggccgcc 
caggagccaa 
atcggctcga 
ggcaacggcc 
gagcgacttg 
tctgatggcg 
tggagcgcct 
cggtgcggtt 
gcagcagcag 
gctattcggt 
gaagcgccag 
atcacacagc 
tcctccgcac 
atgttatctg 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5718 



<210> 9 
<211> 1905 
<212> PRT 
<213> Drosophila 

<400> 9 

Met Leu Pro Thr lie 

1 5 
Leu Ser Lys Tyr Gin 
20 

lie Glu Cys Asp Pro 
35 

Gly Arg Phe Ser lie 
50 

Ser Val Asn Cys Met 
65 

Cys Ala His Lys Gin 
85 

Gly Asp Pro Cys Pro 
100 

Cys lie Ser Ala Ala 
115 

Pro Pro Trp Val Leu 
130 

Gly Leu He His Pro 
145 

Arg Leu Pro Thr Leu 
165 

Leu Phe Asn Val Pro 
180 

Ser Ser Ser Thr Thr 
195 

Thr Ser Thr Thr Thr 
210 

Pro Pro Pro Pro Gin 
225 

Thr Ala Ser Pro Thr 
245 

Ala Thr Ser Pro Ser 



Leu Ser He 

Thr Ala Tyr 

Gly Asp Val 
40 

Thr He Cys 
55 

Phe Pro Lys 
70 

Ser Cys Gly 

Gly Thr His 

Gin Thr Ser 
120 

Ser Asn Gly 

135 

Pro Gly Val 
150 

Pro Gly Val 

Pro Gin His 

Ala Val Gly 
200 

Thr Lys His 

215 

Leu His His 
230 

Lys Pro Ser 

Asn Thr Arg 



Ser Tyr 

10 
Ala Cys 
25 

He Asn 

Asn Asp 

Ser Leu 

Val Leu 
90 

Lys Tyr 
105 

Thr Thr 

Pro Pro 

Gly Ala 

Val Gly 
170 
Thr Ala 
185 

Gly Gly 

Pro Ala 

His His 

Ser Lys 
250 
He Leu 



Glu His He 

Glu Gly Lys 

Leu He Arg 
45 

His Gly Asn 
60 

Ser Val Leu 
75 

Ala Ala Thr 

Leu Glu Ala 

Thr Asn Arg 
125 

He Phe Gly 

140 
Gly Ala Pro 
155 

He Ser Gly 

Val Thr His 

Arg Leu Lys 
205 

Gly Arg His 

220 
Asn His His 
235 

Leu Pro Ala 



Ser 

Lys 

30 

Ala 

Val 

Asn 

Ser 

His 
110 
Pro 

Asn 

Pro 

Asn 

Ser 
190 
Gly 



Asp 
Gly 
Gly 

Thr Gly Val Gly 



Leu Asp 
15 

Leu Thr 

Asn Tyr 

Glu Trp 

Ser Arg 

80 
Met Phe 
95 

Tyr Gin 

Ser Pro 

Gly Ser 

Pro Pro 
160 
Pro Gly 
175 

Thr Pro 

Gly Ala 

Gly Leu 

Glu Asp 
240 
Gly Asn 
255 

Gly Ser 
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260 










265 










270 






Gly 


Thr 


Asp 


Asp 


Gly 


Thr 


Leu 


Leu 


Thr 


Thr 


Lys 


Ser 


Ser 


Pro 


Asn 


Arg 






275 










280 










285 








Pro 


Pro 


Gly 


Thr 


Ala 


Ala 


Ser 


Gly 


Ser 


Val 


Val 


Pro 


Gly 


Asn 


Gly 


Ser 




290 










295 










300 










Val 


Val 


Arg 


Thr 


lie 


Asn 


Asn 


He 


Asn 


Leu 


Asn 


Ala 


Ala 


Gly 


Met 


Ser 


305 










310 










315 










320 


Gly 


Gly 


Asp 


Asp 


Glu 


Ser 


Lys 


Leu 


Phe 


Cys 


Gly 


Pro 


Thr 


His 


Ala 


Arg 










325 










330 










335 




Asn 


Leu 


Tyr 


Trp 


Asn 


Met 


Thr 


Arg 


Val 


Gly 


Asp 


Val 


Asn 


Val 


Gin 


Pro 








340 










345 










350 






Cys 


Pro 


Gly 


Gly 


Ala 


Ala 


Gly 


He 


Ala 


Lys 


Trp 


Arg 


Cys 


Val 


Leu 


Met 




355 










360 










365 








Lys 


Arg 


lie 


Pro 


Asp 


Ser 


Gly 


Tyr 


Asp 


Glu 


Tyr 


Asp 


Asp 


Asp 


He 


Ser 




370 










375 










380 










Ser 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 


Gly 


Gly 


Asp 


Cys 


Leu 


His 


Asn 


Ser 


Ser 


385 










390 










395 










400 


Ser 


Cys 


Glu 


Pro 


Pro 


Val 


Ser 


Met 


Ala 


His 


Lys 


val 


Asn 


Gin 


Arg 


Leu 








405 










410 










415 




Arg 


Asn 


Phe 


Glu 


Pro 


Thr 


Trp 


His 


Pro 


Ala 


Thr 


Pro 


Asp 


Leu 


Thr 


Gin 








420 










425 










430 






Cys 


Arg 


Ser 


Leu 


Trp 


Leu 


Asn 


Asn 


Leu 


Glu 


Met 


Arg 


Val 


Asn 


Gin 


Arg 






435 










440 










445 








Asp 


Ser 


Ser 


Leu 


He 


Ser 


He 


Ala 


Asn 


Asp 


Met 


Ser 


Glu 


Val 


Thr 


Ser 


450 










455 










4 60 










Ser 


Lys 


Thr 


Leu 


Tyr 


Gly 


Gly 


Asp 


Met 


Leu 


Val 


Thr 


Thr 


Lys 


He 


He 


465 










470 










475 










480 


Gin 


Thr 


Val 


Ser 


Glu 


Lys 


Met 


Met 


His 


Asp 


Lys 


Glu 


Thr 


Phe 


Pro 


Asp 










485 










490 










495 




Gin 


Arg 


Gin 


Arg 


Glu 


Ala 


Met 


He 


Met 


Glu 


Leu 


Leu 


His 


Cys 


Val 


Val 








500 










505 










510 






Lys 


Thr 


Gly 


Ser 


Asn 


Leu 


Leu 


Asp 


Glu 


Ser 


Gin 


Leu 


Ser 


Ser 


Trp 


Leu 




515 










520 










525 








Asp 


Leu 


Asn 


Pro 


Glu 


Asp 


Gin 


Met 


Arg 


Val 


Ala 


Thr 


Ser 


Leu 


Leu 


Thr 




530 










535 










540 










Gly 


Leu 


Glu 


Tyr 


Asn 


Ala 


Phe 


Leu 


Leu 


Ala 


Asp 


Thr 


He 


He 


Arg 


Glu 


545 










550 










555 










5 60 


Arg 


Ser 


Val 


Val 


Gin 


Lys 


Val 


Lys 


Asn 


He 


Leu 


Leu 


Ser 


Val 


Arg 


Val 










565 










570 










575 




Leu 


Glu 


Thr 


Lys 


Thr 


He 


Gin 


Ser 


Ser 


Val 


Val 


Phe 


Pro 


Asp 


Ser 


Asp 








580 










585 










590 






Gin 


Trp 


Pro 


Leu 


Ser 


Ser 


Asp 


Arg 


He 


Glu 


Leu 


Pro 


Arg 


Ala 


Ala 


Leu 






595 










600 










605 








lie 


Asp 


Asn 


Ser 


Glu 


Gly 


Gly 


Leu 


Val 


Arg 


He 


Val 


Phe 


Ala 


Ala 


Phe 




610 










615 










620 










Asp 


Arg 


Leu 


Glu 


Ser 


He 


Leu 


Lys 


Pro 


Ser 


Tyr 


Asp 


His 


Phe 


Asp 


Leu 


625 










630 










635 










640 


Lys 


Ser 


Ser 


Arg 


Ser 


Tyr 


Ala 


He 


Leu 


Ser 


Asn 


Asp 


Ser 


Asp 


Val 


Asn 










645 










650 










655 




Ala 


Gly 


Glu 


He 


Gin 


Gin 


Arg 


Leu 


Arg 


He 


Leu 


Asn 


Ser 


Lys 


Val 


He 








660 










665 










670 






Ser 


Ala 


Ser 


Leu 


Gly 


Lys 


Gly 


Arg 


His 


He 


Gin 


Leu 


Ser 


Gin 


Pro 


He 






675 










680 










685 








Thr 


Leu 


Thr 


Leu 


Lys 


His 


Leu 


Lys 


Thr 


Glu 


Asn 


Val 


Thr 


Asn 


Pro 


Thr 




690 










695 










700 










Cys 


Val 


Phe 


Trp 


Asn 


Tyr 


He 


Asp 


His 


Ala 


Trp 


Ser 


Ala 


Asn 


Gly 


Cys 


705 










710 










715 










720 


Ser 


Leu 


Glu 


Ser 


Thr 


Asn 


Arg 


Thr 


His 


Ser 


Val 


Cys 


Ser 


Cys 


Asn 


His 










725 










730 










735 




Leu 


Thr 


Asn 


Phe 


Ala 


He 


Leu 


Met 


Asp 


Val 


Val 


Asp 


Glu 


His 


Gin 


His 



SUBSTITUTE SHEET (RULE 26) 





WO 01/070980 



PCTAJS01/09341 



740 



745 



750 



Ser Leu Phe Thr Met Phe Asp Gly Asn Met Arg lie Phe lie Tyr lie 

755 760 765 

Ser He Gly He Cys Val Val Phe He Val He Ala Leu Leu Thr Leu 

770 775 780 

Lys Leu Phe Asn Gly Val Phe Val Lys Val Arg Asn Gly Ser Asn Pro 
785 790 795 800 

Leu Pro His Gin Arg Ser Gly Ser Arg Arg Gin Gin Asn Asn He Arg 

805 810 815 

Asp Gin Thr His Glu Ser Leu Thr Leu Thr Thr Pro Thr Ser Gin Ser 

820 825 830 

Asn Val Pro Pro Pro Ser His Gly Asn Thr Asn Phe He Gin His Asn 

835 840 845 

Ser He Arg Asn Ser His Arg Asn Asn Leu Asn Tyr Asn Val Gin Gin 

850 855 860 

Gin Gin Gin Gin Gin Gin Gin Gin Val Val Ala Ala Ala Ala Ala Ala 
865 870 875 880 

Val Val Leu Ser Asn Gin Pro Gin Arg Asn Met Gin His Ala Met Asn 

885 890 895 

Asn Leu Asn Leu Asn Leu His Gin His Gly Gin Gin Thr Ala Ala Ala 

900 905 910 

Ala Ala Ala Ala Ala Ser He Ala Ala Ala Leu Gin Gin His Ala Val 

915 920 925 

Gin Ala Ser Asn Ala Ser Asn Asn Leu Asn He Ser His Asn Tyr Leu 

930 935 940 

Gin Gin Gin His Val Gin Gin Gin Gin Gin Gin Gin Arg Gly Gin Gin 
945 950 955 960 

Pro Gin Pro His Pro His Arg Asn His Asn Leu Asn Val Asp Gly Asn 

965 970 975 

Gly Leu Asp Asn Thr Asn Asn He He Met Gin Ala Asn Met Asp Asp 

980 985 990 

Leu Gin Tyr Lys Trp Trp Trp Cys He Tyr Leu Cys Leu Leu Ala He 

995 1000 1005 

Glu Leu Leu Phe Leu Leu Gly He Glu Gin Thr Glu Thr Ser He Phe 

1010 1015 1020 

Cys Gly Phe He Thr He Phe Leu His Cys Ala He Leu Ser Gly Thr 
1025 1030 1035 104 

Ala Trp Phe Cys Tyr Glu Ala Phe His Ser Tyr Ser Thr Leu Thr Ser 

1045 1050 1055 

Asp Glu Leu Leu Leu Glu Val Asp Gin Thr Pro Lys Val Asn Cys Tyr 

1060 1065 1070 

Tyr Leu Leu Ser Tyr Gly Leu Ser Leu Ser Val Val Ala He Ser Leu 

1075 1080 1085 

Val He Asp Pro Ser Thr Tyr Thr Gin Asn Asp Tyr Cys Val Leu Met 

1090 1095 1100 

Glu Ala Asn Ala Leu Phe Tyr Ala Thr Phe Val He Pro Val Leu Val 
1105 1110 1115 112 

Phe Phe Val Ala Ala He Gly Tyr Thr Phe Leu Ser Trp He He Met 

1125 1130 1135 

Cys Arg Lys Ser Arg Thr Gly Leu Lys Thr Lys Glu His Thr Arg Leu 

1140 1145 1150 

Ala Ser Val Arg Phe Asp He Arg Cys Ser Phe Val Phe Leu Leu Leu 

1155 1160 1165 

Leu Ser Ala Val Trp Cys Ser Ala Tyr Phe Tyr Leu Arg Gly Ala Lys 

1170 1175 1180 

Met Asp Asp Asp Thr Ala Asp Val Tyr Gly Tyr Cys Phe He Cys Phe 
1185 1190 1195 120 

Asn Thr Leu Leu Gly Leu Tyr He Phe Val Phe His Cys He Gin Asn 

1205 1210 1215 

Glu Lys He Arg Arg Glu Tyr Arg Lys Tyr Val Arg Gin His Ala Trp 
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1220 1225 1230 

Leu Pro Lys Cys Leu Arg Cys Ser Lys Thr Ser lie Ser Ser Gly He 

1235 1240 1245 

Val Thr Gly Asn Gly Pro Thr Ala Gly Thr Leu Cys Ser Val Ser Thr 

1250 1255 1260 

Ser Lys Lys Pro Lys Leu Pro Leu Gly Val Ser Glu Glu Ala His Asp 
1265 1270 1275 1280 

Asp Pro Gin Gin Gin Gin Gin Thr Pro Val Pro He Thr Glu Asp Ala 

1285 1290 1295 

He Met Gly Ala Thr Ser Asp Cys Glu Leu Asn Glu Ala Gin Gin Arg 

1300 1305 1310 

Arg Thr Leu Lys Ser Gly Leu Met Thr Gly Thr Leu Gin Ala Pro Pro 

1315 1320 1325 

Gin Thr Leu Gly Gly His Val Val Leu Glu Arg Gly Ser Thr Leu Arg 

1330 1335 134'0 

Ser Thr Gly His Ala Ser Pro Thr Ser Ser Ala Gly Ser Thr His Leu 
1345 1350 1355 1360 

He Phe Ala His Lys Gin Gin Gin Gin Gin Gin Gin Gin Gly Pro Leu 

1365 1370 1375 

Gly Glu Ser Tyr Tyr His Gin Pro Asp Tyr Tyr Ser Trp Lys Gin Pro 

1380 1385 1390 

Ser Thr Gly Thr Gly Gly Leu Lys Thr Pro Arg Glu Tyr Tyr Asn Asn 

1395 1400 1405 

Ala Gly Ala Ala Ala Ser Ser Pro Gin Gin Ala His Glu Val Phe Tyr 

1410 1415 1420 

Trp Thr Gin Lys Pro Asn Ser Gly His Asn Gly Lys Lys Lys Arg Gly 
1425 1430 1435 1440 

Ala Gly Gly Val Pro Ala Ser Pro Ser Gly Ser Leu His Ser Arg Thr 

1445 1450 1455 

Ala Ala Ala Ser Gin Val Leu Phe Tyr Pro Ser Tyr Lys Lys Thr Lys 

1460 1465 1470 

Pro Gly Gin Pro Thr Gly Tyr Pro Gin Tyr Ala Glu Ala Leu Asp Pro 

1475 1480 1485 

Pro Leu Ala Thr Gly Asn Ala Ala Ala Tyr Tyr Gin Gin Gin Gin Gin 

1490 1495 1500 

Leu Arg Arg Gin Gin Leu His Gin Gin Gin Gin Gin Gin Gin Gin Gin 
1505 1510 1515 1520 

Gin Leu Ser Ser Asp Glu Glu Gin Ala Glu Gin His Ala His Leu Leu 

1525 1530 1535 

His Leu Gin Arg Arg Ala Gly Ser Gin Gin Gin Leu Pro Ala Pro Pro 

1540 1545 1550 

Pro His Met Ala Gin Tyr Gin Gin Glu Phe Met Gin Arg Gin Tyr Arg 

1555 1560 1565 

Asn Lys His Ser Asn Cys Asp Leu Gly Met Gly Asp Ala Tyr Tyr Asn 

1570 1575 1580 

Gin Gly Ser Val Gly Gly Ala Asp Gly Gly Pro Val Tyr Glu Glu lie 
1585 1590 1595 1600 

Leu Ser Asn Arg Asn Ser Asp Val Gin His Tyr Glu Val Gly Asp Phe 

1605 1610 1615 

Asp Val Asp Glu Val Tyr Asn Asn Ser Val Gly Thr Gly Val Phe Asn 

1620 1625 1630 

Asn Met Arg Ala Ala Val Ala Ala Gly Gly Ser Arg Tyr Gly Gly Gly 

1635 1640 1645 

Ser Leu Ser Gly Gly Ser Val Ser Ser Arg Ser Gin Gin Gin Gin Leu 

1650 1655 1660 

Lys Lys Gin Gin Gin Gin Gin Ser Leu Ala Gin Gin Arg Ser Ala Arg 
1665 1670 1675 1680 

Arg Cys Thr Ala Asp Asp Asp Asp Asp Glu Asp Glu Glu Glu Asp Glu 

1685 1690 1695 

Glu Ala Thr Ala Ala Glu Gin Leu His Asp Ser Val Cys Asp Glu Asp 
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1700 1705 1710 

Glu Glu Glu Asp Glu Ser Asp Leu Glu Asp Asp Ala His Gly Leu Pro 

1715 1720 1725 

Pro Gin Ser Asp Glu Arg Met Arg Arg Leu Met Ala Met Gin Asp Glu 

1730 1735 1740 

Asp Phe Lys Arg Arg Phe Gin Arg Gin Leu Arg Lys His Gly Ala Pro 
1745 1750 1755 1760 

Leu Asp Tyr Gly Ala Leu Pro Pro Gly Ala Gly Pro Gin Pro Glu His 

1765 1770 1775 

Asn Gly Ala Val Phe Gly Val Ser Gly Gly Val Gly Glu Gly Ser Leu 

1780 1785 1790 

Arg Gly Ala Phe Arg Gin Gin Gin Gin Gin Ala Leu Asn Ala Lys Ser 

1795 1800 1805 

Pro Gly Gly Arg Leu Ala Val Asn Glu Leu Phe Gly His Gly Asn Ser 

1810 . 1815 1820 

Gly Pro Pro Leu Pro Pro Ala Asn Gin Thr Pro Ala Gin Lys Arg Gin 
1825 1830 1835 1840 

Gin Leu Gin Lys Leu Ser Pro Gin Ser Thr Thr Ser Ser Ser Ser His 

1845 1850 1855 

Thr Ser His Ser Asn Pro Asn Pro His Pro His Gin Leu Thr His Pro 

1860 1865 1870 

His Pro His Gin His Pro Pro His His Gin Gin Arg His Leu Ser Ala 

1875 1880 1885 

Met Leu Asp Glu Asn Asn Thr Val Arg Cys Tyr Leu Glu Pro Leu Ala 
1890 1895 1900 

Lys 
1905 



<210> 10 
<211> 4914 
<212> DNA 
<213> Drosophila 



<400> 10 

cgcgggctcc 

ctagttagtt 

ggaaaaccct 

ttcatgatat 

aggcattacg 

tcggcatcgg 

tttgtttttg 

ctctctaact 

tgttatgtac 

ttgaatttgg 

acattaagta 

tgtagatgac 

ggggttatat 

caactaaaat 

accgtttccc 

tatagtagag 

agcatttcct 

catactgctg 

tacctggcgg 

gatggaactg 

ggtgatgcac 

taccaggcga 

ttcgcaccga 

tactatgccc 

cattgctgac 



atacgattcg 
ggacatttag 
cagtaattcc 
tgactcgttt 
gctacgattg 
gagaagacag 
aagacatgaa 
caaaaaaaaa 
gcaaaggcga 
ttacaccaaa 
cacacaggaa 
ctatgacctc 
acacagtaca 
cgaaataacg 
cacccattta 
gattgacttt 
gccctgaaat 
caagtgggga 
ataatacagc 
atgatgctgc 
gctcctgccc 
agtcttcagt 
atggtaggag 
gtccaggtgt 
ttgcctcttg 



tattagcccg 
aagtaggact 
gccaaagttc 
cagttaagtc 
caattgcgtt 
caaagttgga 
catgcgtatc 
gaatttatat 
cacacattaa 
atggtaaaaa 
aataggatat 
gcgataaagg 
tagatcggtt 
atatcgttat 
agttgctttg 
ggcacagcag 
ggtacctaat 
tagtagctgc 
cgatgcgagg 
tgctgctggt 
agcgaatagt 
ttgtacggac 
accaccgtgc 
ccagttgctg 
tagagagtgg 



caattgtata 
ttttctgtgc 
gttgctggaa 
cctaaagcta 
tttttttttc 
aacctagttt 
acagtatttg 
aaaaatgcta 
gtcctaatgt 
ggcaacgcaa 
gaagcccatc 
cccaaactaa 
taccccattg 
ctcacgatcg 
cctctttcga 
ctgctgcagg 
ccaattccgt 
cgtgctggct 
atgtggagtg 
gatgatgatg 
tggacatctg 
ttggcggcaa 
tattgcaagc 
ctccggcaga 
aaatgcttgg 



aaggttagtt 
gaaaatcagc 
aaggactctt 
atggcacact 
gattgttaag 
ctggttccca 
gttgctgcgc 
aatatatata 
ctaaaaggca 
ggggcaacta 
tagggcaatt 
agcactaaga 
tgttctatgg 
ttttatgaca 
tacccgaaac 
aacattgact 
ctagagtagc 
gtacttttgc 
atgctggtga 
atgcggcgaa 
attcagcgat 
cggaacactg 
cgcatgcgat 
cgtatgcttc 
tccaagtacc 



agtttcttag 
agaagttgaa 
ttaaggtaat 
gaatacaatt 
tacacttgtt 
tttttatttt 
gtttctctct 
tgtatatatg 
agagcgatta 
attaggagca 
acaaatacat 
gcacatttcc 
acaggcaaag 
aacaaattgg 
cgcattaagt 
ggaagcagga 
gtctcattgc 
gaggcatagc 
ttgtggtggg 
tggcgctgct 
atagagccgg 
gccatagaag 
ccacgcgagc 
accattttgc 
caggcaaaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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agcaaatgcc cagcatcagt tccatgaagg cgcgcaacaa aaatggccaa agtggcgtat 1560 
tggtatcacc cgatcccagc cagacatcga tgttggcaaa ctcgtaaatg acagccaaca 1620 
gaagcatggc acatgcaagt ccatagatga aaaggaatat gccaatgccg gaactactgt 1680 
gtgcgcttag ctgttgcagt tgctgttgca cgctgagtgc attgctattc ctcaggatct 1740 
ccttggtgcg acagtgtacc aaatagccgg agatcagatt catcgagcca aagatccagt 1800 
agcaaaaaac gggcgtagcc accaatatct gtagcgcctt atccgactga ttgcctacga 1860 
aacaggcgcc tagaagagcg tatggttggg ttactctccg aattctccgg cggatctata 1920 

gcatatctca ccaagcaact cgtcagcatc cacgaaacgg gcaacgatca ctgctgacgt 1980 
ttgaaaggcc ggcaatcccc aggccacgaa gcacacaaag ttgttttgcg ttagatcctg 2040 
ctgagccgtc ttggcactgt tcctcttcgc cggactgcct ccaaagttcg agggaatgac 2100 
cacatggccc ttggagtccg gactgtgccg acgaatgtcg cggtgccagc ccaaacagag 2160 
cacagcccac ctacaaatag gatatgggaa tatagtttct ataatattaa gtatatatta 2220 
taatctaata ccaagcgcag gcggccattc caaaatagta gcgcataagg aagacactag 2280 
cgcaggaggc attggacagt ccgtccacgg tgagtaggga ttcattgggc gcctggggat 2340 
ccgtgccgca cgcagttccc gttcttccca ccataaagcg cactgcccag cccaaggtga 2400 
ccatgttgtg gcaccaaatc agcggcgaca acaatctgga ccacttggcg ctggccaaac 2460 
gactgccatc gctggccaaa agacacacgg tggctaccag ggccaaaccc aaagcggcat 2520 
aggcccatgt ggagacccag atctctgcca gatgcttctc cgccggtgtg aacagtatgt 2580 
ccgcctcgca gagtggtgca cagcgtccag atctgggtag ccttacgtac agatgcgatt 2640 
tggccagtcc cgagcaatcc atgggcagtt taccgcccaa tccaccgggt attccttggc 2700 
ccggtagccc atacaaatcc tgctcctgct gcggctgatg caattcaccg ggaccttfcca 2760 
tacacatcgt ctcgtggttg ttctctcgcg gaaatttatc gcagtcgaga gccggtggcc 2820 
atggaaatcc gaatccctgc aaaactggat gacaacgtat tcgcacggac tcacagagac 2880 
tccggcacgg accgatggca tggactggag ctttgggcgt gcacatgggc acgtaggcgg 2940 
cgcacaggaa cagcttcaac tgggaactgc agtcgtattc gatcagtggt gcaaaggttt 3000 
gcagtgtgta ctccacgtcc gtttgcattt cgttacccac cagatttggc atggaggtct 3060 
cattgtagcc gatcttgcgg cacatctcaa tgcgaatggt ttcgcattgc cggaaggcgg 3120 
ggatttctgg cggagaactg gaactggaac tggcactggg gttgccgctc gttgaggatt 3180 
ttgagatgcg cggatgcagg aggatcacca ccaggagaca caatatgcag gttggcttca 3240 
ttgcggctga tttgttgttc tttcgttccg aagaatacga attcgatacg tttcttcgct 3300 
tggtgttccg atttccgttt ctctaaagga atcgaaatgt ggaaaagatc agatttcggg 3360 
gttttcttcg cgctgggttc cctcttttct ttccaaagaa tccgaactat atgcagattc 3420 
ttcgttcttc ggtggaacga acgctggaag gggtaacctg ttccaggtgc gttcgccctc 3480 
tctcgctcgc accctctcac tcgttccaac aacttccagc tctccttctt tccagcttgt 3540 
tattgttgct tttggcttgt caactcgtta actacctctt ttttttcggc gtcgttcttg 3600 
ttatcgcttc tcctccttcc cccactcgtt cttcagcccc aaccactccc ctttctccac 3660 
cgcacgccgc ccaacccgct gatcgacgtc gataaacggg tttttcctca gggattaacc 3720 
cgaaaacgat ctagcaaacc atcgaatctt tcacacatct cagtgaacga atagtgtatt 3780 
tgtttcgatg tgcttaaacg aatggcgcag tcagtttgta gccttagcga atgtattcta 3840 

atctaattgg tattaacatc gaacgtttat atattttatt ttgttttcgt ggtaaaattc 3900 

aacgcatact ggtcactctg agctagagat ggctctgcgt cacgtcacct cgcatcgatt 3960 

gcaattgctc aacggatctg ctatcgatat caacaagtgt gttggttact aaatgctttt 4020 

agatcaaaag gcgccccctg gcggagttat gcgtaatcaa agaacattga gaagggcgcg 4080 

cttttcaaaa acaaatttgt ccaaataata agaaacattt aagcataata agaacataaa 4140 

tgttcaataa attctatagg tgcagtaaaa tacgtgtttt gctagcacag ctattttaag 4200 

ctgagaactt acaatatttt agtacattgt gcttaaggtt taaagataca tatattgaaa 4260 

aagttctgga aatgcttttg aagaacaaag ctttaagagg cgcgctttcc gagtatgtga 4320 

gcttttttca aatggcacac tttctgcgga atataaaata aaaaaaccta ataatatgcc 4380 

gcaagaccac ttaagataaa atattaaaaa tattaaaatt gattatattt tgtgcaccaa 4440 

aagtggcggt agcttaatat cgcaaaacca aattaatagc aattctgagc caaaccgcga 4500 

ttgccatttc tgcatgttgg tcctccgtcc gtttcgccat ctccctttgg ccaccgcttg 4560 

gcctctcttt cgcacacaca tggcaaacac aagccacgcc catgtatttc aacccccttg 4620 

acaaaaatac gtaacatgcc acttttatgt atgaaacgtg acaaatacgc acatgtgtga 4680 

gaaagagatc caccgagctg gcgagaaaga gagagagaga cgctggacgt ttatgtgttt 4740 

ggcaggactc gcgtatgcag tgtgtgtgtg cgtgtgtgtg tgagtgtgtg tgagtgtttg 4800 

ttgttggtta acgcacacgc gcagacgtag caacacatac acatatgtgg aaaggatatg 4860 

tacatacata catacataca tacatatgcc cgctgacttt ttcgccactg ccaa 4914 
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<210> 11 
<211> 2523 
<212> DNA 
<213> Drosophila 



<400> 11 

gaccagtatg 

aataccaatt 

agcacatcga 

ctagatcgtt 

ttgggcggcg 

aggagaagcg 

aagcgaagaa 

atgaagccaa 

aaatcctcaa 

cccgccttcc 

gagacctcca 

caaacctttg 

gccgcctacg 

agtctctgtg 

tggccaccgg 

atggaaggtc 

ggccaaggaa 

aaatcgcatc 

gacatactgt 

tatgccgctt 

cgtttggcca 

gtcaccttgg 

gatccccagg 

gctagtgtct 

ctctgtttgg 
gtcattccct 
caggatctaa 
acgtcagcag 
gataaggcgc 
atgaatctga 
aatgcactca 
ggcatattcc 
gagtttgcca 
tttttgttgc 
ggaccaagca 
acgtctgccg 
gcggcttgca 
ccgttgccgc 
aatcagatgt 
catcatcatc 
catcactcca 
tacagccagc 
tag 



cgttgaattt 
agattagaat 
aacaaataca 
ttcgggttaa 
tgcggtggag 
ataacaagaa 
acgtatcgaa 
cctgcatatt 
cgagcggcaa 
ggcaatgcga 
tgccaaatct 
caccactgat 
tgcccatgtg 
agtccgtgcg 
ctctcgactg 
ccggtgaatt 
tacccggtgg 
tgtacgtaag 
tcacaccggc 
tgggtttggc 
gcgccaagtg 
gctgggcagt 
cgcccaatga 
tccttatgcg 

gctggcaccg 
cgaactttgg 
cgcaaaacaa 
tgatcgttgc 
tacagatatt 
tctccggcta 
gcgtgcaaca 
ttttcatcta 
acatcgatgt 
gcgccttcat 
tttccactct 
gagcagcaac 
atagcacggt 
caagtccgta 
ccaactattc 
atcaccagca 
catcctcgca 
acggcagcta 



taccacgaaa 
acattcgcta 
ctattcgttc 
tccctgagga 
aaaggggagt 
cgacgccgaa 
ttcgtattct 
gtgtctcctg 
ccccagtgcc 
aaccattcgc 
ggtgggtaac 
cgaatacgac 
cacgcccaaa 
aatacgttgt 
cgataaattt 
gcatcagccg 
attgggcggt 
gctacccaga 
ggagaagcat 
cctggtagcc 
gtccagattg 
gcgctttatg 
atccctactc 
ctactatttt 

cgacattcgt 
aggcagt'ccg 
ctttgtgtgc 
ccgtttcgtg 
ggtggctacg 
tttggtacac 
gcaactgcaa 
tggacttgca 
ctggctggga 
ggaactgatg 
ctacaagagg 
tggacacctg 
ggtctcctac 
caaactgaag 
gctgggcagg 
gcagcagcat 
tcggctgtat 
ctatccccac 



acaaaataaa 
aggctacaaa 
actgagatgt 
aaaacccgtt 
ggttggggct 
aaaaaagaga 
tcggaacgaa 
gtggtgatcc 
agttccagtt 
attgagatgt 
gaaatgcaaa 
tgcagttccc 
gctccagtcc 
catccagttt 
ccgcgagaga 
cagcaggagc 
aaactgccca 
tctggacgct 
ctggcagaga 
accgtgtgtc 
ttgtcgccgc 
gtgggaagaa 
accgtggacg 
ggaatggccg 

cggcacagtc 
gcgaagagga 
ttcgtggcc't 
gatgctgacg 
cccgtttttt 
tgtcgcacca 
cagctaagcg 
tgtgccatgc 
tcgggtgata 
ctgggcattt 
caagtcagca 
gacgggcata 
cattcggtgc 
acttcgcctg 
agcgtgcatc 
catcagttcc 
tatccgccag 
ttgcagcagt 



atatataaac 
ctgactgcgc 
gtgaaagatt 
tatcgacgtc 
gaagaacgag 
gaaacggaaa 
agaacaacaa 
tcctgcatcc 
ccagttctcc 
gccgcaagat 
cggacgtgga 
agttgaagct 
atgccatcgg 
tgcagggatt 
acaaccacga 
aggatttgta 
tggattgctc 
gtgcaccact 
tctgggtctc 
ttttggccag 
tgatttggtg 
cgggaactgc 
gactgtccaa 
cctgcgcttg 

cggactccaa 
acagtgccaa 
ggggattgcc 
agttgcttgg 
gctactggat 
aggagatcct 
cacacagtag 
ttctgttggc 
ccaatacgcc 
gctgttttgc 
atggcaaaat 
gtagctcgcg 
gaacttctat 
gtaccggctc 
accagcagcg 
atccccacca 
gtagctatgc 
atggcaatga 



gttcgatgtt 
cattcgttta 
cgatggtttg 
gatcagcggg 
tgggggaagg 
tcggaacacc 
atcagccgca 
gcgcatctca 
gccagaaatc 
cggctacaat 
gtacacactg 
gttcctgtgc 
tccgtgccgg 
cggatttcca 
gacgatgtgt 
tgggctaccg 
gggactggcc 
ctgcgaggcg 
cacatgggcc 
cgatggcagt 
ccacaacatg 
gtgcggcacg 
tgcctcctgc 
gtgggctgtg 

gggccatgtg 
gacggctcag 
ggcctttcaa 
caatcagtcg 
ctttggctcg 
gaggaatagc 
ttccggcatt 
tgtcatttac 
actttggcca 
ctgggtactt 
ggtgaagcat 
tggatcgcat 
ggccagtgtt 
tatatcgctg 
ccattcgccg 
caatcaccag 
ctcgcaaaag 
gacgctactc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2523 



<210> 12 
<211> 700 
<212> PRT 
<213> Drosophila 



<400> 12 

Met Lys Pro Thr Cys lie Leu Cys Leu Leu Val Val lie Leu Leu His 

15 10 15 

Pro Arg lie Ser Lys Ser Ser Thr Ser Gly Asn Pro Ser Ala Ser Ser 
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20 










25 










30 






Ser 


Ser 


Ser 


Ser 


Pro 


Pro 


Glu 


He 


Pro 


Ala 


Phe 


Arg 


Gin 


Cys 


Glu 


Thr 






35 










40 










45 








He 


Arg 


He 


Glu 


Met 


Cys 


Arg 


Lys 


He 


Gly 


Tyr 


Asn 


Glu 


Thr 


Ser 


Met 




50 










55 










60 










Pro 


Asn 


Leu 


Val 


Gly 


Asn 


Glu 


Met 


Gin 


Thr 


Asp 


Val 


Glu 


Tyr 


Thr 


Leu 


65 










70 










75 










80 


Gin 


Thr 


Phe 


Ala 


Pro 


Leu 


He 


Glu 


Tyr 


Asp 


Cys 


Ser 


Ser 


Gin 


Leu 


Lys 










85 










90 










95 




Leu 


Phe 


Leu 


Cys 


Ala 


Ala 


Tyr 


Val 


Pro 


Met 


Cys 


Thr 


Pro 


Lys 


Ala 


Pro 








100 










105 










110 






Val 


His 


Ala 


He 


Gly 


Pro 


Cys 


Arg 


Ser 


Leu 


Cys 


Glu 


Ser 


Val 


Arg 


He 






115 










120 










125 








Arg 


Cys 


His 


Pro 


Val 


Leu 


Gin 


Gly 


Phe 


Gly 


Phe 


Pro 


Trp 


Pro 


Pro 


Ala 




130 










135 










140 










Leu 


Asp 


Cys 


Asp 


Lys 


Phe 


Pro 


Arg 


Glu 


Asn 


Asn 


His 


Glu 


Thr 


Met 


Cys 


145 










150 










155 










160 


Met 


Glu 


Gly 


Pro 


Gly 


Glu 


Leu 


His 


Gin 


Pro 


Gin 


Gin 


Glu 


Gin 


Asp 


Leu 








165 










170 










175 




Tyr 


Gly 


Leu 


Pro 


Gly 


Gin 


Gly 


He 


Pro 


Gly 


Gly 


Leu 


Gly 


Gly 


Lys 


Leu 






180 










185 










190 






Pro 


Met 


Asp 


Cys 


Ser 


Gly 


Leu 


Ala 


Lys 


Ser 


His 


Leu 


Tyr 


Val 


Arg 


Leu 






195 










200 










205 








Pro 


Arg 


Ser 


Gly 


Arg 


Cys 


Ala 


Pro 


Leu 


Cys 


Glu 


Ala 


Asp 


He 


Leu 


Phe 




210 










215 










220 










Thr 


Pro 


Ala 


Glu 


Lys 


His 


Leu 


Ala 


Glu 


He 


Trp 


Val 


Ser 


Thr 


Trp 


Ala 


225 










230 










235 










240 


Tyr 


Ala 


Ala 


Leu 


Gly 


Leu 


Ala 


Leu 


Val 


Ala 


Thr 


Val 


Cys 


Leu 


Leu 


Ala 








245 










250 










255 




Ser 


Asp 


Gly 


Ser 


Arg 


Leu 


Ala 


Ser 


Ala 


Lys 


Trp 


Ser 


Arg 


Leu 


Leu 


Ser 








260 










265 










270 






Pro 


Leu 


He 


Trp 


Cys 


His 


Asn 


Met 


Val 


Thr 


Leu 


Gly 


Trp 


Ala 


Val 


Arg 






275 










280 










'285 








Phe 


Met 


Val 


Gly 


Arg 


Thr 


Gly 


Thr 


Ala 


Cys 


Gly 


Thr 


Asp 


Pro 


Gin 


Ala 




290 










295 










300 










Pro 


Asn 


Glu 


Ser 


Leu 


Leu 


Thr 


Val 


Asp 


Gly 


Leu 


Ser 


Asn 


Ala 


Ser 


Cys 


305 










310 










315 










320 


Ala 


Ser 


Val 


Phe 


Leu 


Met 


Arg 


Tyr 


Tyr 


Phe 


Gly 


Met 


Ala 


Ala 


Cys 


Ala 










325 










330 










335 




Trp 


Trp 


Ala 


Val 


Leu 


Cys 


Leu 


Gly 


Trp 


His 


Arg 


Asp 


He 


Arg 


Arg 


His 








340 










345 










350 






Ser 


Pro 


Asp 


Ser 


Lys 


Gly 


His 


Val 


Val 


He 


Pro 


Ser 


Asn 


Phe 


Gly 


Gly 






355 










360 










365 








Ser 


Pro 


Ala 


Lys 


Arg 


Asn 


Ser 


Ala 


Lys 


Thr 


Ala 


Gin 


Gin 


Asp 


Leu 


Thr 




370 










375 










380 










Gin 


Asn 


Asn 


Phe 


Val 


Cys 


Phe 


Val 


Ala 


Trp 


Gly 


Leu 


Pro 


Ala 


Phe 


Gin 


385 










390 










395 










400 


Thr 


Ser 


Ala 


Val 


He 


Val 


Ala 


Arg 


Phe 


Val 


Asp 


Ala 


Asp 


Glu 


Leu 


Leu 










405 










410 










415 




Gly 


Asn 


Gin 


Ser 


Asp 


Lys 


Ala 


Leu 


Gin 


He 


Leu 


Val 


Ala 


Thr 


Pro 


Val 






420 










425 










430 






Phe 


Cys 


Tyr 


Trp 


He 


Phe 


Gly 


Ser 


Met 


Asn 


Leu 


He 


Ser 


Gly 


Tyr 


Leu 






435 










440 










445 








Val 


His 


Cys 


Arg 


Thr 


Lys 


Glu 


He 


Leu 


Arg 


Asn 


Ser 


Asn 


Ala 


Leu 


Ser 




450 










455 










4 60 










Val 


Gin 


Gin 


Gin 


Leu 


Gin 


Gin 


Leu 


Ser 


Ala 


His 


Ser 


Ser 


Ser 


Gly 


He 


465 










470 










475 










480 


Gly 


He 


Phe 


Leu 


Phe 


He 


Tyr 


Gly 


Leu 


Ala 


Cys 


Ala 


Met 


Leu 


Leu 


Leu 








485 










490 










4 95 




Ala 


Val 


He 


Tyr 


Glu 


Phe 


Ala 


Asn 


He 


Asp 


Val 


Trp 


Leu 


Gly 


Ser 


Gly 
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500 



505 



510 



Asp 


Thr 


Asn 

DID 


Thr 


Pro 


Leu 


Trp 


Pro 

C.90. 
3£U 


Phe 


Leu 


Leu 


Arg 


Ala 

^ 9 S 

D £. D 


Phe 


Met 


Glu 


Leu 


Met 

3oU 


Leu 


Gly 


He 


Cys 


Cys 

c 1 c: 


Phe 


Ala 


Trp 


Val 


Leu 


Gly 


Pro 


Ser 


He 


Ser 


Thr 


Leu 


Tyr 


Lys 


Arg 


Gin 


Val 


Ser 


Asn 


Gly 


Lys 


Met 


Val 


Lys 


His 


3 4 3 










c c r\ 

30U 










c c c 
J J J 










JuU 


Thr 


C «-i v 


ft 1 

Ala 


b±y 


Ala 
565 


Ala 


i nr 


oj.y 


nlS 


Leu 

3 / U 


Asp 


ri i, 


nlS 


C Q r- 


c "7 c 

O / 3 




Arg 


Gly 


Ser 


His 
580 


Ala 


Ala 


Cys 


Asn 


Ser 

c o c 
003 


Thr 


Val 


Val 


Ser 


Tyr 

con 


His 


Ser 


Val 


Arg 


Thr 
595 


Ser 


Met 


Ala 


Ser 


Val 
600 


Pro 


Leu 


Pro 


Pro 


Ser 


Pro 


Tyr 


Lys 


Leu 


Lys 
610 


Thr 


Ser 


Pro 


Gly 


Thr 
615 


Gly 


Ser 


He 


Ser 


Leu 
620 


Asn 


Gin 


Met 


Ser 


Asn 


Tyr 


Ser 


Leu 


Gly Arg 


Ser 


Val 


His 


His 


Gin 


Gin 


Arg 


His 


Ser 


Pro 


625 










630 










635 










640 


His 


His 


His 


His 


His 
645 


Gin 


Gin 


Gin 


Gin 


His 
650 


His 


Gin 


Phe 


His 


Pro 
655 


His 


His 


Asn 


His 


Gin 
660 


His 


His 


Ser 


Thr 


Ser 
665 


Ser 


His 


Arg 


Leu 


Tyr 
670 


Tyr 


Pro 


Pro 


Gly 


Ser 
675 


Tyr 


Ala 


Ser 


Gin 


Lys 
680 


Tyr 


Ser 


Gin 


His 


Gly 
685 


Ser 


Tyr 


Tyr 


Pro 


His 
690 


Leu 


Gin 


Gin 


Tyr 


Gly 
695 


Asn 


Glu 


Thr 


Leu 


Leu 
700 











<210> 13 
<211> 9795 
<212> DNA 
<213> Drosophila 



<400> 13 

tcatactcac 

cttggctgat 

tgtagataca 

aatgcagcta 

cacaactggc 

tcctacgcgg 

tggtcaaaaa 

ggaatcaaca 

aaggaaataa 

caagaagtaa 

ttgctgatac 

ttacattgta 

gttgttgatt 

agggggaggt 

agctgatgat 

aaaataattg 

tggaccgtgt 

gcgcctgctg 

ccatcgattt 

tgcagctgcc 

tgagactgca 

cccttaaact 

tcaccttggt 

cgccagaaat 

taccgctttt 

aggacgcctc 

ccgtttgaga 



tggctgccaa 
gctaggttct 
ccgcagatac 
cttacgacta 
tgggaaactg 
cccagggtca 
ctaagcttaa 
tttttataga 
tatgcatgct 
acatttggca 
cgaaaactcg 
aattttgtct 
ctgctaagtt 
tcgtatcgtg 
tggtcagttg 
cttggtgttt 
acagagtgga 
ctcctcttcc 
ccaggtgcct 
caagatagag 
agtgggcggc 
gggtggctgc 
ctccagcagt 
agtggggata 
ggtcaatgaa 
tgagggaatg 
ggagggaatc 



cccctactct 
aactttaagt 
agccgtcctg 
ggaggatgat 
aaagtggccg 
atcacattgc 
acgaccgcga 
cccatttaaa 
tagtaaagga 
ccccgaggcg 
cgaattaaga 
acttctaggc 
caggaaaatg 
acttatctag 
cacacgcgct 
cgcttctaat 
caatggacag 
tgggaatcgc 
ccactggtgg 
gtgctatctc 
aggatgcggt 
tgcaccaccg 
ttcacatact 
ggcaggaaat 
cccctatgga 
gaatcctgat 
aggcgggtga 



actctgacta 
acgattctta 
cggcggcaca 
tagtggcaag 
acaagcgcat 
ggagtaggtt 
tgttatctta 
gagctatcat 
acactacact 
aatactgata 
taaattctaa 
agagcacaat 
actagctatg 
cgttttaccc 
attctggggc 
tgacgaggac 
ggcatcttat 
cgagggaaag 
cctgctgcag 
cgggacggtc 
gtggaaagcg 
aaggtctccg 
ttccacccaa 
tcggatcctt 
tgggctttag 
ttggattatt 
acagtcccat 



ctttggttct 
gtattgctca 
acaacaagaa 
gggttaagga 
ctacttaacc 
ctacacagag 
tcaattaaaa 
cttttttcac 
aggtcttaag 
ctggtagttt 
cattgcacaa 
gtgcctgcaa 
tgaacttgac 
taatgcattt 
agattttgtg 
ttggacttgg 
ttattcggtg 
cctcgacttg 
cgaaatgcga 
ctcggattcg 
tggcgaaccc 
gttgctgggc 
cgtttggtcc 
ggtaaggtga 
gtggttacca 
gtatatggca 
gtgcgattta 



ggttttggat 
ctggatacac 
aacaaacgct 
gcgggttaat 
tagatgatgg 
agatgcgatt 
cgaaaaaaat 
cgaaatactt 
aaatgattat 
ttagaaaact 
atttcttagt 
ttttctccgt 
ttgaggaggg 
gattgcttta 
ggcgtgtcgt 
atttgaagtg 
ccattcgcag 
cccgctgttt 
ggtatggact 
gccagggcgc 
cttacccttt 
aattggaagt 
ccctgttcac 
tccaggtgag 
ttcgggtggg 
ttgcaggacg 
aaggccgcct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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tcagaccctt ttggtccgcc aagctggcgg tggatccctt gttgctcgaa gtggtgcgag 1680 
aaaatatccc tcgtaatcgg tttatcaagc ccgtcttagc ctccgtggcc tgctggtggg 1740 
ctaaacggag gttaatcaag tcctgtgcag gcaggcaggc ctcctctaat tggggctggg 1800 
aggcaaaggt aacagaacgc ctcatcggga tctgcatgta ctctgctgac ggccaactgg 1860 
cccgtggaat cacaggaggc agcacggaaa gcaggagatt gggcagtatt cccgagatac 1920 
ttggagttct gtttttactt gcagccggag tctgcgagat gccactggtt accgtcagtt 1980 
ctggctttag atgacctccc ccagttaaga gtgatgtgga cgacgaagtt gtggtggagg 2040 
tggtggttag tcccgactcg agtagcctct ccagctttcg aatctcggcg tccagagcct 2100 
ggatttcctt cttatagacc ctgttttgca cctccactcg atactgcagt tctcggcgat 2160 
catcgaccac gaatcggcgt gtgttgcact ccatcttaag gcccatactg tggataaccg 2220 
gatcgataat atcgtctgag ggagaaggag agttcccaaa ggattaggac acaactttat 2280 
aaaggttcta gaaattagga gactagaact taaaagtatt tcctaagtga gcaagatctt 2340 
attgcacctt caagcgctac ttacttcttg cccaaatatc atggagtttt gggatgaaaa 2400 
gcagacaaag ggttgcagtg gtgctggtta aaatcagagc tgtgattgtg atgaaggcca 24 60 
gggtgactcg ctccgaaatc aagttggcca gcaccacgac gatggcgctg gtgatgacca 2520 
cactgtatac agacactccg atgtactgcg agtcattgag agcaggtatt tttacgtggc 2580 
gcgtctccca ggccatatag acacccacca caagaagaag gcctttgtag gcgtacagga 2640 
cactcaacca cgtttgcgtg tgctgcgaac ggcaaacttc aacctaaaat agatagatag 2700 
atagatattc atagattcta aatgaattaa tactttttca acataattaa aagtaatctg 2760 
ataatttgaa gcatattaat atttttctta cctgaggctg gtaaacgaca cttctatcag 2820 
tcgcactgat ctcgagcgtc aggttgtgaa gatggcgctc cattggatcg gtgaccaccc 2880 
aaagggttac gagcagcgca tccaccagaa gcaatccgcc gacgagcaag atcagttgaa 2940 
tgtcctgcag catcttgtcc ttgaaaacgc tgccggtacg agtgaatatc cgatgcactc 3000 
tgtaggtctt ggcaaacatc gatccaaagg ccaacgagaa tccggcggag agcagataga 3060 
cgcgggcctg gtgagcatga taagcggatg acggggaagt tcaggttagg gagaaagtac 3120 
tgtagtagca gcagcaaagt aacacactaa gtgtcatgtc aggcagaccc aaagaaaaca 3180 
agccggcggc aacaatgggg gaaacaagtc atcaatggac cacaattctc agagtgcttg 3240 
agactttgtt gtgcgtttca aatttatgca aataaaggaa tatcacaaaa gttgtaaatt 3300 
tccgtgtcat gtgtttgcat ttccctgcca caagcctcaa caaaacaaaa acaagcaggc 3360 
aactggaaaa gcggaacgac agtagaagtg gttacagaaa cagaaacaac agcgaccagc 3420 
aaatttccca ggagtggcgt tcgaggcagt tgttaagggg gtcaaagtgg aaaggcgcag 3480 
atcgggatag aagaaagctc gtcaaacttg acttgacttc tggcttaaag agcagcagat 3540 
ttgcagcgaa accgaacggc tatttatagc cagcaggggc agtcgtgtat cttgaaaaga 3600 
cacgcggaat aatcaatatt cagtttatat aatccacttt aaaataatag aaattaacaa 3660 
aaaattgcca caattatcag taatttttat acttcgcagg ctattttatc gccatcttcg 3720 
tgtatacgaa tacgacaatt agttcaagca atatttcttt cagtgaataa ataaagcaaa 3780 
cacatgagcg cgactcgaaa gttaccagcc tgaaactgaa agtcatattt gatgaggacc 3840 
ccacatgctt gcttacacac agaagcccgt caaagtcaaa caaagacggc tgagaggaaa 3900 
. ggcggatgag aatgcggatt cggataggaa tggcgatagg gatggggatg tcggagccga 3960 
aggataacaa ctcaccgtgc agaccgttgc gaaagagtcc tccgccgagg gcagcgtcga 4020 
gtggtccaag cccaaaagga tgacggtggc gtacacaaag atgcagccca ctgcggtgat 4080 
gttgctcagc ttcgggctgg aaagtttaat tgccctgtgg agtgagaagg gggatggtca 414 0 
tggttataag ctgaggcgcc gcaatccttt gcagagaaag tgcagagtaa ccgtagaatg 4200 
cgaaacatat atacgaactc aactaacgat gggtcagaaa tacagctgga aagccaagca 4260 
attacttcac ttcaaaaacc gaaacaagtc acatgtgtaa tgggttcaat gggttgagca 4320 
ccgaaagcag ttaactcatc atggaatata aagaaacagc ccgcacaaat tctcttttat 4380 
atggccacta aatattttat agtcgtttaa tgaaaaataa tatattcaaa caatcagggc 4440 
ccataaaaag agtaagccaa taatttatgt acttatcgcc ttacttcagc ttccgaaagt 4500 
gcagattgaa cgccaggaag gcgatggcca gagcgattcc cacgctggag agggtggcga 4560 
tggtgtagaa ggccagtgga gcgatggtcg ccacccgcag cttgaacacc cgcttggcga 4 620 
tgggtacctg cccgctgtgc cacttcaccg gccggcagcg gggacaccgg aagtccaggg 4 680 
catccgtggc cggatagtag agggccaccg gttccagcaa accgcgctgg atttgataga 4740 
aggcagtggt gccaacgcga tctgggccgc tgaaggaaac ggggccctac caaaaagaga 4800 
aatactcgtg atgaactaga tttaagaacc ttaggatcaa agttaaatgg gttggatgat 4860 
ttcagttgaa agagataact caccgacact cccaggaagt ggagcttgcc catttgctgc 4 920 
aggaactccc aggccatgtc gctgcgggtg taatcgaatc cgtccagctt cgactgctcc 4980 
tcgttccgcc gccagtgctc ctcagcggct ctcaaggcca gggcgatggc ccacacggcg 5040 
tcgtaggttt gcggcgcgta ctgcgagatc gcctccggaa agaggtgccc tccgctggtg 5100 
cctcccacgc cccttctcct ccgccgacga gagtcagatt gcgttgcagc gatactgatc 5160 
ctggaaccat aaccggagcc aaatccatcc tggccgtgga actgggcgga ttgcttgcgc 5220 
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agctgggagt 
atgccgcaga 
tttcatgacc 
atgacaggcc 
gaaaccatcg 
ttcggacacg 
ctgaataaaa 
gttgcttgaa 
gcatgtataa 
tcatgaaatt 
ctcctcaact 
agaccacgat 
ggtccggcca 
acattcgaag 
cgatgatgat 
ccatttgtca 
ggtaacaatc 
tatccatgcg 
taaaaaacgt 
ttgccaggca 
gttcatgcat 
tttcgaaaat 
gtttacgttt 
ggccatttgt 
gtgatggtgg 
gccagcgagt 
cgaatgaaag 
taggggaact 
cgaacgaaat 
atcaagaaat 
cgcaccgccg 
ctcataattt 
ggctggattc 
agaaattgcc 
ttaatcagtc 
ctctcggtga 
gtgtagatgg 
aagaaatgtg 
gggcagggaa 
aaggcaacat 
ggagatgcgg 
aaaaaatcgt 
ttgacggttt 
aattaattgt 
ttacattgtt 
tttgcgtgcc 
gttgatgaca 
ttgttgttgc 
aatttcgtaa 
ccacgtagaa 
ggcagcaggc 
agaccatccg 
ttgccatgat 
ttgaccttca 
ttggtgcttc 
gtccgtttgt 
gatgccgatt 
ggcctaattg 
gccgacgttg 
tatctggtcc 



tgaacatgtg 
aatgtcgcca 
agcactaaaa 
ggacttaggg 
cgaaacgcaa 
cacagcaaga 
aaaaaaaaaa 
gctgcctttg 
cttaacaaca 
aatttattta 
taccagtcca 
gaggttctcg 
ccacggagcc 
cctgtaggcc 
gcgcgtgtcc 
tttgtcatcc 
atcatgctga 
gactggacga 
aattataatc 
aataaattgg 
aagcttttaa 
tgaaatttag 
atacgtttgc 
tgctcaccct 
cggcacagga 
gaacctcctc 
cgatgcgcgc 
cccgcctgtc 
cgttagacta 
ccagccctct 
aaatgaaact 
gtgggctgtg 
cgcagcattt 
cgatacgagg 
gcagatgact 
cctccgagca 
cgtggaagaa 
tgaagaaacg 
aatagcagtg 
gatggcgcaa 
ctccagatat 
ttgaaaacct 
aatttatcag 
agctttaata 
ggtaacttag 
ccatcaaaac 
tatttgcctc 
agattttctt 
tggaacagcg 
atgacattct 
gcttgcggtt 
gacgtggtcc 
tgcgctgcat 
agtatcgctg 
cgggcagatt 
acttccaatc 
ctccgggtgg 
cctcgtagcc 
cccaggcaag 
cttggaccgg 



attgttctgc 
gtaatgagat 
gagccataaa 
ggcttcgcac 
tcgcagcctc 
cggacaagac 
aaaaaaacgg 
agagttgggt 
atctatLctt 
ttttcccttt 
ctatagctga 
acggccagct 
cccatgctct 
tcgcacagga 
gtctcctgtg 
ttcgattagc 
tagagggtct 
acaaaaaact 
atacatatga 
tctatttaaa 
tataaacaga 
ttaaatttaa 
gtatacgagt 
aagtagcagc 
tatgttggcc 
gttctgcgag 
cggattgtgt 
gctcaacgcc 
tctctaccgt 
aggtcatgtg 
tcaaaggcca 
gaaatgttta 
atgctaatcg 
agttggttgg 
cacctgcacg 
ggccgatccc 
gcgatccacg 
gaggagcaaa 
gcagaggccg 
taaacatttt 
gacagtctgt 
agttcactaa 
aaactaaact 
cactactgca 
tcagtggcat 
cggcaacgac 
tcttttaggg 
attaagtttt 
acgcctagaa 
cacctgagta 
gatgtgctcc 
ccgcgatgtg 
ttcggccggc 
cctggggggc 
gctgtgcgaa 
ggagcgcggt 
cgagctggat 
cagggcctgc 
ggccaggact 
ttgaattatg 



gaagtggaaa 
tgcctatggc 
aacgcgtctg 
ttggcaatca 
gggattgtct 
aggacaacga 
ggaataaaag 
tcaatccaat 
gaaattcatc 
aatattgaat 
cgttatttcc 
gcagttcgtg 
cgtggaggat 
tctggggggc 
gcgatacgaa 
acaagtgcca 
ctaggcgatg 
tgaagataaa 
accttttttt 
tattccccaa 
acagcgctag 
tgttggtttt 
ataaaataaa 
agctgctcct 
gcctccagtt 
aaagtggtca 
gaggagtccg 
ggcgatgtgg 
acatggggtc 
tcttcccagt 
aatgcaaatg 
tggctcaaag 
agctacgcac 
atgaaccgag 
atgttccagt 
agcagcatca 
cccactccag 
ggacaccaca 
ttgctcgctg 
taaaatgtcg 
ttaaggtcga 
gtgctgacat 
actattatgc 
ccaagttaaa 
tgcttttctt 
tcatcgcaat 
ccacagccgc 
aatgttgcac 
gtggcgccca 
tcgttggtca 
acggccatgg 
gacagctcaa 
gacatgcggc 
agattgagga 
ttgaggcgct 
ggaatcggag 
gggctggatg 
agctcggctg 
agcctcagtc 
cgcatcccgt 



cgcgattaaa 
agctaccact 
cgatggtgaa 
ccatcgccat 
aatcaaataa 
gaaggacaag 
tttcggcaat 
ttggatcccg 
tttacgttaa 
gatattttct 
aacgatgctg 
gt.tagagcag 
ccaggcgtag 
cagctcctgc 
cgaaacggaa 
atccacactc 
tggtctgcta 
ctcagagaaa 
taaaaactta 
atacgttcat 
caccatttta 
tcaaacaaat 
acgaaaatgt 
tgaagtcggt 
cggtgaccag 
ccgtgcccca 
gggccaccgt 
aaccgaagga 
gcttaaaatg 
ttccccactc 
ctctaagtga 
gtttatgggg 
gtgagttgat 
cgagctggat 
agggcaccac 
ccatcctcgt 
gatcacactg 
tcagagatga 
actcgtttcg 
acttttagag 
tgtgcatatg 
cacagtcgag 
attttctttt 
aatgattgca 
cagtgacagt 
aaaagcgcga 
agctgctatc 
gcgctactga 
acgtggggca 
cgagctcgag 
tggcagctcc 
agagtccgag 
gttcgctttc 
ggtggtgggc 
gctgccggcg 
tcccccctgc 
tgctaattga 
atgactccat 
ccacggccgg 
cggcaatggc 



caagcctcga 
cctgccccaa 
tgcgaatgcg 
cgccattgct 
atcatttcat 
atagtcgtct 
tctggaaaat 
aaatttatat 
tctcatactg 
cgcagtgcac 
ttgtgcgttg 
gcggtgcgct 
tccgccccga 
gagaagctgc 
cggcatgtgc 
aaaaaaaatg 
atgaaacatt 
taagttaacg 
ctaaattaag 
tttcatattt 
atttaaacat 
attcataaat 
gtatttaaat 
ggccgcaaag 
gttgttcacc 
gccaaacttc 
cctgtagaag 
tacctttcat 
caagacacgc 
atttaacaca 
atttgtggac 
tgctggaatc 
tagatttttg 
tatttgcatt 
cttcgccagg 
cgagggctgt 
attgtcggag 
gtaatagagg 
acgcaataaa 
aggcacagcc 
catacactta 
gtattacgta 
ttttaccatc 
ggtaaatagt 
gttctgtaca 
aagacggaag 
cgcactgttc 
atgaaaattc 
tacggttacg 
ggtgtagccc 
caattcgctc 
cagcacgatt 
gaacacctgg 
gtgggaggca 
tttgactttc 
agtcacagtc 
ggcagcacct 
ggcagccgct 
ccatggaccg 
caatggccca 



5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
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ttaatcacca 
gccacacagc 
acgcttatcg 
cgtcaatttt 
cattcactct 
gtccgattga 
tcacttttcg 
aatttaggtg 
tgaatctaca 
tgaaagggaa 
tcacaccaat 
cgccgctcac 
tgctaatggc 
ctcacgcgcc 
tgtttcaaga 
tgcctgcacc 
cacgcttttc 



aggcatttga 
acaggacacc 
tttctcttcc 
atcgcctttt 
gcacagcagt 
caggattggg 
tagaaaaaaa 
aatatttatt 
tatgtatata 
tgttaatgga 
ttggcagtcg 
gtctcgtccc 
caaacatttt 
acagatttct 
ttaccaactt 
taagcccact 
gctgt 



atcgtcggct 
ggaatatata 
gacgagcggc 
tattggcctc 
cgaccataaa 
taacacggga 
tgcttatttt 
agtattttgc 
tatgtatata 
aataccgaaa 
aaaggattat 
atgacttgtt 
gcgctttcat 
gtggcgaacg 
gcagtccctt 
tcatagacgc 



atcagaggct 
tcgccttttt 
tcttccttct 
acttttgctc 
agtatgcaat 
aaaaactaca 
tattatgcac 
tgtagtatgt 
tgtgaaaaca 
caaatgtcta 
gacgaaggct 
atgtgcctgc 
tgcgggcttt 
tttttctctg 
tgccgctttc 
tttccttcgc 



gcagcaacaa 
atgcactctg 
gctccctcct 
gcattttgta 
gcaaaggact 
cgccattcgc 
tgtttattaa 
gattttatta 
ttgatttttt 
gggaggcata 
tcgatgatat 
cacttggcca 
tggactactt 
tatgcgcgcg 
ccgcttttcc 
gctttctgct 



acaaacacga 
ccgacaggat 
cttgtatcgg 
tctttttgca 
ttctcgatag 
taactttcag 
ggattaacac 
aaatgtaatt 
catgagaatg 
aaaaggcatt 
ggacacaccg 
ttatttttta 
tttgtcgagt 
tatgcataat 
accgcccact 
tttcccacag 



8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9795 



<210> 14 
<211> 3918 
<212> DNA 
<213> Drosophila 



<400> 14 

atgcgcataa 

aggctagtcc 

gagctgcagg 

agcccatcca 

attccaccgc 

ctcaattcgc 

aatctgcccc 

atgtcgccgg 

ctgtccacat 

gccgtggagc 

aacgatactc 

cagccctcga 

ctggcgaagg 

gcgttgagcg 

cacaatccgg 

tcgcagaacg 

gccaacatat 

ctgctactta 

ccccagatcc 

ctccacgaga 

gaactgcagc 

aataacgtca 

tccgcccagt 

gctgcaacgc 

gggcacctct 

gccatcgccc 

aagctggacg 

ggcaagctcc 

ggcaccactg 

ccggccacgg 

gggcaggtac 

gccttctaca 

gcgttcaatc 

aacatcaccg 



ttcaaccggt 
tggcccttgc 
ccctgggcta 
gctcgccacc 
gctccgattg 
acagcaatct 
ccaggcagcg 
ccgaaatgca 
cgcggggacc 
acatcaaccg 
agtgtgatcc 
cgaggatggt 
tggtgcccta 
acaggcggga 
cgcgcatcgc 
aggaggttca 
cctgtgccgc 
gggagacgga 
tgtgcgaggc 
gcatgggggc 
tggccgtcga 
gctatagtgg 
tccacggcca 
aatctgactc 
ttccggaggc 
tggccttgag 
gattcgatta 
acttcctggg 
ccttctatca 
atgccctgga 
ccatcgccaa 
ccatcgccac 
tgcactttcg 
cagtgggctg 



ccaagggacc 
ctgggcaacg 
cgaggcaatt 
cggagaatcg 
gaagtacaaa 
gcccggaagc 
atacttgaag 
gcgcaatcat 
acgtccggat 
caagcgcctg 
tggagtgggc 
gatgctgctg 
ctggaacatc 
gttcccctac 
tttcattcgg 
ctcgctggcg 
caccatcacc 
cacgcgcatc 
ctacaggctt 
tccgtggtgg 
gaacctcatc 
actgaacaat 
ggatggattt 
tcgtcggcgg 
gatctcgcag 
agccgctgag 
cacccgcagc 
agtgtcgggc 
aatccagcgc 
cttccggtgt 
gcgggtgttc 
cctctccagc 
gaagctgaag 
catctttgtg 



agatacggtc 
tcggcagcgg 
aggccaggtg 
gcatcgactg 
cggacgaaag 
accaatgcct 
gtcaaccagg 
ggcaaaatcg 
ggtctgagcg 
ctgccgggct 
gtggatcgct 
ggatcggcct 
gtgcaggtat 
ttctacagga 
aagtttggct 
gtgaacaacc 
tttgcggcca 
atcatcggca 
cgaatgttcg 
ccggaccagc 
gtggtctcaa 
cacatgttca 
ggctccggtt 
aggagaaggg 
tacgcgccgc 
gagcactggc 
gacatggcct 
cccgtttcct 
ggtttgctgg 
ccccgctgcc 
aagctgcggg 
gtgggaatcg 
gcaattaaac 
tacgccaccg 



catggccggc 
ctgccatgga 
ctgcctcaat 
tgactgcagg 
tcaaacgccg 
cccacgccca 
tgttcgaaag 
tgctgctcgg 
aattgggagc 
acaccctcga 
tcttccacgc 
gctcggaggt 
ccttcggttc 
cggtggcccc 
ggggcacggt 
tggtcaccga 
ccgacttcaa 
gcttctcgca 
gggcggacta 
gcaccgcctg 
cgcacaacag 
actcccagct 
atggttccag 
gcgtgggagg 
aaacctacga 
ggcggaacga 
gggagttcct 
tcagcggccc 
aaccggtggc 
ggccggtgaa 
tggcgaccat 
ctctggccat 
tttccagccc 
tcatcctttt 



cgtgggactg 
gtcatcagcc 
tagcacatcc 
ggggactccg 
gcagcagcgc 
ccacctcctc 
cgaacgccgc 
actctttgag 
tgccaccatg 
gctcgtgacc 
catctacaca 
caccgagagc 
cacatcgccg 
ggactcctca 
gaccactttc 
actggaggcg 
ggagcagctg 
ggagctggcc 
cgcctggatc 
ctctaaccac 
catcgttgga 
gcgcaagcaa 
gatcagtatc 
caccagcgga 
cgccgtgtgg 
ggagcagtcg 
gcagcaaatg 
agatcgcgtt 
cctctactat 
gtggcacagc 
cgctccactg 
cgccttcctg 
gaagctgagc 
gggcttggac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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cactcgacgc 
ctgctctccg 
catcggatat 
ctgatcttgc 
gtcaccgatc 
agaagtgtcg 
ttgagtgtcc 
gagacgcgcc 
tacagtgtgg 
gtcaccctgg 
tgtctgcttt 
gttatccaca 
cgccgagaac 
gctctggacg 
tccaccacaa 
ctgacggtaa 
atctcgggaa 
agttggccgt 
cagccccaat 
caccagcagg 
cgcaccactt 
gcggccttta 
tcctgcaatg 
cacccgaatg 
cacctggatc 
gaacaggggg 
ttccaattgc 
agggtaaggg 
gccctggccg 
tccatacctc 
acagcgggca 
gcgaatggca 



tgccctcggc 
ccggattctc 
tcactcgtac 
tcgtcggcgg 
caatggagcg 
tttaccagcc 
tgtacgccta 
acgtaaaaat 
tcatcaccag 
ccttcatcac 
tcatcccaaa 
gtatgggcct 
tgcagtatcg 
ccgagattcg 
cttcgtcgtc 
ccagtggcat 
tactgcccaa 
cagcagagta 
tagaggaggc 
ccacggaggc 
cgagcaacaa 
aatcgcacat 
ccatatacaa 
gtaaccacct 
accttaccaa 
accaaacgtt 
ccagcaaccg 
gttcgccacg 
aatccgagga 
gcatttcgct 
agtcgaggct 
ccgaataa 



ggaggactct 
gttggccttt 
cggcagcgtt 
attgcttctg 
ccatcttcac 
tcaggttgaa 
caaaggcctt 
acctgctctc 
cgccatcgtc 
aatcacagct 
actccatgat 
taagatggag 
agtggaggtg 
aaagctggag 
cacatcactc 
ctcgcagact 
tctcctgctt 
catgcagatc 
ctgcctgcct 
taagacgggc 
gggatccacc 
gggactgttc 
taatccaaat 
aaagcccatc 
ggatccgaat 
gggtggaaag 
gagaccttcg 
ctttccacac 
ccgtcccgga 
gcagcaggcc 
ttccctcggc 



ttcgcaacgg 
ggatcgatgt 
ttcaaggaca 
gtggatgcgc 
aacctgacgc 
gtttgccgtt 
cttcttgtgg 
aatgactcgc 
gtggtgctgg 
ctgattttaa 
atttgggcaa 
tgcaacacac 
caaaacaggg 
aggctactcg 
ttaa'ctgggg 
ccggctgcaa 
tccgtgctgc 
ccgatgaggc 
gcacaggact 
ttgataaacc 
gccagcttgg 
acccgcctga 
caggattcca 
cataggggtt 
ttcctgccta 
tatgtgaaac 
gtggtgcagc 
cgcatcctgc 
gatagcacct 
accagtggag 
gattcccagg 



tctgcacggc 
ttgccaagac 
agatgctgca 
tgctcgtaac 
tcgagatcag 
cgcagcacac 
tgggtgtcta 
agtacatcgg 
ccaacttgat 
ccagcaccac 
gaaacgatat 
gccgattcgt 
tctataagaa 
agtcgggact 
gaggtcatct 
gtaaaaacag 
ctcctgtgat 
gttctgttac 
tgattaacct 
gattacgagg 
cggaccaaaa 
ttccctcctc 
ttccctcaga 
cattgaccaa 
tccccactat 
tgctggagac 
agccacccag 
cgcccacttg 
ctatcttggg 
gcacctggaa 
aagaggagca 



ccgcgtctat 
ctacagagtg 
ggacattcaa 
cctttgggtg 
tgcgactgat 
gcaaacgtgg 
tatggcctgg 
agtgtctgta 
ttcggagcga 
tgcaaccctt 
tatcgatccg 
ggtcgatgat 
ggaaatccag 
aaccaccacc 
aaagccagaa 
aactccaagt 
tccacgggcc 
ctttgcctcc 
ccgtttagcc 
gatattttct 
gggtctgaag 
tcaaacggcg 
ggcgtcctcc 
aagcggtact 
ttctggcggt 
caaggtgaac 
tttaagggaa 
cagtctcagc 
cagctgcaag 
atcgatggaa 
gcaggcgcct 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3918 



<210> 15 

<211> 1305 

<212> PRT 

<213> Drosophila 



<400> 15 

Met Arg lie lie Gin 

1 5 
Ala Val Gly Leu Arg 
20 

Ala Ala Ala Met Glu 
35 

Ala lie Arg Pro Gly 
50 

Ser Pro Pro Gly Glu 
65 

lie Pro Pro Arg Ser 
85 

Arg Gin Gin Arg Leu 
100 

Ala Ser His Ala His 
115 

Leu Lys Val Asn Gin 
130 

Glu Met Gin Arg Asn 
145 

Leu Ser Thr Ser Arg 



Pro Val Gin Gly 
Leu Val 
Ser Ser 



Ala Ala 

55 
Ser Ala 
70 

Asp Trp 

Asn Ser 

His Leu 

Val Phe 
135 
His Gly 
150 

Gly Pro 



Leu Ala 

25 
Ala Glu 
40 

Ser lie 



Ser Thr 

Lys Tyr 

His Ser 
105 
Leu Asn 
120 

Glu Ser 
Lys lie 
Arg Pro 



Thr Arg 
10 

Leu Ala 

Leu Gin 

Ser Thr 

Val Thr 

75 
Lys Arg 
90 

Asn Leu 

Leu Pro 

Glu Arg 

Val Leu 
155 
Asp Gly 



Tyr Gly 

Trp Ala 

Ala Leu 

45 
Ser Ser 
60 

Ala Gly 

Thr Lys 

Pro Gly 

Pro Arg 
125 
Arg Met 
140 

Leu Gly 
Leu Ser 



Pro Trp Pro 
15 

Thr Ser Ala 
30 

Gly Tyr Glu 

Pro Ser Ser 

Gly Thr Pro 
80 

Val Lys Arg 
95 

Ser Thr Asn 
110 

Gin Arg Tyr 

Ser Pro Ala 

Leu Phe Glu 
160 

Glu Leu Gly 
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165 










170 










175 




Ala 


Ala 


Thr 


Met 


Ala 


Val 


Glu 


His 


He 


Asn 


Arg 


Lys 


Arg 


Leu 


Leu 


Pro 








180 










185 










190 






Gly 


Tyr 


Thr 


Leu 


Glu 


•Leu 


Val 


Thr 


Asn 


Asp 


Thr 


Gin 


Cys 


Asp 


Pro 


Gly 




195 










200 










205 








Val 


Gly 


Val 


Asp 


Arg 


Phe 


Phe 


His 


Ala 


He 


Tyr 


Thr 


Gin 


Pro 


Ser 


Thr 




210 










215 










220 










Arg 


Met 


Val 


Met 


Leu 


Leu 


Gly 


Ser 


Ala 


Cys 


Ser 


Glu 


Val 


Thr 


Glu 


Ser 


225 










230 










235 










240 


Leu 


Ala 


Lys 


Val 


Val 


Pro 


Tyr 


Trp 


Asn 


He 


Val 


Gin 


Val 


Ser 


Phe 


Gly 










245 










250 










255 




Ser 


Thr 


Ser 


Pro 


Ala 


Leu 


Ser 


Asp 


Arg 


Arg 


Glu 


Phe 


Pro 


Tyr 


Phe 


Tyr 








260 










265 










270 






Arg 


Thr 


Val 


Ala 


Pro 


Asp 


Ser 


Ser 


His 


Asn 


Pro 


Ala 


Arg 


He 


Ala 


Phe 






275 










280 










285 








He 


Arg 


Lys 


Phe 


Gly 


Trp 


Gly 


Thr 


Val 


Thr 


Thr 


Phe 


Ser 


Gin 


Asn 


Glu 




290 










295 










300 










Glu 


Val 


His 


Ser 


Leu 


Ala 


Val 


Asn 


Asn 


Leu 


Val 


Thr 


Glu 


Leu 


Glu 


Ala 


305 










310 










315 










320 


Ala 


Asn 


He 


Ser 


Cys 


Ala 


Ala 


Thr 


He 


Thr 


Phe 


Ala 


Ala 


Thr 


Asp 


Phe 










325 










330 










335 




Lys 


Glu 


Gin 


Leu 


Leu 


Leu 


Leu 


Arg 


Glu 


Thr 


Asp 


Thr 


Arg 


He 


He 


He 






340 










345 










350 






Gly 


Ser 


Phe 


Ser 


Gin 


Glu 


Leu 


Ala 


Pro 


Gin 


He 


Leu 


Cys 


Glu 


Ala 


Tyr 




355 










360 










365 








Arg 


Leu 


Arg 


Met 


Phe 


Gly 


Ala 


Asp 


Tyr 


Ala 


Trp 


He 


Leu 


His 


Glu 


Ser 




370 










375 










380 










Met 


Gly 


Ala 


Pro 


Trp 


Trp 


Pro 


Asp 


Gin 


Arg 


Thr 


Ala 


Cys 


Ser 


Asn 


His 


385 










390 










395 










400 


Glu 


Leu 


Gin 


Leu 


Ala 


Val 


Glu 


Asn 


Leu 


He 


Val 


Val 


Ser 


Thr 


His 


Asn 










405 










410 










415 




Ser 


He 


Val 


Gly 


Asn 


Asn 


Val 


Ser 


Tyr 


Ser 


Gly 


Leu 


Asn 


Asn 


His 


Met 








420 










425 










430 






Phe 


Asn 


Ser 


Gin 


Leu 


Arg 


Lys 


Gin 


Ser 


Ala 


Gin 


Phe 


His 


Gly 


Gin 


Asp 






435 










440 










445 








Gly 


Phe 


Gly 


Ser 


Gly 


Tyr 


Gly 


Ser 


Arg 


He 


Ser 


He 


Ala 


Ala 


Thr 


Gin 




450 










455 










460 










Ser 


Asp 


Ser 


Arg 


Arg 


Arg 


Arg 


Arg 


Arg 


Gly 


Val 


Gly 


Gly 


Thr 


Ser 


Gly 


465 










470 










475 










480 


Gly 


His 


Leu 


Phe 


Pro 


Glu 


Ala 


He 


Ser 


Gin 


Tyr 


Ala 


Pro 


Gin 


Thr 


Tyr 








485 










490 










495 




Asp 


Ala 


Val 


Trp 


Ala 


He 


Ala 


Leu 


Ala 


Leu 


Arg 


Ala 


Ala 


Glu 


Glu 


His 






500 










505 










510 






Trp Arg 


Arg 


Asn 


Glu 


Glu 


Gin 


Ser 


Lys 


Leu 


Asp 


Gly 


Phe 


Asp 


Tyr 


Thr 






515 










520 










525 








Arg 


Ser 


Asp 


Met 


Ala 


Trp 


Glu 


Phe 


Leu 


Gin 


Gin 


Met 


Gly 


Lys 


Leu 


His 




530 










535 










540 










Phe 


Leu 


Gly 


Val 


Ser 


Gly 


Pro 


Val 


Ser 


Phe 


Ser 


Gly 


Pro 


Asp 


Arg 


Val 


545 










550 










555 










560 


Gly 


Thr 


Thr 


Ala 


Phe 


Tyr 


Gin 


He 


Gin 


Arg 


Gly 


Leu 


Leu 


Glu 


Pro 


Val 








565 










570 










575 




Ala 


Leu 


Tyr 


Tyr 


Pro 


Ala 


Thr 


Asp 


Ala 


Leu 


Asp 


Phe 


Arg 


Cys 


Pro 


Arg 








580 










585 










590 






Cys 


Arg 


Pro 


Val 


Lys 


Trp 


His 


Ser 


Gly 


Gin 


Val 


Pro 


He 


Ala 


Lys 


Arg 




595 










600 










605 








Val 


Phe 


Lys 


Leu 


Arg 


Val 


Ala 


Thr 


He 


Ala 


Pro 


Leu 


Ala 


Phe 


Tyr 


Thr 




610 










615 










620 










He 


Ala 


Thr 


Leu 


Ser 


Ser 


Val 


Gly 


He 


Ala 


Leu 


Ala 


He 


Ala 


Phe 


Leu 


625 










630 










635 










640 


Ala 


Phe 


Asn 


Leu 


His 


Phe 


Arg 


Lys 


Leu 


Lys 


Ala 


He 


Lys 


Leu 


Ser 


Ser 
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09 0 










£ ^ n 














Pro 


Ly s 


Leu 


JCI 


Asn 


lie 


i nr 




vdl 


pi w 
ui y 


Cys 


lie 


rne 


va jl 


Tyr 


Aid 








C\ £n 

DDU 










ODD 










D / U 






Thr 


Val 


lie 


Leu 


Leu 


Gly 


Leu 


Asp 


HIS 


ber 


Thr 


Leu 


Pro 


Ser 


7\ 1 — 

Ala 


Glu 






b / D 










Con 

boU 










000 








Asp 


Ser 


Phe 


Ala 


Thr 


\ 7 — 1 

val 


Cys 


Thr 


Ala 


Arg 


val 


Tyr 


Leu 


Leu 


Ser 


Ala 




con 

byu 










o y d 










i r\ f\ 
1 UU 










Gly 


Phe 


O n v 

ber 


Leu 


Ala 


Fne 


Gly 


C n v 

ber 


Met 


rne 


Ala 


Lys 


Thr 


Tyr 


Arg 


val 


n r\ c 
/Uo 










1 10 










/ 13 










/ 20 


His 


Arg 


lie 


Phe 


Thr 


Arg 


Thr 


Gly 


Ser 


17—1 

val 


Phe 


Lys 


Asp 


Lys 


Met 


Leu 










725 










Tin 










735 




Gin 


Asp 


lie 


Gin 


Leu 


lie 


Leu 


Leu 


Val 


Gly 


Gly 


Leu 


Leu 


Leu 


Val 


Asp 








"7 a n 
/ 4 U 










/ 4 0 










Ten 
/OU 






Ala 


Leu 


Leu 


Val 


Thr 


Leu 


Trp 


val 


val 


Thr 


Asp 


Pro 


Met 


Glu 


Arg 


His 






/50 










1 bu 










1 c c 

/bo 








Leu 


His 


Asn 


Leu 


Thr 


Leu 


Glu 


lie 


Ser 


Ala 


Thr 


Asp 


Arg 


Ser 


Val 


Val 




770 










775 










780 










Tyr 


Gin 


Pro 


Gin 


Val 


Glu 


Val 


Cys 


Arg 


Ser 


Gin 


His 


Thr 


Gin 


Thr 


Trp 


785 










790 










7 95 










800 


Leu 


Ser 


Val 


Leu 


Tyr 


Ala 


Tyr 


Lys 


Gly 


Leu 


Leu 


Leu 


Val 


Val 


Gly 


Val 










805 










810 










815 




Tyr 


Met 


Ala 


Trp 


Glu 


Thr 


Arg 


His 


Val 


Lys 


He 


Pro 


Ala 


Leu 


Asn 


Asp 








o o o 










o /. D 










o Oft 

o 3U 






Ser 


Gin 


Tyr 


T "1 « 

lie 


Gl y 


val 


Ser 


Val 


Tyr 


Ser 


val 


val 


lie 


Thr 


Ser 


Ala 






o o c 

835 










840 










O A C 

845 








lie 


Val 


Val 


Val 


Leu 


Ala 


Asn 


Leu 


lie 


Ser 


Glu 


Arg 


Val 


Thr 


Leu 


Ala 




djU 










ODD 










o c c\ 
o bU 










Phe 


lie 


Thr 


lie 


Thr 


Ala 


Leu 


He 


Leu 


Thr 


Ser 


Thr 


Thr 


Ala 


Thr 


Leu 


ODD 










0 / U 










QIC 










BoU 


Cys 


Leu 


Leu 


Phe 


lie 


Pro 


Lys 


Leu 


His 


Asp 


He 


Trp 


Ala 


Arg 


Asn 


Asp 










885 










890 










895 




lie 


lie 


Asp 


Pro 


Val 


lie 


His 


Ser 


Met 


Gly 


Leu 


Lys 


Met 


Glu 


Cys 


Asn 








900 










905 










910 






Thr 


Arg 


Arg 


Phe 


Val 


Val 


Asp 


Asp Arg 


Arg 


Glu 


Leu 


Gin 


Tyr 


Arg 


Val 






915 










920 










925 








Glu 


Val 


Gin 


Asn 


Arg 


Val 


Tyr 


Lys 


Lys 


Glu 


He 


Gin 


Ala 


Leu 


Asp 


Ala 




930 










935 










940 










Glu 


lie 


Arg 


Lys 


Leu 


Glu 


Arg 


Leu 


Leu 


Glu 


Ser 


Gly 


Leu 


Thr 


Thr 


Thr 


94 5 










950 










955 










960 


Ser 


Thr 


Thr 


Thr 


Ser 


Ser 


Ser 


Thr 


Ser 


Leu 


Leu 


Thr 


Gly 


Gly 


Gly 


His 










(1 c c 

965 










970 










975 




Leu 


Lys 


Pro 


Glu 


Leu 


Thr 


Val 


Thr 


Ser 


Gly 


He 


Ser 


Gin 


Thr 


Pro 


Ala 








n o r\ 

you 










985 










990 






Ala 


Ser 


Lys 


Asn 


Arg 


Thr 


Pro 


Ser 


He 


Ser 


Gly 


He 


Leu 


Pro 


Asn 


Leu 






995 










1000 








1005 






Leu 


Leu 


Ser 


Val 


Leu 


Pro 


Pro 


Val 


lie 


Pro 


Arg 


Ala 


Ser 


Trp 


Pro 


Ser 




1010 








1015 








1020 








Ala 


Glu 


Tyr 


Met 


Gin 


lie 


Pro 


Met 


Arg 


Arg 


Ser 


Val 


Thr 


Phe 


Ala 


Ser 


1025 








1030 








1035 








1041 


Gin 


Pro 


Gin 


Leu 


Glu 


Glu 


Ala 


Cys 


Leu 


Pro 


Ala 


Gin 


Asp 


Leu 


He 


Asn 










1045 








1050 








1055 


Leu 


Arg 


Leu 


Ala 


His 


Gin 


Gin 


Ala 


Thr 


Glu 


Ala 


Lys 


Thr 


Gly 


Leu 


He 








1060 








1065 








1070 




Asn 


Arg 


Leu 


Arg 


Gly 


lie 


Phe 


Ser 


Arg 


Thr 


Thr 


Ser 


Ser 


Asn 


Lys 


Gly 






1075 








1080 








1085 






Ser 


Thr 


Ala 


Ser 


Leu 


Ala 


Asp 


Gin 


Lys 


Gly 


Leu 


Lys 


Ala 


Ala 


Phe 


Lys 




1090 








1095 








1100 








Ser 


His 


Met 


Gly 


Leu 


Phe 


Thr 


Arg 


Leu 


He 


Pro 


Ser 


Ser 


Gin 


Thr 


Ala 


1105 








1110 








1115 








1121 


Ser 


Cys 


Asn 


Ala 


lie 


Tyr 


Asn 


Asn 


Pro 


Asn 


Gin 


Asp 


Ser 


He 


Pro 


Ser 



30 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 



PCT/US01/09341 



1125 1130 1135 

Glu Ala Ser Ser His Pro Asn Gly Asn His Leu Lys Pro He His Arg 

1140 1145 1150 

Gly Ser Leu Thr Lys Ser Gly Thr His Leu Asp His Leu Thr Lys Asp 

1155 1160 1165 

Pro Asn Phe Leu Pro He Pro Thr He Ser Gly Gly Glu Gin Gly Asp 

1170 1175 1180 

Gin Thr Leu Gly Gly Lys Tyr Val Lys Leu Leu Glu Thr Lys Val Asn 
1185 1190 1195 1200 

Phe Gin Leu Pro Ser Asn Arg Arg Pro Ser Val Val Gin Gin Pro Pro 

1205 1210 1215 

Ser Leu Arg Glu Arg Val Arg Gly Ser Pro Arg Phe Pro His Arg He 

1220 1225 . 1230 

Leu Pro Pro Thr Cys Ser Leu Ser Ala Leu Ala Glu Ser Glu Asp Arg 

1235 1240 1245 

Pro Gly Asp Ser Thr Ser He Leu Gly Ser Cys Lys Ser He Pro Arg 

1250 1255 1260 

He Ser Leu Gin Gin Ala Thr Ser Gly Gly Thr Trp Lys Ser Met Glu 
1265 1270 1275 1280 

Thr Ala Gly Lys Ser Arg Leu Ser Leu Gly Asp Ser Gin Glu Glu Glu 

1285 1290 1295 

Gin Gin Ala Pro Ala Asn Gly Thr Glu 
1300 1305 



<210> 16 
<211> 7913 
<212> DNA 
<213> Drosophila 



<400> 16 

ttttctgccc 

tcaaggacaa 

tgcgcgtggc 

gcggcacaac 

tatccataca 

aagcagaagg 

cgcaagaaga 

accctcaagg 

tagaatgcat 

gaagaagaaa 

tttgtgaccg 

taaaaaacaa 

ggatctactt 

gccttgtaga 

gcaaacttga 

tctttgatgt 

gtaattgata 

acatttggtt 

ccgacagctc 

gttcgcccca 

tagattcgct 

agttgcgatt 

cggaggcgat 

ccagcattag 

tcagtcgatt 

cggacgacga 

ttttctggtg 

gcagcaggaa 

ttgtgcagga 



gtttgtttgt 
gaaggaactc 
caaggtgacc 
gacaagcgtc 
gccgcgttgt 
agaatctgcg 
agacccgcgc 
agatccgcaa 
ccagatactc 
aactaaataa 
aaaaaacgcg 
aaaaacaagt 
aactacttgt 
agtaatggca 
aaaacaccag 
aagcacggcc 
tgaattacaa 
aatactattc 
cttggagttc 
gaacttgagc 
attagcagct 
gctggccgca 
gatgtgcagc 
cggcaggaag 
gtcttcgcgc 
ggagcccttg 
cagcaccgta 
gccgatcacg 
gaaggtggcc 



gctcccgtcc 
accaagcaat 
ggcggagctc 
acgaactaat 
ccgcaaggcc 
caaggtcttc 
tatccgcaag 
gcgctccgtc 
cgactgtcta 
aatctttttt 
agtggcacat 
gccgcattcc 
ggtccttaag 
aatcttaagt 
aattctcaat 
atacatcgta 
gctctcgact 
ttctgcacag 
ttgctttggt 
agcgcaaaga 
aaacccgata 
tagatgatcg 
caggggtacg 
cagacgagga 
gccttgcgcg 
gccgctgcga 
atgtagatgc 
aacagcgtct 
tcgtccaggc 



aggtgaaggt 
tggatgagct 
cctccaagct 
tccggtgttt 
atcgctcgcg 
aagaacaaga 
gccctgtctc 
ttcccccaga 
tagtgcgcgt 
ttaaagtgaa 
ggctttggcc 
ctgttctaca 
atcattatag 
tcaatcttta 
tggtctacgt 
aataattatg 
gcgagctcga 
cgtgaaacag 
gatagtttct 
tcttgtagta 
gcagccaaaa 
ggttaatgac 
aggtattgcg 
agcacagaaa 
tgcatgtggt 
tctggaagtt 
acgagtacga 
tcttgatcga 
ccatctcgcc 



taagtgctcc 
caagaatgag 
ctccaagatg 
tgtgcattta 
tctacattgt 
agtacaagcc 
cgcgcgacgc 
ggaagttcgc 
taagggttct 
aatagcgatg 
atgggttcat 
tatgttagtc 
agcaaaaacg 
attcagcctt 
atatgtattt 
aactaacaag 
tcatactgaa 
cggcgtgctg 
actcggcatc 
ggccactctg 
ctgccagcaa 
actggaagcc 
ctcgtcgtcc 
gatggtgacc 
ggtcatgtag 
atcgtggttg 
aacgatgatc 
tcgcccctcc 
ccagatgccc 



gagctgagga 
ttgctcagcc 
tgagtatgcc 
tgtgcgttct 
gatgcaccag 
cctggatctg 
caaccgcaag 
cgtcaaggcc 
tcggtacaat 
cgtgtttgat 
ggaaattagt 
ggatccaatg 
ttgagttagc 
ctttgaatat 
aattcgattg 
taataagtag 
tatgtttggc 
cggatgacgc 
ggcgacaacg 
aaatagaaga 
acttacctgt 
caggccatca 
accacattgg 
atcatgacgg 
gatccgccgc 
cgaatcttct 
accaggcaag 
ttcttgagta 
aatatgggcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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gcagctggga agataaaaac cgattagttg aatgtgtgca cacaaatcga agacgcctta 1800 
cttaccaaca gcagaaagga gacggcccaa acgaacaaca gctgcagggt tatgaactta 1860 
ggcttatata tctgcgagta gcggctgtgg caagcgatga gtatatatct gccaggggat 1920 
tagggtgacg gttgcggatt agcttgcgac aatgcggact agacttgggc ctacctgttc 1980 
agggtgatgc ccaccatgct gaggagtgaa acagccacat tgccatagaa gatcaccgga 2040 
aagatcttgc acaatgtggt accaaaagtc cagctctgtc gttggaatcg agcaaattat 2100 
aatcattcac taattgtata ttacattcac aggtataatg actcacctcc tggaagaatc 2160 
gcactgcggt cagtggcagg ctgaaggagc agaagagcag gtcggagatg cttagcgaaa 2220 
tgacgaaggc ggttgtggca tgctcccgta tcgtggggct cttgagcagc gccagcaggg 2280 
taatcaggtt gcctgtgcgg caattcagca tagagaaaca ggttagctaa tcgaggagct 2340 
gaatgtggga tgccaattac tcaccaagaa cgccgatcgt cacaaagaca caggcactaa 2400 
tggccgcgaa taatgtcgcc gaatgcgggt agatcgattg cgtggcggcc gccggttcat 2460 
ccatctgcat gtctgcgtcc tgaaagtagc ccgttgccat gcccatgtcc tgatccattc 2520 
tgcgggggaa acaagcaaac gatgtgatta gccaatttca attacttatt tttgaacagg 2580 
aaaaggagct gtatctttta aattggttgc attcaaatcg cttatgctaa tcctattgtt 2640 
tttttttttt taatttgtta gcccgtatca tgagtaatta aaaaggaagt taatagccgg 2700 
atgtagacaa ctcaatatgc atccgcttgg ttatttgaaa acaattaact tatgaagaga 2760 
aatgaatgaa attaccattc ctaattgccc agaagcatga tgcttttgtt aagaagggat 2820 
tgttaagaat ttaactatct aaacgaacta atgtgtacat ttctgtgctc cgcgataaag 2880 
gaaatgtaat cagcaaaatc aagtttcagg cgcgtggaaa atctgtgaat gaatgaacgt 2940 
acgagtaata caaaaagata actaatcacc gctttgcgga tgcaatagcc cagttttgtc 3000 
gtcggtgaca ccgagtaaaa acccgatccg atcgacggga ttagcgagga agtctgcgta 3060 
ctcccaccag cttatcgcat ccgagacatt tgatttatgt aaacggattt tgttataaga 3120 
tttctgccga gccaagctgg ccaactggcg aatcgtaatg atcgagccgg tttgcacgcc 3180 
atttgaattc acatttgtac gaattttttt tttatcaaaa gttcgagttt ttcaccaatt 3240 
tcctcatcaa ccgagcaagg caaacggctt tgaataatat ggtgttatat atacatatat 3300 
caaatcgctg ctgactgcgt gattgatggc cccaagatta catattatcg aatcaggatt 3360 
cagaaggaga tcaatgtcaa atgcggacag gaacatgaga gacgcctgtt atgcgcaatt 3420 
aaaaatttgg gtttaattgc tgtggaaact gttgttggtg gcatcttaag ttcctgttta 3480 
acaacatcaa ctacttatgt acgtagaagc gtttaagcca tttgcataca gatgagaact 3540 
ggcttttgtg ctaatcagcc aagatgactc cgatgatgat gactcattac ctgaccagtt 3600 
ttcgctgctt tcttttcaac aactacttgt atatgtattg tatccaatag caatacattg 3660 
gatttccatg gtctagtcac gtattatcat ttaattgacg ccaagtcgtg ttattgttga 3720 
gctatcgagt tcagctcaaa catttcttat tcccatgaat aagccggcaa aaatatgcaa 3780 
tctatgaaag ttaatataag caaaccttac tttgactcaa taccaatgca ctttgtgtcg 3840 
ataggttcac gcaattgagg cgattattcc gataacccaa gcgattgact gttcccgttt 3900 
cgattccaat tgaaatttgg aaatgtacaa tagttttgct atatgctgtc aagtacgctc 3960 
ttatcttctc tgggttttct tcagagtttc gaaacgcttc ttcttttttt tgtttttttt 4020 
tttttggaat ctcgtatttt ggaaggggct cccctctgga atttgttaca ctgtcgttat 4080 
cattgcgaac aagcggcccg aagctatcag cgactttaac atttacaatg cactttttta 4140 
cgaccaatta aatgtacatt ttcctttctt cgcccgttga taagcgaacg cgatgtggcg 4200 
caggcaatgt gttgctcttg cgacacaaac gcaatcaaaa tggattcaat ttcgcttttt 4260 
cccagtgaaa cgaagaacga accgaccatc atgatatgct gctctgcatg ttgcgtattg 4320 
aatcaatgac aatttcaatt aagccgcccg ttcgtcatgc gttttcgtgg gcttcgaaat 4380 
gctgataacg ctgctgtcct ccaactgctt tgcatgtgga cacaattcca tttatttaat 4440 
tcttttattt ggatcggtta aattaaaaag cgccttgtta cgcatttaac gttgtttccg 4500 
gtgcgtggtg gtttcatgct tctgggaacg gcaaatgggt ttaggattgg gaacccctca 4560 
tcatccgtcc aaatatgggg agttcaagga atggaggggg gaggggggga gggggcggag 4 620 
cagcggtccg ctttcatgca tgatcaacat gtgtgtttgt tttccaactc tatgtgagtt 4680 
gtttcagttc ttttcggtct cctcggttct actcagttcg ttcttacttt ctggcttttg 4740 
tttccgaata gggggagcgc atcgggccgc aactggaaat ggaaatctga atatgcatcc 4800 
cggagcatcc cgtccgaaat gagcaccatg cgccatgccg tttcgtttaa ttccctaccc 4860 
ataaatgcgt gtccaaacag cagagggcat ttcaagaggt ccaactatta gctatgtgca 4 920 
tatataattc tctagatata ttaattcagc taaagtgcta accgcgagtg tgccgccagt 4980 
tgcccctcaa cagtttgcaa cgatgatgat ggcacagata tcagagataa cactcagatc 5040 
gcgcccctca caagtatgct cattatgtac attcatatgt acatatgtac acacagctaa 5100 
gcacataagg aatacgatat gcgctatgta catatataca tatttacatc aaagtcttat 5160 
cagcctcggg ttcccaccgt ccaaaaaaaa aggcggaacg aagctttata aataatcaat 5220 
caacaaatat ttacaatcaa gtatttatgt agcgggcgga aagtcgaatg caaataaggt 5280 
tagacgcatt tgtacatatg tattttctta aagggttcct ctgatcaata tctgcccgtt 5340 
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tttatctagc cccgtttata aggctgtgaa tgccccgatt gtgttggtcg tgatatcaat 5400 

taaatcggca taatctcatg tcatcttttt aaacattatg atagataaaa tgtattataa 5460 

cgactctaat tacattggga ttaaatgggt tttcggaaaa tacctatgca gtaaatcaaa 5520 

tttaagatca atgtgcagtt atttggggtt tcttgagtta ggacctctgt tttaatattg 5580 

agaattactc gatattatct tccgttttat tttattttct gatcgtgtcg gtgcacgaga 5640 

aatacatatt tcccactttt gttgcatttc gctgactaat cgaaagctaa ataccaaata 5700 

agtttccgat atggggggta tgtatgtaca tacatacata tatagtacga gtatatataa 5760 

tatacactga ataagcccga gcatacatat gtacatatgt agatagtgca catgtgtatt 5820 

gtggatcgtt ttggagctgg gtttttcttt gcttttcgtc gtttttaaga gacttgagct 5880 

gaatgagtcg ctaccaactg ttgcggtggg ttaaagggtg gggggagggg gcgaatgggg 5940 

atgggggatg aagatgatga ttcatgcagc tgatgtttgg acagggaaga aagaaagagc 6000 

gaaagagaaa gcaaatcgga agcggcggta tcaaacaatc tgtttaaata acaacaaaag 6060 

actttataag cctgactttc atttccattt ccaatttatt tttcccaagt ctggctcatt 6120 

ttgtgcaaat aaacaaatga ctcatttgtt taaaaattcg atgagactca aggaatgtca 6180 

cattcacaaa aattttcaat ttgaaatctg aaaagattca acagcataag caaatgaatt 6240 

taagattgcg atatttgggt attatcctta attggctggc tgcctaatgt agatttgttt 6300 

ttctcggtgt accgatgcat cctctttgcc catccacttc tttctcccct ctttgctgat 6360 

tttttttttt tttgttttgt gagagggtgg aattcgggat ttgggatgtg agcgagaagg 6420 

atggccatat tttgaattgt ggaaacagcc gttccattat cgttccatta caaatgcaca 6480 

cacacacaca cacacacgcg cgcgtgcagt ttgcgcttct ctgtgtgtgc gtttggacca 6540 

aatgcaataa caataaaaac gcgcaactca aattgttaac aaaggtaagg ccgttttttt 6600 

ggccggattt ggattctggt cttgaaagag aaacacacac acacacacac gcacgcaaaa 6660 

aaccaacgaa aataataaca aaacaacaac tcacccgaac tcgaacccgt ttcactgttt 6720 

tccgcttatt ataaatttat acttcgtttg gcgtttctta aatgctttcg aattgcattt 6780 

tggatttgac attaaatcgc tccgcgcagc acagtaacaa aaataaacgg aacaaaataa 6840 

atacaaccag ccgattctgg ccttctcaca attccgactt ttgaattatc gaaacagaag 6900 

cgctgcttgg gccaactaag ccaacttaac taaccgatcg cagagcgcaa agtgtcgtat 6960 

tttcactgtc gtcaattaac aacagcggca acaagaacaa caaacgaaat gagcggctgg 7020 

gctacagttt tttagtccga gcgcgatcga acatgttggc tgcctggcaa caagttcgat 7080 

gcgaactttt ctattgatct aagaaatcgg cgttctatcg attgacgcgc tgcgcaggcg 7140 

cctatcgatt tttgaaattc taagcccgcc ttaaaggata tttgtagaaa agtagcttgc 7200 

aaagaaatta aattccttga aataaaaaag aaatttagta ctctcgttct atactcactt 7260 

ctattcactt gcggttagtt agttagtttt taatgccaca tttttatgat gatgactttt 7320 

tttcagctca tctatgcgca atcaattttg cgccaagcga cagctgcaca tagtttcttt 7380 

tgaccaagca atcatttatc atataatttt ccaagtagta gtgccaagtt cataactgat 7440 

ggagtgtgct gccaatttgc atgttgtgga gccattcagt ttcaaggtcg cgctaattaa 7500 

atactcgtta caagtgatgc caaatgtata aatatttgaa cagaacgtat taaattaata 7560 

ttatccacta agagcactat tttaaattct taaaaaccag atgcacgtca aatagatgta 7620 

cataaaaagt atatttactt acaaatatca tacacggacc tgttttattc ctcatcactg 7680 

gccattagtt gataacatgt ggacttctat gatatcctgc tgcatttttg ccgtttagtc 7740 

aggatttaga gcttgtacac aatgcgacag ctgaaagggc gaaatcggtg caatcgtgct 7800 

gttcggcatc tgaaagttca gggcaagatg tggttgaaaa atcttaaaag cggtttggaa 7860 

caaattcgag gtgaatctat ccaaatcgtt agaatcccat gagtacgttc ate 7913 



<210> 17 
<211> 1399 
<212> DNA 
<213> Drosophila 

<400> 17 

ggcccaagca gcgcttctgt 
cggctggttg tatttatttt 
aatgtcaaat ecaaaatgea 
tataataagc ggaaaacagt 
ggcaaeggge tactttcagg 
atcgatctac ccgcattcgg 
gateggegtt cttggcaacc 
ggagcatgee acaaccgcct 
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ttcgataatt caaaagtegg 
gttccgttta tttttgttac 
attcgaaagc atttaagaaa 
gaaacgggtt cgagttcgga 
aegcagacat gcagatggat 
cgacattatt cgcggccatt 
tgattaccct gctggcgctg 
tegtcattte gctaagcatc 



aattgtgaga aggecagaat 60 
tgtgctgcgc ggagcgattt 120 
cgccaaacga agtataaatt 180 
atggatcagg acatgggcat 240 
gaaceggegg ccgccacgca 300 
agtgcctgtg tctttgtgac 360 
ctcaagagcc ccacgatacg 420 
tccgacctgc tcttctgctc 480 
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cttcagcctg 
attgtgcaag 
ggtgggcatc 
atataagcct 
gttgctgccg 
ctgcacaata 
cctgctgcct 
ccagaaaaag 
gtcgtccggc 
tcgactgacc 
aatgctggcc 
cgcctccgtg 
tcgcaactac 
gttgtcgccg 
cgtcatccgc 
ccaaacatat 



ccactgaccg 
atctttccgg 
accctgaaca 
aagttcataa 
cccatattgg 
ctcaagaagg 
tgcctggtga 
aagattcgca 
ggcggatcct 
gtcatgatgg 
aatgtggtgg 
atggcctggg 
agagtggcct 
atgccgagta 
agcacgccgc 
tcagtatga 



cagtgcgatt 
tgatcttcta 
gatatatact 
ccctgcagct 
gcatctgggg 
aggggcgatc 
tcatcgtttc 
accacgataa 
acatgaccac 
tcaccatctt 
acgacgagcg 
cttccagtgt 
actacaagat 
gaaactatca 
tgtttcacgc 



cttccaggag 
tggcaatgtg 
catcgcttgc 
gttgttcgtt 
cgagatgggc 
gatcaagaag 
gtactcgtgc 
cttccagatc 
cacatgcacg 
tctgtgcttc 
caatacctcg 
cattaacccg 
ctttgcgctg 
ccaaagcaag 
tgtgcagaag 



agctggactt 
gctgtttcac 
cacagccgct 
tgggccgtct 
ctggacgagg 
acgctgttcg 
atctacatta 
gcagcggcca 
cgcaaggcgc 
ctcgtctgct 
tacccctggc 
atcatctatg 
ctcaagttct 
aactccaagg 
aatagtatta 



ttggtaccac 
tcctcagcat 
actcgcagat 
cctttctgct 
ccaccttctc 
tgatcggctt 
cggtgctgca 
agggctcctc 
gcgaagacaa 
tcctgccgct 
tgcacatcat 
cggccagcaa 
ggggcgaacc 
agctgtcggg 
accaaatgtg 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1399 



<210> 18 
<211> 386 
<212> PRT 
<213> Drosophila 

<400> 18 
Met Ala Thr Gly 
1 

Ala Ala Ala Thr 
20 

Ala He Ser Ala 
35 

He Thr Leu Leu 
50 

Thr Thr Ala Phe 
65 

Ser Phe Ser Leu 

Thr Phe Gly Thr 
100 

Asn Val Ala Val 
115 

Tyr He Leu He 
130 

Lys Phe He Thr 
145 

Leu Leu Leu Pro 

Glu Ala Thr Phe 
180 

Lys Lys Thr Leu 
195 

He Val Ser Tyr 
210 

Lys He Arg Asn 
225 

Ser Ser Ser Gly 

Ala Arg Glu Asp 
260 

Cys Phe Leu Val 
275 

Asp Glu Arg Asn 



Tyr Phe Gin Asp 
5 

Gin Ser He Tyr 

Cys Val Phe Val 
40 

Ala Leu Leu Lys 
55 

Val He Ser Leu 
70 

Pro Leu Thr Ala 
85 

Thr Leu Cys Lys 

Ser Leu Leu Ser 
120 

Ala Cys His Ser 
135 

Leu Gin Leu Leu 
150 

Pro He Leu Gly 
165 

Ser Cys Thr He 

Phe Val He Gly 
200 

Ser Cys He Tyr 
215 

His Asp Asn Phe 
230 

Gly Gly Ser Tyr 
245 

Asn Arg Leu Thr 

Cys Phe Leu Pro 
280 

Thr Ser Tyr Pro 



Ala Asp Met Gin 
10 

Pro His Ser Ala 
25 

Thr He Gly Val 

Ser Pro Thr He 
60 

Ser He Ser Asp 
75 

Val Arg Phe Phe 
90 

He Phe Pro Val 
105 

Met Val Gly He 

Arg Tyr Ser Gin 
140 

Phe Val Trp Ala 
155 

He Trp Gly Glu 
170 

Leu Lys Lys Glu 
185 

Phe Leu Leu Pro 

He Thr Val Leu 
220 

Gin' He Ala Ala 
235 

Met Thr Thr Thr 
250 

Val Met Met Val 
265 

Leu Met Leu Ala 
Trp Leu His He 



Met Asp Glu Pro 
15 

Thr Leu Phe Ala 
30 

Leu Gly Asn Leu 
45 

Arg Glu His Ala 

Leu Leu Phe Cys 
80 

Gin Glu Ser Trp 
95 

He Phe Tyr Gly 
110 

Thr Leu Asn Arg 
125 

He Tyr Lys Pro 

Val Ser Phe Leu 
160 

Met Gly Leu Asp 
175 

Gly Arg Ser He 
190 

Cys Leu Val He 
205 

His Gin Lys Lys 

Ala Lys Gly Ser 
240 

Cys Thr Arg Lys 
255 

Thr He Phe Leu 
270 

Asn Val Val Asp 
285 

He Ala Ser Val 
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290 










295 










300 










Met 


Ala 


Trp 


Ala 


Ser 


Ser 


Val 


lie 


Asn 


Pro 


He 


He 


Tyr 


Ala 


Ala 


Ser 


305 










310 










315 










320 


Asn 


Arg 


Asn 


Tyr 


Arg 


Val 


Ala 


Tyr 


Tyr 


Lys 


He 


Phe 


Ala 


Leu 


Leu 


Lys 










325 










330 










335 




Phe 


Trp 


Gly 


Glu 


Pro 


Leu 


Ser 


Pro 


Met 


Pro 


Ser 


Arg 


Asn 


Tyr 


His 


Gin 








340 










345 










350 






Ser 


Lys 


Asn 


Ser 


Lys 


Glu 


Leu 


Ser 


Gly 


Val 


He 


Arg 


Ser 


Thr 


Pro 


Leu 






355 










360 










365 








Phe 


His 


Ala 


Val 


Gin 


Lys 


Asn 


Ser 


He 


Asn 


Gin 


Met 


Cys 


Gin 


Thr 


Tyr 




370 










375 










380 










Ser 


Val 






























385 

































<210> 19 
<211> 3574 
<212> DNA 
<213> Drosophila 



<400> 19 

gttcgtttga 

ccctactcat 

gtttctagtc 

aatcacacac 

tcttcgctct 

attaagtaat 

tcataatcaa 

ggtctgtcca 

cttttgcctg 

ttgatgcttt 

tggttaggag 

caaacacaac 

aattgctcga 

aatgatgctc 

agcaaccaac 

agattaactt 

taaacacttt 

ttgccgtggt 

acgtggagca 

acctcacatt 

agacctggcg 

catttccatt 

agcaccttca 

atgtcggtga 

acaatagata 

ttttgctgcg 

taccctactt 

tgtacaatat 

gtaagctatg 

gttgtggggc 

tagccatttc 

gaaggtaata 

agggttggga 

tcctcttttc 

gcactcacag 

aatgatttaa 

tgcgcctgct 

ggcaacgcct 

tcaggctgcg 



tcggttgacc 
agatgaagtt 
ttaaataatt 
caagcggaat 
tccagctctt 
tcaatgaaat 
aacagaagtg 
tccgaagcgc 
gcttctgctc 
ttgctttgtg 
aacagcaata 
cggtggagga 
atttatgggt 
ctgccttttt 
tagatgaaac 
caaaattagc 
ctatctgtgc 
tggcaaccta 
ctcgctctac 
gattgcctgt 
gcatagcggc 
tcgtaatgga 
gctgccagtc 
cgaggcccct 
tgtaataaag 
gacatttgct 
gaattttgtt 
atattttctg 
ataaattatt 
ataagtttcg 
tacggcaaca 
caccatttcc 
gtactcggcg 
ctacattcga 
cggtcgcgag 
ttattgaaat 
ggctgcccat 
ttaaatgatt 
aaatctcgcc 



tcgatttatt 
ttttgcacac 
ggcaagatta 
tatactaagg 
gtctttatta 
tgacagctgt 
ctgccaatga 
aatcatcata 
ctgcgaacgc 
cccaagtctc 
atgaggcaca 
aactggtgtc 
tggtagtggc 
agttcgatac 
ctattttaag 
agttactact 
acttttatct 
ttggtcctgc 
cttcgccatt 
gcggatatca 
gtttgtgcct 
atgctaatgt 
gtcgacgctg 
caagccacgg 
tgctgggcaa 
gatttgttga 
cgtaaatgca 
ttgcttcatt 
gaaatgacaa 
gattagctgc 
atggtgtctg 
ttggaaacgc 
ctgctgttca 
atgttgcaag 
aatgtggtag 
atcatgtttt 
aattgtggtc 
tattttgatt 
cgatctttac 



tactgctatg 
tttaggtgta 
tctgcctaac 
tttatttccc 
aacctggcca 
ttgcttgggt 
atgggaagcg 
taattgtcat 
tgatgttgct 
gtgactccac 
ccaaccggcc 
ggccgcacag 
ataacgacaa 
acacaaagaa 
gacggttgaa 
actaataacg 
gcttgaatag 
tcggtcgcta 
tggccgccag 
gttttcgcgg 
tcgcaggtgc 
gtattacgta 
ctgctcacat 
gatacggaaa 
agttattgtt 
ctgctcaaat 
attatgtttg 
ttagtttgca 
cttccacagc 
agctccactt 
cttatcgctg 
ctgttctaat 
tcctggtcaa 
cgataaggga 
caactcggta 
tacagctttg 
aaattggctg 
atcaacttat 
gcctggctgg 



caaagctaaa 
acttggtgct 
ttttggtgac 
ttggcatcag 
aggacctgta 
caagcataat 
gaagtctggc 
ggaaccacac 
ttgtgctttg 
tccactcgga 
aattggcaat 
ggaagactta 
ttgtccaagt 
aaggggacca 
ttgttggctt 
agctaagcca 
tgtctgggtg 
cttctacaaa 
tgatttcctc 
cgagtatatc 
gtatccttcg 
ttcgccgtgt 
tggtcacctg 
aagttcggta 
tattctatta 
atcaatgcac 
ttttgaccgc 
aaagctatgg 
attgtcctac 
cttcccaatc 
catatccacg 
attccaaaat 
tacgttgtca 
ttcgggtggc 
agtttggata 
ccatcattgt 
ctctttcagg 
tcgtatatat 
cggttcttgt 



attcaaaaat 
ttaacttggc 
cccaactttt 
actgtcttct 
acttcgttta 
tgatgactac 
ataaattaca 
ttggcactca 
tgctttctgc 
gtggcatcca 
tgtgcgaggt 
aattttcttt 
caaggtcaca 
tacatataca 
ttaacatagt 
ctcatttgcg 
atggcctcga 
tcacggagca 
atgggcattt 
aaatacgagg 
acatcctagc 
aggcttcctc 
ggatcgtctg 
agtgtgaact 
cattaattac 
agaatttgtt 
cgccttacta 
ttttgtatgt 
gccttctgct 
cgtactttgg 
atccctatgc 
gtatattttt 
ctgatcttca 
ggaatgcgaa 
attcaagtaa 
gaccaccgat 
caagttcatc 
ataacccctt 
gctgccagtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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aactcggccc 
cgtcgctact 
acccagactg 
ggcggcgggt 
gcaactggct 
acaacaggac 
atacttattc 
ccttagacaa 
tcgttttaag 
ccatttattc 
tttaatttta 
tcgaaggccg 
acatcgagat 
gtttcggtgg 
tacttaatgt 
gtattcttta 
gcggccattt 
gctcccccag 
attagaagaa 
tcttgaagtc 
gcttcttagt 



tcaatccagt 
gtcacaccct 
cctacgagtc 
cggggcgaaa 
taaaatgcag 
acttccatgt 
taatatcaat 
gagattccag 
gcaaacctct 
gacctgccca 
taaatacata 
caaatgcctg 
ccgcctggac 
gcgaaaactc 
ccctgtgcag 
gagctacaat 
gaagatcggg 
cgaagcactc 
aaacttcaat 
ataaatcaca 
atgcgagcca 



gctctacaca 
gcccagctgc 
cggactgagt 
gcggatgtca 
cacgagatca 
aattggtccc 
tttaaattac 
ctatttaagt 
tacaactgtc 
atgggatatt 
ttcagatgtt 
ccatccggct 
gtgatagggt 
agcgctagcg 
atccttttga 
gccccagttg 
gaacggagtg 
gtaccagttt 
tatagagtta 
tccagggagt 
tacttcttaa 



ttgaccactg 
tcgctggtga 
gtcagtctgg 
caccggcaga 
acagactggg 
ttagactatt 
ttataacatt 
ttcttttttg 
caaagaatat 
gttacatgtg 
aattgaagct 
ttctatgcgt 
ccctcgttca 
gacagaatct 
cgcgaaagaa 
tacttacaaa 
tttatcaggt 
ttaccaatgt 
agcattacat 
tgagcagcct 
agga 



cggcctttaa 
acaacgagac 
cgcacttggg 
tgagctatct 
gctagtgggt 
ggtattatac 
tgaggatatg 
tatattattt 
cctaactatt 
atctaatagt 
tttgaacata 
tgccctgagg 
cataggccgt 
ccataatttg 
ggaactccac 
tggttttcaa 
ccataatgat 
tactgaaaaa 
gatttctgtg 
cttcttgatg 



gcaacagctg 
ccgatcccag 
cggtggtgtg 
gtaggcgggc 
gggtggcatt 
gtacttattg 
ataacataaa 
attttaccaa 
tgttgcatcc 
taaatcaaat 
ggacttgcaa 
ttcagtggct 
gaaacggatg 
ccacttcata 
agcaccgccg 
gaacgacaag 
attagcatgc 
gaagagaatc 
gcaggcggcg 
tgcgccgata 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3574 



<210> 20 
<211> 1015 
<212> DNA 
<213> Drosophila 



<400> 20 

tgtctgggtg 

cttctacaaa 

tgatttcctc 

cgagtatatc 

cctcagcacc 

tctgatgtcg 

acgccttctg 

tccgtacttt 

cgatccctat 

gtcactgatc 

tggcggaatg 

cattgtgacc 

ttcaggctgc 

gaactcggcc 

gcgtcgctac 

gacccagact 

gggcggcggg 



atggcctcga 
tcacggagca 
atgggcattt 
aaatacgagg 
ttcagctgcc 
gtgacgaggc 
ctgttgtggg 
ggtagccatt 
gcgaagggtt 
ttcatcctct 
cgaagcactc 
accgattgcg 
gaaatctcgc 
ctcaatccag 
tgtcacaccc 
gcctacgagt 
tcggggcgaa 



ttgccgtggt 
acgtggagca 
acctcacatt 
agacctggcg 
agtcgtcgac 
ccctcaagcc 
gcataagttt 
tctacggcaa 
gggagtactc 
tttcctacat 
acagcggtcg 
cctgctggct 
ccgatcttta 
tgctctacac 
tgcccagctg 
ccggactgag 
agcggatgtc 



tggcaaccta 
ctcgctctac 
gattgcctgt 
gcatagcggc 
gctgctgctc 
acgggatacg 
cggattagct 
caatggtgtc 
ggcgctgctg 
tcgaatgttg 
cgagaatgtg 
gcccataatt 
cgcctggctg 
attgaccact 
ctcgctggtg 
tgtcagtctg 
acaccggcag 



ttggtcctgc 
cttcgccatt 
gcggatatca 
gtttgtgcct 
acattggtca 
gaaaaagttc 
gcagctccac 
tgcttatcgc 
ttcatcctgg 
caagcgataa 
gtagcaactc 
gtggtcaaat 
gcggttcttg 
gcggccttta 
aacaacgaga 
gcgcacttgg 
atgagctatc 



tcggtcgcta 
tggccgccag 
gttttcgcgg 
tcgcaggctt 
cctgggatcg 
gcattgtcct 
ttcttcccaa 
tgcatatcca 
tcaatacgtt 
gggattcggg 
gctttgccat 
tggctgctct 
tgctgccagt 
agcaacagct 
cccgatccca 
gcggtggtgt 
tgtag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1015 



<210> 21 
<211> 334 
<212> PRT 
<213> Drosophila 



<400> 21 

Met Ala Ser lie Ala Val Val Gly Asn Leu Leu Val Leu Leu Gly Arg 

15 10 15 

Tyr Phe Tyr Lys Ser Arg Ser Asn Val Glu His Ser Leu Tyr Leu Arg 

20 25 30 

His Leu Ala Ala Ser Asp Phe Leu Met Gly lie Tyr Leu Thr Leu lie 

35 40 45 

Ala Cys Ala Asp lie Ser Phe Arg Gly Glu Tyr lie Lys Tyr Glu Glu 
50 55 60 
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Thr 


Trp 


Arg 


His 


65 








Phe 


Ser 


Cys 


Gin 


Arg 


Leu 


Met 


Ser 








100 


Val 


Arg 


He 


Val 






115 




Leu 


Ala 


Ala 


Ala 




130 






Tyr 


Gly 


Asn 


Asn 


145 








Ala 


Lys 


Gly 


Trp 


Leu 


Ser 


Leu 


He 








± O \J 


lie 


Arg 


Asp 


Ser 






195 




Asn 


Val 


Val 


Ala 




210 






Cys 


Trp 


Leu 


Pro 


225 








Glu 


He 


Ser 


Pro 


Val 


Asn 


Ser 


Ala 








260 


Phe 


Lys 


Gin 


Gin 






275 




Leu 


Val 


Asn 


Asn 




290 






Gly 


Leu 


Ser 


Val 


305 








Ser 


Gly 


Arg 


Lys 



Ser Gly Val Cys 
70 

Ser Ser Thr Leu 
85 

Val Thr Arg Pro 

Leu Arg Leu Leu 
120 

Pro Leu Leu Pro 
135 

Gly Val Cys Leu 
150 

Glu Tyr Ser Ala 
165 

Phe He Leu Phe 

Gly Gly Gly Met 
200 

Thr Arg Phe Ala 
215 

He He Val Val 
230 

Asp Leu Tyr Ala 
245 

Leu Asn Pro Val 

Leu Arg Arg Tyr 
280 

Glu Thr Arg Ser 
295 

Ser Leu Ala His 
310 

Arg Met Ser His 
325 



Ala Phe Ala Gly 
75 

Leu Leu Thr Leu 
90 

Leu Lys Pro Arg 
105 

Leu Leu Trp Gly 

Asn Pro Tyr Phe 
140 

Ser Leu His He 
155 

Leu Leu Phe He 
170 

Ser Tyr He Arg 
185 

Arg Ser Thr His 

He He Val Thr 
220 

Lys Leu Ala Ala 
235 

Trp Leu Ala Val 
250 

Leu Tyr Thr Leu 
265 

Cys His Thr Leu 

Gin Thr Gin Thr 
300 

Leu Gly Gly Gly 
315 

Arg Gin Met Ser 
330 



Phe 


Leu 


Ser 


Thr 








80 


Val 


Thr 


Trp 


Asp 






95 




Asp 


Thr 


Glu 


Lys 




110 






He 


Ser 


Phe 


Gly 


125 








Gly 


Ser 


His 


Phe 


His 


Asp 


Pro 


Tyr 








160 


Leu 


Val 


Asn 


Thr 






175 




Met 


Leu 


Gin 


Ala 




190 






Ser 


Gly Arg 


Glu 


205 








Thr 


Asp 


Cys 


Ala 


Leu 


Ser 


Gly 


Cys 








24 0 


Leu 


Val 


Leu 


Pro 






255 




Thr 


Thr 


Ala 


Ala 




270 






Pro 


Ser 


Cys 


Ser 


285 








Ala 


Tyr 


Glu 


Ser 


Val 


Gly 


Gly 


Gly 








320 


Tyr 


Leu 







<210> 22 
<211> 8412 
<212> DNA 
<213> Drosophila 



<400> 22 

ttttccgcca aattcgctgg cttttaagtg ctcacataaa tgaaattttc aaatagcaaa 60 
taacatagga ataatccttc taaatcgcaa tataaaaaca tatatacaag gagtggtaaa 120 
tgacagtatt ctattaatgg caaaacgatt tatttaaaca gacgtcgcat ccgcataaac 180 
gatctttcac actttttatt aacggttagt ataaataaaa aaaaacatta atccttagtt 240 
tatttaacca attttgtttg ctgaaatcga ttcttgaaag agaagctgac atttttttta 300 
tatatgcctc cctacttcag gcagtgcgaa cataccgcat caacgcgact tgtagcggcc 360 
gtgtgaaaag gctattttgt ggttagcgca gatctgcgct gcagtgtgac cccggctaca 420 
ggtatttcgg catcgcacag ggtcacactc gcccaggcac ccgtataaaa tccccgcccc 480 
cctgatcgac gatagtattt cgaaaaaaac gcgaggaaaa gtctgatgcg cgttaaaatc 540 
ggagcgcaaa tcgtgcatat attttttttt acacaccgcg cgccgtgttt acgactgaaa 600 
acggagactg actgaaaagc cttggcgcac aagcacgcaa tgaaaacgca acgataagaa 660 
tgtgatattg tgataaatac atgattcgct gacaactcta ttttatattg gaatgcattc 720 
gaaatcgaac gcagcgcagc gtgtgttgtt tacttaacga ttttcggtgt ttgtttgccc 780 
aacttgcggc ggtgaaattc agcattttct gattttccat ttgccaacga aagagaggaa 840 
aagtgaaaag tgctattcga cgaagcgaaa cggaacgcaa aatcaaaaga aaaacagcat 900 
agctaaagct atgtgtagta catagccaca taaattgcaa attgaggcca agagacaaga 960 
gaacctgaaa tccacacaga tcttgacgag tgagctaacg atggacagga gtcggagtag 1020 
cagagctgcc agcagcaacc agttcatcag gccatgtggc ctactgacga ccgtcatcct 1080 



37 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 PCT/US01/09341 

gctgcagacg cttgtgtcga tgagtttggc catcgaggag atgtcgccac caccggccgc 1140 
cccgccacgt ccatcgccgc cgcccaccgt gaagctcaac aagtgctgcc attcgggcga 1200 
gtatctgaac gatggcacct gcatcgccgg cagcgaagcc ctgtggctgc ccatggtgta 1260 
cttggtgcag cagcagcgct tcttcgagcc ccatggcgcc agtccgcgat tcttgaagtt 1320 
cctgcccaat acgagaccca cgtgccgaaa ggaccagacg acggagatct ttcgcagccg 1380 
cggagccaat gtgatgctct ttcccaatgg cacgctctat gtgcgggaac gcgctctaat 1440 
ggtccagccg tcggattact gtgtcgactg ggaggtggcc gtcgtttgct tgaacgacag 1500 
tcagccgatc aatgctctgg aggatcccga ttatgcagcc aatcctctcg tgcagcagga 1560 
gccgcccaag gctagcctgc gactgagcaa gtgctgcggc aagtggggca gctataacac 1620 
gcagctccag aactgcgatc tgcagcccaa ccatcaggca gccgtcgatg gactgctgcg 1680 
attgtcgccg cagctaccgg agggcagcta tcagactagc tacggcctgc ccgactgtgg 1740 
ccaacccggt ggctactcca ttgccggtga ttggcaggat gccaagctgg atcggaacac 1800 
ggccatgctc cagttgccgc acaagaacct cagcgccgga caatactgcc tggagcacac 1860 
gcagcgcgag ggcgaggtga agatcatagc ctgccagcat ttgtttagct ccgcggcagg 1920 
agccggcatc cacgacggct ccattggagg caccatcgag caggccaatg gccagaatct 1980 
gcagaaggcc gtcctgactg gtggcatact tgtgtccatt gtcttcctgt ccgccacttt 2040 
ggtggccggc ttcctgctgc ccgctgtgca tcatgcgctt cattggcgct gccaaatctg 2100 
ctatgtcacc tgcttgctat tcggcaaaat actgctggcc atcgaggagc tgagctccag 2160 
cctgcagccg ggcagtgccg cctgccatac gcttgctatc accatgcagt tcttctttct 2220 
ggccgccttc ttttggctga acaccatgtg cttcaacatc tggtggacgt tccgggactt 2280 
tcggcccagt tcgttggaac gcaatcagga ggcactgcgt cgctatctgt actccctgta 2340 
cgcttggggc ggtcctctgc ttatcacctt cgtggccgcc tgcgtggacc agctgccgga 2400 
gacgacgcta ctgcgtccgg gattcggaca gctctactgc tggttcgaca atcgcaatct 24 60 
ctcgatcttc gcctacttct acggacccat tggcctgctg ctgtgcgcca atatagcgct 2520 
cttcgtgtcc accacccatc agttgacgtg cggcctgtgg aaacgggacg atgtcaagtc 2580 
ctcatcggag aagtccgccc tgggacgcgt ctgcctgaag ctagtcgtcg ttatgggtgt 2640 
cacctggata gcggacattc tgtcctggct agtgggtgga ccgcatggcg tatggttctt 2700 
caccgacctg atcaacgccc tgcaaggcgt cttcatcttc atcgtggtgg gctgccagcc 2760 
gcaggtgtgg accgcctgcc gcaggatctt ctgtccgcga ttgcgtcatg acatcaccaa 2820 
tacgaccaac ggtgtccagc attcgagcag ctcgcagggt ttgccctcga tggccggcgg 2880 
cacggagatt acacagaaca ccaccaccac caccacgaca accaacacga ccgccactca 2940 
catgcccagc aatccggcgg aggatgaggt accggagaag gctcccattg cccccgtcgc 3000 
acccatcgtc aaaatggaga ccatttgcta aacgtaagtt gaacacccga tagccaatga 3060 
agtgtgcttg taaagagata gctctgtttg ccagcgcgca aacaaagcgc ctgctgttcc 3120 
cactcccagc gcagtttgcc aagtggctta caaaacgcgg ctttaagcac acacacacac 3180 
acacacacac ttgctgatag acttatcggc cttaatgggc tttagggccc agccgggaat 3240 
gcaaagtatc tcaatgagct ggctggctgg ctggctggct gtttgctcgc cggttacgcc 3300 
atgcatcgca gcagtttcga tcctataaag cgcacgtatg tacaggtggg gccacttcaa 3360 
tatcactaag cagctttagc ttcaatagac ttcaatcggt ggaaggtttg gtatttggac 3420 
ccaagaacct aagtttctta ttaagtttta gtctttcaaa aatagcgtgc taaatagagg 3480 
gtcttaaatt aactcgaaaa tgatttagaa tccgtgtaaa ttgaattgac acattagctg 3540 
aattttccgg tatataacca aaaatgtccg gacgatatat gtgtcgtaga agatagtgtg 3600 
ataattgcgc tgtctgtgga tttaacccaa ataccaatga ggtattcaca aaatgtgaaa 3660 
acaaatagga aaccccgaat ttgtcatgcg ccgacaactt tcgagaacca accaccacca 3720 
tatgaagata aaaaacaaaa gaaactagtt tggttttcta tgggggaaaa cgatacgaac 3780 
ttttcaattc aaaacctttg gagtttcaac ggtcaatggc catgttggaa aatacccttc 3840 
tctataggac actattcatc tcacaccccc catgtggact atacgctatc cagatgaaat 3900 
ttaagggttc gagatttcct gacaacttgc ggctacctgg agcgtttttc caaccaggcg 3960 
ccggttgaaa aagtctagca cgttttcgtt tcgttttatg gttttatagg gggaaattta 4020 
cgtatttacg taggataaga agatcgtcct gcgatggtca aatagtgacc atattcttca 4080 
tttgcatgac cgaaattgtt taacaacatg cgccaagttt acgatttcgc gtgcaccact 4140 
tgagttcgag tatggtcata aagatgaaaa tgttgccgct gtgtgccgtt atgtttttca 4200 
tcggaaaact ctgacgccgg gccgggcgaa ttcgattttt gaatttcctg tcgatttcca 4260 
catttccaca tggccataaa gtaatcagcg caacccttga cacacacgca agcgaagata 4320 
atggaaatcg ttgctggcat ccgtatcggt tggattcaaa aaatgtgtgt tgccgaatcc 4380 
gtaagggaaa atcgagttta aattggactc accggcgacg atggcgatgg tgatggtgat 4440 
aagcacacaa cacaccatcg tcttgtttgg atttcggccc atttacttat ttacttgtta 4500 
ctttgttagt ttgttagttt gttagtttgg tcgcctcgag tggttttgac acatctccca 4560 
atgatccaa tggcaaacat gcggacaggt ctcgaactcg gcacaatcta caattatcta 4 620 
aagaaacc tctgtgcggc gatcataaag ttttgaaaaa caataattcc attttcgggc 4680 
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actaaaaaaa aaacagggca acttgaaatc tagcattaaa ctggtggctt aatgttccaa 4740 
tgaaaagata aggatttaac aataaaaaaa aacttttttt ttgtatgcac tgcatgttga 4800 
tagcaacatt aatgaaagaa actaatcaat gaagtttctg tttctctata ttaacaacaa 4860 
atgctgcttt gtttaggcca tgtttttgtt attagagccc aagaaacatt ggtttgcatt 4920 
gcacaaaggc gtatggaaaa tgtgtaatta cttcagtttg taacattttc gaatatgaca 4 980 
gctgctggtg gcagttaatt ctttgccctc gcgttttatt gtttgttact ttttactttt 5040 
attttttatt tttttttttg ggccctgatc gcgtcatact ttttgggatg cgtgtctgtc 5100 
cgggtgaaca gatgtgctgt ctgtctgcca ttcgcttgtg tgtaggtccg tgttggaatc 5160 
agaggttttg cccaaacgaa agagaaaaca aaacaaacag tgggtttttg gcccatgggc 5220 
ccattcgatg agattgcgca accgagttaa atggcctggc ccctgaaaag ctgaaaaccg 5280 
aaaacagaaa catgaaatct gaaatctgaa attgtggaaa tctgaaattg tggctggcac 5340 
acgcaacact cgtaatcaga ctaaaaaaaa aaggggcagt cagtcggctt aaaactgtga 5400 
cctgcatgca tttgccaact gcagactgca aactgcaaac tgcaacttcg aaggcaaccc 5460 
gaaatgcatc tgcaactgca actgcagctg ccgtgccatt tgtgttcatc tacataatcc 5520 
cagagcagcg tgtccacgtc acgctcatca caacgcgacg gtgcaaaaat gcagcgaaat 5580 
tggatgcgtt gctcgtttgt caactgaaat ggtccagatg cacaatgcaa gggagggaag 5640 
atgcggcaga aatgcgccgg aatcgatgca gtaatgcgtt tcaagctatc attatgggat 5700 
cacatggggt cacatcagtg cgaaaatgct aatgctatat gggctaaaac cagttgggaa 5760 
ttagagcgcc gtacaagctg taagctaagc tgatatgcaa taccgcttat taaagagcat 5820 
ttaaactatt tgacttgaaa atatgactaa caaattgagc agggaaaatg cttttccttc 5880 
ggaaaaaatg attgtgtgcc aaaaactaaa aacgaaagtt tccctactat tacttttggc 5940 
atcgcccaaa aaacactttc tttcattagc catttcactg attatctttt ttggtaacca 6000 
agcggagtaa tcatcataag cacacagcgt taaactattc agctatttcc agcttgttga 6060 
gataactaaa agctcggaca cgcttgagtt ccactttttt tttttttttt ttatctccat 6120 
caacatttaa taattaagtg atttcgctta ttgttctagt atttattagc gcattaccta 6180 
attgtttgac acgacgaaac atgaaatgat tactaatttc atcatagaat gccagtgacc 6240 
catgctcgtt ggcatttcga ttataataat attcgcagtt gcataaatat agattgtttc 6300 
cgaacgtgac ctgtacttcc tagtgcaact gaatcttttg cgtttctggg tcagctttta 6360 
tctgcctagt tcgctgacat ttcaaaagtt tgcaacactt ggcagcttgg cagcgtaatg 6420 
aattccttcc tgttgactcc gccacgcaaa ttggcacaaa tttcgcaata agaatactta 6480 
atgtccaact ccagacgcaa tcgcagcggc acacgcacac atgctctgcg ttaatttgag 6540 
tttagcggca atttttaatt tgaacgttct gcagtttttc aacttataat tgcactctgg 6600 
cccatttaca ttttacgcag ggcattatta ttatttcttt tttttttttt aatttttgcc 6660 
tgctgagtcg gtcgacatcg g.attcctcgt tggaggccga aggaagcgaa gtagggggca 6720 
ttgttgtcat gcatccacag ttgcggttaa tgcaatagct ctgagatgga taataattac 6780 
ctcggtcctt gaatggtgcg aaaatatctt ggcagttcct ataaaagcat agcattgcgt 6840 
ttggtacatg aatttcattg gcaaataata agtatttcat ctggcagcag ttcacacctc 6900 
aaatcatatc gcgcaaatgc caccttttcg caagaatttg aaatcaaaat aaatctcaga 6960 
attcctgatt acgcgatgcc tggaaatcta aacagatcat gagtaagccg caaatgtttc 7020 
agctaagaaa aatgaaaaga acaaagtttc cgctaagggt agttggaaaa gtagtcaaat 7080 
tggtttactg atcaatatta cttttagatc tctctgttgt tttatttatt tagcagtgta 7140 
aactaagact gcaagtccat gaactatagc cattttcaac actgcttcta agtactaatg 7200 
tcttgagtgc actgtaccca catgcgcacg catatcatat gaaaatttgt acttatttgt 7260 
gccccttttc tttcttattt ttttttttgc agacaactga gacgacgagc gacaatgcga 7320 
aagtcaggcc cggatacaaa agtccagacc tcgagcgagc gaaataatgg caatgggcag 7380 
ctcagaaata agaaagaaag aatggaaaaa acacgaaaaa aaaacgctgc caaagctgga 7440 
gtttcgagac gagcgcggca cacacttgcg caatcgcagc aacaaaacca agctgaccag 7500 
ctaattgtgc gagtgtacga gcgaatgagt gtgtgtgtgg ttgtgtgtgt gtgcaagcga 7560 
cttgcgcgtg attttgaaat cggccaacaa ccgctgggga acaaaacgta ctcagtatgc 7620 
cagtattcaa cttcaattta agccagcgca caaaagaaac gtattatttt attcatcatc 7680 
atcatcataa tcgcatagca tcgcatcgca tcgcatcgca tcgtatcgca gaataacggt 7740 
aaagcagttc gggtcacatt aacacgttct aattggacac tccggtcagt gttgcgaaat 7800 
ggaatgtgca gcataacgaa acaggaattc agcctagaca aaacaaagca gcgctattgt 7860 
ggctctttct ttcttttaag cacttttagc caacagaaat aaaaacacga gggctttatg 7920 
tgtgtgcatc tggctgtatg tatgtgtgtg gcaaccacaa ggaattttct gttgaggcac 7980 
ttttgttgtt tagtttagat cgcgttgctc tggttttttt ttttttactc aaatctcaag 8040 
tgtgagaaaa acagcaggcg caccacacac aaacacacac acacatgcat gcacatatcg 8100 
gatgtttaac atttaaatta gctgcttctg attggcattt tcggtatgac actgcttgca 8160 
ttcagttctt tatgtgtggg tgtgtgtgtg tgtgtgcctt tgacgttttt attgcctaat 8220 
cacttttttt tatttgcact ttactacgag aatactggac gaaaaccgct gactggaact 8280 
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atttttatgt ttgccccata atgtttttca accgaaaaat tgtatatttt ttttttacgt 8340 
ctagtgtaaa caattgaaat ttatttactt cagggcataa acaaaatcgt tatacacaca 8400 
cattgatagg eg 8412 



<210> 23 
<211> 2153 
<212> DNA 
<213> Drosophila 



<400> 23 

atggacagga 

ctactgacga 

atgtcgccac 

aagtgctgcc 

ctgtggctgc 

agtccgegat 

aeggagatet 

gtgegggaac 

gtcgtttgct 

aatcctctcg 

aagtggggca 

geegtcgatg 

tacggcctgc 

gecaagctgg 

caatactgcc 

ttgtttagct 

caggecaatg 

gtcttcctgt 

cattggeget 

atcgaggagc 

accatgeagt 

tggtggacgt 

cgctatctgt 

tgcgtggacc 

tggttcgaca 

ctgtgcgcca 

aaacgggacg 

etagtegteg 

ccgcatggcg 

atcgtggtgg 

ttgcgtcatg 

ttgccctcga 

accaacacga 

gctcccattg 

agacgacgag 

cgaaataatg 



gteggagtag 
ccgtcatcct 
caccggccgc 
attegggega 
ccatggtgta 
tcttgaagtt 
ttcgcagccg 
gcgctctaat 
tgaacgacag 
tgcagcagga 
gctataacac 
gaetgetgeg 
ccgactgtgg 
ateggaacac 
tggagcacac 
ccgcggcagg 
gecagaatet 
ccgccacttt 
gecaaatctg 
tgagctccag 
tcttctttct 
teegggaett 
actccctgta 
agetgeegga 
ategcaatet 
atatageget 
atgtcaagtc 
ttatgggtgt 
tatggttctt 
gctgccagcc 
acatcaccaa 
tggceggegg 
ccgccactca 
cccccgtcgc 
egacaatgeg 
gcaatgggca 



cagagctgcc 
getgeagacg 
cccgccacgt 
gtatctgaac 
cttggtgcag 
cctgcccaat 
eggagecaat 
ggtccagccg 
tcagccgatc 
gccgcccaag 
gcagctccag 
attgtcgccg 
ccaacccggt 
ggccatgctc 
geagegegag 
agccggcatc 
geagaaggee 
ggtggccggc 
ctatgtcacc 
cctgcagccg 
ggccgccttc 
tcggcccagt 
cgcttggggc 
gaegaegcta 
ctcgatcttc 
cttcgtgtcc 
etcateggag 
cacctggata 
caccgacctg 
gcaggtgtgg 
tacgaccaac 
caeggagatt 
catgcccagc 
acccatcgtc 
aaagtcaggc 
gctcagaaat 



agcagcaacc 
cttgtgtcga 
ccatcgccgc 
gatggcacct 
cagcagcgct 
acgagaccca 
gtgatgetet 
teggattact 
aatgctctgg 
gctagcctgc 
aactgegate 
cagctaccgg 
ggctactcca 
cagttgccgc 
ggcgaggtga 
cacgacggct 
gtcctgactg 
ttcctgctgc 
tgettgetat 
ggcagtgccg 
ttttggctga 
tcgttggaac 
ggtcctctgc 
ctgcgtccgg 
gcctacttct 
accacccatc 
aagtccgccc 
geggacatte 
atcaacgccc 
accgcctgcc 
ggtgtccagc 
acacagaaca 
aatceggegg 
aaaatggaga 
ccggatacaa 
aagaaagaaa 



agttcatcag 
tgagtttggc 
cgcccaccgt 
gcatcgccgg 
tcttcgagcc 
cgtgccgaaa 
ttcccaatgg 
gtgtcgactg 
aggatcccga 
gactgagcaa 
tgcagcccaa 
agggcagcta 
ttgccggtga 
acaagaacct 
agatcatagc 
ccattggagg 
gtggcatact 
ccgctgtgca 
teggcaaaat 
cctgccatac 
acaccatgtg 
gcaatcagga 
ttatcacctt 
gatteggaca 
acggacccat 
agttgacgtg 
tgggacgcgt 
tgtcctggct 
tgcaaggcgt 
gcaggatctt 
attcgagcag 
ccaccaccac 
aggatgaggt 
ecatttgeta 
aagtccagac 
gaatggaaaa 



gccatgtggc 
catcgaggag 
gaagctcaac 
cagcgaagcc 
ccatggcgcc 
ggaccagacg 
cacgctctat 
ggaggtggcc 
ttatgeagee 
gtgetgegge 
ccatcaggca 
tcagactagc 
ttggcaggat 
cagcgccgga 
ctgccagcat 
caccatcgag 
tgtgtccatt 
teatgegett 
actgctggcc 
gettgetate 
cttcaacatc 
ggcactgcgt 
cgtggccgcc 
gctctactgc 
tggcctgctg 
cggcctgtgg 
ctgcctgaag 
agtgggtgga 
cttcatcttc 
ctgtccgcga 
ctegcagggt 
caccacgaca 
aceggagaag 
aacacaactg 
etcgagegag 
aac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2153 



<210> 24 
<211> 676 
<212> PRT 
<213> Drosophila 

<400> 24 

Met Asp Arg Ser Arg Ser Ser Arg Ala Ala Ser Ser Asn Gin Phe lie 

15 10 15 

Arg Pro Cys Gly Leu Leu Thr Thr Val lie Leu Leu Gin Thr Leu Val 

20 25 30 

Ser Met Ser Leu Ala He Glu Glu Met Ser Pro Pro Pro Ala Ala Pro 
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35 










40 






Pro 


Arg 


Pro 


Ser 


Pro 


Pro 


Pro 


Thr 


Val 


Lys 




50 










55 








Ser 


Gly 


Glu 


Tyr 


Leu 


Asn 


Asp 


Gly 


Thr 


Cys 


65 










70 










Leu 


Trp 


Leu 


Pro 


Met 


Val 


Tyr 


Leu 


Val 


Gin 










85 










90 


Pro 


His 


Gly 


Ala 


Ser 


Pro 


Arg 


Phe 


Leu 


Lys 








100 










105 




Pro 


Thr 


Cys 


Arg 


Lys 


Asp 


Gin 


Thr 


Thr 


Glu 






115 










120 






Ala 


Asn 


Val 


Met 


Leu 


Phe 


Pro 


Asn 


Gly 


Thr 




130 










135 








Ala 


Leu 


Met 


Val 


Gin 


Pro 


Ser 


Asp 


Tyr 


Cys 


145 










150 










■Val 


Val 


Cys 


Leu 


Asn 


Asp 


Ser 


Gin 


Pro 


He 










165 










170 


Asp 


Tyr 


Ala 


Ala 


Asn 


Pro 


Leu 


Val 


Gin 


Gin 








180 










185 




Leu 


Arg 


Leu 


Ser 


Lys 


Cys 


Cys 


Gly 


Lys 


Trp 






195 










200 






Leu 


Gin 


Asn 


Cys 


Asp 


Leu 


Gin 


Pro 


Asn 


His 




210 










215 








Leu 


Leu 


Arg 


Leu 


Ser 


Pro 


Gin 


Leu 


Pro 


Glu 


225 










230 










Tyr 


Gly 


Leu 


Pro 


Asp 


Cys 


Gly 


Gin 


Pro 


Gly 










245 










250 


Asp 


Trp 


Gin 


Asp 


Ala 


Lys 


Leu 


Asp 


Arg 


Asn 








260 










265 




Pro 


His 


Lys 


Asn 


Leu 


Ser 


Ala 


Gly 


Gin 


Tyr 






275 










280 






Arg 


Glu 


Gly 


Glu 


Val 


Lys 


He 


He 


Ala 


Cys 




290 










295 








Ala 


Ala 


Gly 


Ala 


Gly 


He 


His 


Asp 


Gly 


Ser 


305 










310 










Gin 


Ala 


Asn 


Gly 


Gin 


Asn 


Leu 


Gin 


Lys 


Ala 










325 










330 


Leu 


Val 


Ser 


He 


Val 


Phe 


Leu 


Ser 


Ala 


Thr 








340 










345 




Leu 


Pro 


Ala 


Val 


His 


His 


Ala 


Leu 


His 


Trp 






355 










360 






Val 


Thr 


Cys 


Leu 


Leu 


Phe 


Gly 


Lys 


He 


Leu 




370 










375 








Ser 


Ser 


Ser 


Leu 


Gin 


Pro 


Gly 


Ser 


Ala 


Ala 


385 










390 










Thr 


Met 


Gin 


Phe 


Phe 


Phe 


Leu 


Ala 


Ala 


Phe 










405 










410 


Cys 


Phe 


Asn 


He 


Trp 


Trp 


Thr 


Phe 


Arg 


Asp 








420 










425 




Glu 


Arg 


Asn 


Gin 


Glu 


Ala 


Leu 


Arg 


Arg 


Tyr 






435 










440 






Trp 


Gly 


Gly 


Pro 


Leu 


Leu 


He 


Thr 


Phe 


Val 




450 










455 








Leu 


Pro 


Glu 


Thr 


Thr 


Leu 


Leu 


Arg 


Pro 


Gly 


465 










470 










Trp 


Phe 


Asp 


Asn 


Arg 


Asn 


Leu 


Ser 


He 


Phe 










485 










490 


He 


Gly 


Leu 


Leu 


Leu 


Cys 


Ala 


Asn 


He 


Ala 








500 










505 




His 


Gin 


Leu 


Thr 


Cys 


Gly 


Leu 


Trp 


Lys 


Arg 
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45 



Leu 


Asn 


Lys 


Cys 


Cys 


His 




60 










He 


Ala 


Gly 


Ser 


Glu 


Ala 


75 










80 


Gin 


Gin 


Arg 


Phe 


Phe 


Glu 










95 




Phe 


Leu 


Pro 


Asn 


Thr 


Arg 








110 






lie 


Phe 


Arg 


Ser 


Arg 


Gly 






125 








Leu 


Tyr 


Val 


Arg 


Glu 


Arg 




140 










Val 


Asp 


Trp 


Glu 


Val 


Ala 


155 










160 


Asn 


Ala 


Leu 


Glu 


Asp 


Pro 










175 




Glu 


Pro 


Pro 


Lys 


Ala 


Ser 








190 






Gly 


Ser 


Tyr 


Asn 


Thr 


Gin 






205 








Gin 


Ala 


Ala 


Val 


Asp 


Gly 




220 










Gly 


Ser 


Tyr 


Gin 


Thr 


Ser 


235 










240 


Gly 


Tyr 


Ser 


He 


Ala 


Gly 










255 




Thr 


Ala 


Met 


Leu 


Gin 


Leu 








270 






Cys 


Leu 


Glu 


His 


Thr 


Gin 






285 








Gin 


His 


Leu 


Phe 


Ser 


Ser 




300 










He 


Gly Gly 


Thr 


He 


Glu 


315 










320 


Val 


Leu 


Thr 


Gly 


Gly 


He 










335 




Leu 


Val 


Ala 


Gly 


Phe 


Leu 








350 






Arg 


Cys 


Gin 


He 


Cys 


Tyr 






365 








Leu 


Ala 


He 


Glu 


Glu 


Leu 




380 










Cys 


His 


Thr 


Leu 


Ala 


He 


395 










400 


Phe 


Trp 


Leu 


Asn 


Thr 


Met 










415 




Phe 


Arg 


Pro 


Ser 


Ser 


Leu 








430 






Leu 


Tyr 


Ser 


Leu 


Tyr 


Ala 






445 








Ala 


Ala 


Cys 


Val 


Asp 


Gin 




4 60 










Phe 


Gly 


Gin 


Leu 


Tyr 


Cys 


475 










480 


Ala 


Tyr 


Phe 


Tyr 


Gly 


Pro 










4 95 




Leu 


Phe 


Val 


Ser 


Thr 


Thr 








510 






Asp 


Asp 


Val 


Lys 


Ser 


Ser 
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515 

Ser Glu Lys Ser Ala 
530 

Met Gly Val Thr Trp 
545 

Pro His Gly Val Trp 
565 

Val Phe He Phe He 
580 

Cys Arg Arg He Phe 
595 

Thr Asn Gly Val Gin 
610 

Ala Gly Gly Thr Glu 
625 

Thr Asn Thr Thr Ala 
645 

Val Pro Glu Lys Ala 
660 

Glu Thr He Cys 
675 



520 

Leu Gly Arg Val Cys Leu 
535 

He Ala Asp He Leu Ser 
550 555 
Phe Phe Thr Asp Leu lie 
570 

Val Val Gly Cys Gin Pro 
585 

Cys Pro Arg Leu Arg His 
600 

His Ser Ser Ser Ser Gin 
615 

He Thr Gin Asn Thr Thr 
630 635 
Thr His Met Pro Ser Asn 
650 

Pro He Ala Pro Val Ala 
665 



525 

Lys Leu Val Val Val 
540 

Trp Leu Val Gly Gly 
560 

Asn Ala Leu Gin Gly 
575 

Gin Val Trp Thr Ala 
590 

Asp He Thr Asn Thr 
605 

Gly Leu Pro Ser Met 
620 

Thr Thr Thr Thr Thr 
640 

Pro Ala Glu Asp Glu 
655 

Pro He Val Lys Met 
670 



<210> 25 
<211> 3187 
<212> DNA 
<213> Drosophila 



<400> 25 

gaccagacaa gtgcaagcaa tggagctcag 
caaaagcatc ttcatcagat tcagttcctc 
ggacgtgaca cggtggagcg gtggagaaaa 
ctttctccac acgtagtgtg tgtcaataaa 
acttttagag ttcttaaaca aagaatgtgg 
ccatgaatga tgcgtccttt cctgccagca 
acacactgat agacaatgtt tgtgcactga 
tttcgttatt acgctgccgg ggattttatt 
ttgtggcttt ggcttaaagc cgcgccaagc 
agctttatca gcaccaaatg cagggccatt 
ttaaatctgc gaataattca aagacgtgcg 
ccattaagga gttatctggc gaaacatttt 
tatactttgt ttgttttaat ttaaaaaatc 
gattgaattt gatttgaata tatgatttca 
ggccctaatg ccgattcaca tcagcagcag 
gaatctcacg ggaattcggt atgatcacat 
cctgacgtac atcatctacg atcgattctt 
gtgcaagccg tccaccgatg gagtcagctc 
acgttactcc gcctgggtga tggccacgtt 
gggacgcttt atctaccgcg atgagaatgt 
tctggctgat atgctgatgg gcttctattt 
ccgcaacgag tattacaagt aagtcaaatg 
ttttagggtt gtgctggact ggataacctc 
cgtgagctcc tcggaggttt cgatgcttat 
cttgatcgcc gatccctttc gcggccatcg 
tttgatttgc atttggatta cgggcgtggg 
gaccagtacc ctaccctact acggatccta 
taaaatgtag tttaccaagt gacgaaatta 
gaggccttcc cgatgggctg gctttactcg 
ctgctggtga tgattgcgat gctctacacg 
agcgccactc cgctcactct gttggattgc 



cgtcagtatc cactgtctca cacacatcca 60 
agatctcgca tcagctgctg cgattcccat 120 
catcatcata ataataataa agccgaataa 180 
tcggttgttt ctcgcagagt ttgaaaggaa 240 
ttttggaacc actgtgccgc atcctgtctg 300 
cataaagccc cggccagttg catctattcc 360 
tttatactca gccagctatt aaataagaat 420 
atgtgacttt caaaagcccc cgccaacttg 480 
tgaaaattgt tattttaacc ttttaaagtc 540 
atgcgtggcc cacgcggcgt atgagcaacg 600 
tttcttttta cagccgcctt gtcggtaatc 660 
tgcacaacac tcgactggag gcactgtgag 720 
acaaaatata tcatgaagtt gttaaaacgg 780 
tttaatttct caccaatatt taagttcgtt 840 
tttaatggag ccactgaata tatcgttttt 900 
cgacttcgag gccatcaatt caatgcgcaa 960 
ctactgctca atgacgccgc gggtgagaat 1020 
ctttcaagat cttctgagca aaccagtgtt 1080 
gacaatagct ggaaatgtgc tagtcctatg 1140 
ggctgtgaca atggtgattc ggaatctggc 1200 
ggtcaccatt ggagtgcagg actatcgcta 1260 
tgatggattg gcatttgtta cgccccaatc 1320 
gtggcagtgc acactgattg gaacgctggc 1380 
cctggctttt atgtcgctgg agcggtttct 1440 
cagcattgga aatcgggtga tgtggctggc 1500 
cttggctgtg gcaccggtgc tcctgtggcg 1560 
ttccggaacc tgtgagcaca atgtttatga 1620 
aacttaacag gctttccatt gcacatccac 1680 
gctttcgtgt tcctgggcgt caatctgctg 1740 
gcactcctta tttcgatatg gcgaacgagg 1800 
gaattcgctg tgcgattctt tttcatcgtg 1860 
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ctgaccgact 
tacaatatct 
gcggttaatc 
cgcggctgga 
accacaacga 
gccaaagccg 
gcgcagatcg 
aaatctatat 
tgaaataaat 
gttagtgtaa 
agaaatagca 
tgccattacc 
ctgcaccacc 
agtgttactt 
agctctgatg 
tgagagatga 
atccagagtg 
cacagagatc 
ctttggcaat 
tcgaatcgac 
aacgaaagtt 
tcctgaactc 
caatgtc 



tcctgtgctg 
ccgatgacat 
ctctcttgta 
agaagataac 
cgggcacggc 
cgatgagatg 
aacttgtata 
atgtatgtga 
tgaatgtgat 
gattaagcta 
gcgctatcat 
aacttggagc 
atgttatgct 
tcaaataatg 
gcagtgggcg 
agatgaggta 
cttaacctgc 
acaggaatag 
ggcaaaaacc 
gtgaactttc 
gggacgattg 
cttgtggcag 



ggtgcccatc 
ctacgcctgg 
caccttcacc 
ttctagaaag 
aactggatcc 
ccattaggtc 
ccaccagatt 
atatctatcc 
atttgtaagg 
ttatctaagt 
ttttggcatg 
gcacgatgtg 
tacgctccag 
tacactttaa 
cactcaaatg 
aaggacttct 
tggtggacag 
ggaaagtctc 
ttgccgcatg 
agcgcctgct 
gccaaatgaa 
aacttgcagt 



atcgttatga 
ttggtggtct 
acgcccaagt 
cgagcggagg 
tctcagcatc 
ctaacgctgt 
tagcagcatt 
ctgtccgtgt 
caattttacg 
tttggtaata 
gtgtgattct 
aaactttcgc 
gtgcatgcgt 
acaaaataac 
gcatggccgt 
ggcagccggg 
tcaaatggga 
tgttatgata 
tgtcgcagtg 
ttgtgaggta 
tcatctggtg 
tggattttcc 



agatctgggt 
ttgtgctgcc 
accgaaatca 
ctggcaatgg 
cggatgactt 
cgaactgaag 
tattttgtct 
aagataacca 
gaatttaaaa 
gttttaaata 
ttctggtccg 
atttcctcca 
agtctaaaaa 
taacttgtta 
atgcgtgcac 
aaactcgcaa 
tcgctgtgcg. 
gcgtttatga 
gactccagac 
aacccggtcg 
cttggccatg 
cgtatgccga 



ttttttcaac 
acttaactcg 
aatcttcctt 
aaatgtggcc 
cacgatcttt 
gattcttgtg 
atgatatcga 
aatgaaagtg 
tgggatgcga 
ctttttattt 
agtaaatctt 
ggtcctccat 
attattaagg 
cgagctggat 
tcgtgattcc 
atgaagcgag 
aaacgacgat 
atctccaggg 
tgcctgtgct 
cactggctgc 
cgggaatggt 
ttgcgtggca 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3187 



<210> 26 
<211> 1080 
<212> DNA 
<213> Drosophila 



<400> 26 

atgacgccgc 

cttctgagca 

ggaaatgtgc 

atggtgattc 

ggagtgcagg 

tcgtggcagt 

atcctggctt 

cgcagcattg 

ggcttggctg 

tattccggaa 

tcggctttcg 

acggcactcc 

tgcgaattcg 

atcatcgtta 

tggttggtgg 

accacgccca 

aagcgagcgg 

tcctctcagc 



gggtgagaat 
aaccagtgtt 
tagtcctatg 
ggaatctggc 
actatcgcta 
gcacactgat 
ttatgtcgct 
gaaatcgggt 
tggcaccggt 
cctgctttcc 
tgttcctggg 
ttatttcgat 
ctgtgcgatt 
tgaagatctg 
tctttgtgct 
agtaccgaaa 
aggctggcaa 
atccggatga 



gtgcaagccg 
acgttactcc 
gggacgcttt 
tctggctgat 
ccgcaacgag 
tggaacgctg 
ggagcggttt 
gatgtggctg 
gctcctgtgg 
attgcacatc 
cgtcaatctg 
atggcgaacg 
ctttttcatc 
ggtttttttc 
gccacttaac 
tcaaatcttc 
tggaaatgtg 
cttcacgatc 



tccaccgatg 
gcctgggtga 
atctaccgcg 
atgctgatgg 
tattacaagg 
gccgtgagct 
ctcttgatcg 
gctttgattt 
cggaccagta 
cacgaggcct 
ctgctgctgg 
aggagcgcca 
gtgctgaccg 
aactacaata 
tcggcggtta 
cttcgcggct 
gccaccacaa 
tttgccaaag 



gagtcagctc 
tggccacgtt 
atgagaatgt 
gcttctattt 
ttgtgctgga 
cctcggaggt 
ccgatccctt 
gcatttggat 
ccctacccta 
tcccgatggg 
tgatgattgc 
ctccgctcac 
acttcctgtg 
tctccgatga 
atcctctctt 
ggaagaagat 
cgacgggcac 
ccgcgatgag 



ctttcaagat 
gacaatagct 
ggctgtgaca 
ggtcaccatt 
ctggataacc 
ttcgatgctt 
tcgcggccat 
tacgggcgtg 
ctacggatcc 
ctggctttac 
gatgctctac 
tctgttggat 
ctgggtgccc 
catctacgcc 
gtacaccttc 
aacttctaga 
ggcaactgga 
atgccattag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



<210> 27 
<211> 359 
<212> PRT 
<213> Drosophila 

<400> 27 

Met Thr Pro Arg Val Arg Met Cys Lys Pro Ser Thr Asp Gly Val Ser 

15 10 15 

Ser Phe Gin Asp Leu Leu Ser Lys Pro Val Leu Arg Tyr Ser Ala Trp 
20 25 30 
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Val 


Met 


Ala 


Thr 


Leu 


Thr 


He 


Ala 


Gly 


Asn 


Vai 


Leu 


va i 


Leu 


Trp 


biy 






35 










A C\ 










A 

*i D 








Arg 


Phe 


He 


Tyr 


Arg 


Asp 


Glu 


Asn 


Val 


Ala 


vai 


Thr 


Met 


Val 


He 


Arg 




50 










55 










ti ft 
bU 










Asn 


Leu 


Ala 


Leu 


Ala 


Asp 


Met 


Leu 


Met 


Gly 


Phe 


Tyr 


Leu 


Val 


Thr 


He 


65 










70 










/ O 










D A 

80 


Gly 


Val 


Gin 


Asp 


Tyr 


Arg 


Tyr 


Arg 


Asn 


Glu 


Tyr 


Tyr 


Lys 


Val 


Val 


Leu 










85 










90 










95 




Asp 


Trp 


He 


Thr 


Ser 


Trp 


Gin 


Cys 


Thr 


Leu 


He 


Gly 


Thr 


Leu 


Ala 


Val 








100 










105 










110 






Ser 


Ser 


Ser 


Glu 


Val 


Ser 


Met 


Leu 


He 


Leu 


Ala 


Phe 


Met 


Ser 


Leu 


Glu 






115 










120 










IOC 
l^D 








Arg 


Phe 


Leu 


Leu 


He 


Ala 


Asp 


Pro 


Phe 


Arg 


Gly 


His 


Arg 


Ser 


He 


Gly 




130 










135 










140 










Asn 


Arg 


Val 


Met 


Trp 


Leu 


Ala 


Leu 


lie 


Cys 


He 


Trp 


He 


Thr 


Gly Val 


145 










150 










155 










160 


Gly 


Leu 


Ala 


Val 


Ala 


Pro 


Val 


Leu 


Leu 


Trp 


Arg 


Thr 


Ser 


Thr 


Leu 


Pro 










165 










170 










175 




Tyr 


Tyr 


Gly 


Ser 


Tyr 


Ser 


Gly 


Thr 


Cys 


Phe 


Pro 


Leu 


His 


He 


His 


Glu 








180 










185 










190 






Ala 


Phe 


Pro 


Met 


Gly 


Trp 


Leu 


Tyr 


Ser 


Ala 


Phe 


Val 


Phe 


Leu 


Gly 


Val 






195 










200 










205 








Asn 


Leu 


Leu 


Leu 


Leu 


Val 


Met 


He 


Ala 


Met 


Leu 


Tyr 


Thr 


Ala 


Leu 


Leu 




210 










215 










220 










He 


Ser 


He 


Trp 


Arg 


Thr 


Arg 


Ser 


Ala 


Thr 


Pro 


Leu 


Thr 


Leu 


Leu 


Asp 


225 










230 










235 










240 


Cys 


Glu 


Phe 


Ala 


Val 


Arg 


Phe 


Phe 


Phe 


He 


Val 


Leu 


Thr 


Asp 


Phe 


Leu 










245 










250 










255 




Cys 


Trp 


Val 


Pro 


He 


He 


Val 


Met 


Lys 


He 


Trp 


Val 


Phe 


Phe 


Asn 


Tyr 








260 










265 










270 






Asn 


He 


Ser 


Asp 


Asp 


He 


Tyr 


Ala 


Trp 


Leu 


Val 


Val 


Phe 


Val 


Leu 


Pro 






275 










280 










285 








Leu 


Asn 


Ser 


Ala 


Val 


Asn 


Pro 


Leu 


Leu 


Tyr 


Thr 


Phe 


Thr 


Thr 


Pro 


Lys 




290 










295 










o a a 
300 










Tyr 


Arg 


Asn 


Gin 


He 


Phe 


Leu 


Arg 


Gly 


Trp 


Lys 


Lys 


He 


Thr 


Ser 


Arg 


305 










310 










315 










320 


Lys 


Arg 


Ala 


Glu 


Ala 


Gly 


Asn 


Gly 


Asn 


Val 


Ala 


Thr 


Thr 


Thr 


Thr 


Gly 










325 










330 










335 




Thr 


Ala 


Thr 


Gly 


Ser 


Ser 


Gin 


His 


Pro 


Asp 


Asp 


Phe 


Thr 


He 


Phe 


Ala 








340 










345 










350 






Lys 


Ala 


Ala 


Met 


Arg 


Cys 


His 





















355 



<210> 28 
<211> 3759 
<212> DNA 
<213> Drosophila 



<400> 28 

cactcagatt 

cgatgccaac 

ggaatcgtcg 

taatttccat 

cggtggtgta 

tttaaagatc 

atcgatgttt 

aaacgttaca 

tatctcacaa 

atattttatt 



tgccgacagc 
atcgatgtca 
atgttttcag 
attttctaat 
ggctggtgtc 
aatttatttg 
ctccacttct 
aaaataccaa 
atagttttta 
ttatgctaca 



agaaatattt 
tcgatgtttt 
cgcatagaaa 
acaattatta 
ttggaagaca 
ttttttgcca 
aatgccaacg 
ctatcgcata 
aggaacctac 
gaaaactgat 



tagggttgta 
cgaaaaatca 
cagcaaagtg 
atagcatatt 
tggataaata 
aagacatcga 
aaaacgtgac 
gcggaatcac 
agtcagtacc 
aagctgcatt 



tcttagaggt 
aaaacgtcga 
tgtaaattgt 
gcatatttat 
tctgaaatgt 
aaacatggat 
tgggaacgta 
cttggcgggc 
aaatacttaa 
acttttgaaa 



ggagaaacat 
tgtttttcga 
aagtctaaat 
tgtgactcgc 
aagttttttt 
gttcgccaac 
tcgatgggaa 
tattcgaatg 
aattccattt 
agttaaaaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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tgtgtttttt aaatgattta tttatataaa taagcttatg taatactatt gttaatatgt 660 
aaaaaaaaaa gaataaaaga aaaagaatag tttacctaag cagttgcttg aataatacat 720 
acataattta agttttatac ataatttatc aaccctcgct cagtaggaaa aaaagatggc 780 
tcagaaatga gcaacgaatt ggaatgtccg ataagccttt tttatcaaaa taataatttt 840 
tacattccct tttcataacg ctttcccagt tagtttgtgc caatcgtaaa acagcatccg 900 
caaagtgaag tcgctgctat tttgtagtac cactacattc ttatttattt aagtaatatt 960 
taaagaatgt ccagctaact atatgattgc tagttcaaaa atgttacttt ctgcatcgat 1020 
cctaatatat ttcttgctga atctgcaatc ttcgagtgct gagattgccg actgcagttt 1080 
ctatgatacc gttgatatct cggaaggcca gaggctctcg aatggatcat acctctacga 1140 
gggattactt attcccgccc atttgacggc caagtatgaa ttcaagcttc tggcgaatgg 1200 
agataaggag caggtgccaa gtcacgtgag aggatgtgtg tgcaagctaa ggacttgcgt 1260 
caggttttgc tgcccacatg accacataat ggatatgggc gaatgctacg ccaacatgac 1320 
gacggaggag aacgaattac tggaccccat gctaaatgtg accctggacg acggatcagt 1380 
ggtgcaaagg cactataaga aggagctcat ggtgcagtgg gatctgccaa agccctgcga 1440 
tgatatgttt tacctagaca accgggatat aatggatgag tacaccttgt tcgaggtatg 1500 
gctgctcgaa aaataagcat ttttgggcag gatttattta tgctttggct ctctgcagaa 1560 
tggaagattg ctgcgccact acgatcaagt gtatttggac aagtcggagt actgtctgca 1620 
gcaccgaaca ttcggcgaag gtaacaacaa ctccattcgc attatacccc acaactgctt 1680 
gatactgcca tccagaacgg gacagaccgt tggtaagatg atgcccattg cttgcatggt 1740 
agctcccagt tctatatctt tcctatttcc atagtgatga tcacatcgtt gatatgcttg 1800 
gttcttacca tcgccgtgta cctgtgcgtt aagaagctaa tgaatttgga gggcaagtgt 1860 
ttcatctgtt acatgatgtg cctatttttc ggatacctgt tcctactcct tgatctgtgg 1920 
gagttatcgc tcgatttctg taaagcagca ggttagctta gaggggatgc atttcaacac 1980 
aacttgacaa accattgacc atttacgaca ggtttcttgg gatacttttt cgtcatggct 2040 
gcgttctttt ggctatctat catcagtcgg cactattgga aatgtcttac caatccctgt 2100 
gcctcgatga acattcggtc agaacgtgcg tttctgctat acagctgctt cgcctgggca 2160 
atgcctttgg ccttgaccgg agtcacctat ctggctgata acgttgtgaa caacgaggag 2220 
tggcagcctc gcgtgggcga cgaagggcat tgctggatct atagtgggtt ctgttttgta 2280 
tccggcttgt ggtaccagct caccaatgtt tttctttcca ccctcagcta aaagctggtc 2340 
tgcaatggtg tacttctatg gaccgatggt gctcctgatt ctcttcaaca taactatgtt 2400 
tgttctgacg gctaaacaca taatagattc caagaggact cttcgaaaga ttgctcgcaa 24 60 
tgaaggtaga atacagaagc tgaattcgga caaacagaag tatatcaatt ataatagttg 2520 
aaaatgtcag tttaagataa atattatcaa ttcgttacag ttacacacaa tttctgctgc 2580 
tcttcactgt gatggggatg tcgtggagct ttgagatatt ctcctacttg gtgcaacgcg 2640 
agaaactgtg ggtcaatatc tttctggtag ctgactactt caactggtcc caaggcgtca 2700 
ttatatttgt gctattcatt ctgcggcgca aaactcttgt actattcaag aaacagtaag 2760 
tcttatagtt catagttcgg gtaaccagtt ccctaaaatg tctcttttca gaatatttcc 2820 
caaacagaga gcctttagta gatctgcaac tcaaagtacg attgaatcca tttctcaaac 2880 
aaaaaggcat tttaacatga catagtattt tcttctcttc gaattaatca atttaatagt 2940 
aaagtataac tttttttctc acaaaatgta tgaatatatt tcacttcatt ctacattcta 3000 
tatagaattc atttaatcac aaaatgtatg aatatttttc tttatatgac gtgcatcgac 3060 
tatagaaaat cgactatagt gtgccacaat atattttact catcatttcc taatcaaact 3120 
attgcaaact taactttgag ccgaggaaaa accctgtatg tctatcaaaa ctatgagtca 3180 
aaagtatgac gcgatgctga aagtagttgc ctttaaatat tgaagtgata aatgtagaaa 3240 
gaagacattt gtatggaaat aaactaagtt tgtaaatcta cattcgcttt atttcatgta 3300 
caaattttaa tttaaaaact ttgaaggtaa atagtgcttt atcagatgcg gattacacgg 3360 
tagcagttat catcgatgtc gaatcttctt gtttatgtga ccgcttttct tcgctgtcgc 3420 
actggagttc gtggatagtt tgctctttgt attcgaagtg ggtgtgccga ggagatcctt 3480 
gttccctcca ctgttcgtag atccagacac cattccgctc actacgctgc cctcattgcg 3540 
atgcgagggt gaatctgtaa ataaaaaggt gtaattaaac atatatactt ttctggagaa 3600 
aagcgctaaa gttgaatctc aatgtgctgt gctgccttac ccttttcgtc tttcggtgtg 3660 
ctcttgattg tcgtggttcc gtcgtgggcc gttgtggtgt tctgattgat tgagatctga 3720 
ttgtcgctgg agtcttcatt tgtgatggaa tccgattcg 3759 

<210> 29 
<211> 1449 
<212> DNA 
<213> Drosophila 

<400> 29 
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atgttacttt 
gagattgccg 
aatggatcat 
ttcaagcttc 
tgcaagctaa 
gaatgctacg 
accctggacg 
gatctgccaa 
tacaccttgt 
tcggagtact 
ataccccaca 
tcgttgatat 
ttggagggca 
ctccttgatc 
tttttcgtca 
cttaccaatc 
tgcttcgcct 
gtgaacaacg 
aaaagctggt 
ataactatgt 
attgctcgca 
tttctgctgc 
gtgcaacgcg 
caaggcgtca 
aaacagtaa 



ctgcatcgat 
actgcagttt 
acctctacga 
tggcgaatgg 
ggacttgcgt 
ccaacatgac 
acggatcagt 
agccctgcga 
tcgagaatgg 
gtctgcagca 
actgcttgat 
gcttggttct 
agtgtttcat 
tgtgggagtt 
tggctgcgtt 
cctgtgcctc 
gggcaatgcc 
aggagtggca 
ctgcaatggt 
ttgttctgac 
atgaaggtag 
tcttcactgt 
agaaactgtg 
ttatatttgt 



cctaatatat 
ctatgatacc 
gggattactt 
agataaggag 
caggttttgc 
gacggaggag 
ggtgcaaagg 
tgatatgttt 
aagattgctg 
ccgaacattc 
actgccatcc 
taccatcgcc 
ctgttacatg 
atcgctcgat 
cttttggcta 
gatgaacatt 
tttggccttg 
gcctcgcgtg 
gtacttctat 
ggctaaacac 
aatacagaag 
gatggggatg 
ggtcaatatc 
gctattcatt 



ttcttgctga 
gttgatatct 
attcccgccc 
caggtgccaa 
tgcccacatg 
aacgaattac 
cactataaga 
tacctagaca 
cgccactacg 
ggcgaaggta 
agaacgggac 
gtgtacctgt 
atgtgcctat 
ttctgtaaag 
tctatcatca 
cggtcagaac 
accggagtca 
ggcgacgaag 
ggaccgatgg 
ataatagatt 
ctgaattcgg 
tcgtggagct 
tttctggtag 
ctgcggcgca 



atctgcaatc 
cggaaggcca 
atttgacggc 
gtcacgtgag 
accacataat 
tggaccccat 
aggagctcat 
accgggatat 
atcaagtgta 
acaacaactc 
agaccgttgt 
gcgttaagaa 
ttttcggata 
cagcaggttt 
gtcggcacta 
gtgcgtttct 
cctatctggc 
ggcattgctg 
tgctcctgat 
ccaagaggac 
acaaacagaa 
ttgagatatt 
ctgactactt 
aaactcttgt 



ttcgagtgct 
gaggctctcg 
caagtatgaa 
aggatgtgtg 
ggatatgggc 
gctaaatgtg 
ggtgcagtgg 
aatggatgag 
tttggacaag 
cattcgcatt 
gatgatcaca 
gctaatgaat 
cctgttccta 
cttgggatac 
ttggaaatgt 
gctatacagc 
tgataacgtt 
gatctatact 
tctcttcaac 
tcttcgaaag 
ttacacacaa 
ctcctacttg 
caactggtcc 
actattcaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1449 



<210> 30 
<211> 482 
<212> PRT 
<213> Drosophila 



<400> 30 



Met 


Leu 


Leu 


Ser 


Ala 


Ser 


He 


Leu 


He 


Tyr 


Phe 


Leu 


Leu 


Asn 


Leu 


Gin 


1 








5 










10 










15 




Ser 


Ser 


Ser 


Ala 


Glu 


He 


Ala 


Asp 


Cys 


Ser 


Phe 


Tyr 


Asp 


Thr 


Val 


Asp 








20 










25 










30 






lie 


Ser 


Glu 


Gly 


Gin 


Arg 


Leu 


Ser 


Asn 


Gly 


Ser 


Tyr 


Leu 


Tyr 


Glu 


Gly 






35 










40 










45 








Leu 


Leu 


lie 


Pro 


Ala 


His 


Leu 


Thr 


Ala 


Lys 


Tyr 


Glu 


Phe 


Lys 


Leu 


Leu 




50 










55 










60 










Ala 


Asn 


Gly 


Asp 


Lys 


Glu 


Gin 


Val 


Pro 


Ser 


His 


Val 


Arg 


Gly 


Cys 


Val 


65 










70 










75 










80 


Cys 


Lys 


Leu 


Arg 


Thr 


Cys 


Val 


Arg 


Phe 


Cys 


Cys 


Pro 


His 


Asp 


His 


He 










85 










90 










95 




Met 


Asp 


Met 


Gly 


Glu 


Cys 


Tyr 


Ala 


Asn 


Met 


Thr 


Thr 


Glu 


Glu 


Asn 


Glu 








100 










105 










110 






Leu 


Leu 


Asp 


Pro 


Met 


Leu 


Asn 


Val 


Thr 


Leu 


Asp 


Asp 


Gly 


Ser 


Val 


Val 






115 










120 










125 








Gin 


Arg 


His 


Tyr 


Lys 


Lys 


Glu 


Leu 


Met 


Val 


Gin 


Trp 


Asp 


Leu 


Pro 


Lys 




130 










135 










140 










Pro 


Cys 


Asp 


Asp 


Met 


Phe 


Tyr 


Leu 


Asp 


Asn 


Arg 


Asp 


He 


Met 


Asp 


Glu 


145 










150 










155 










160 


Tyr 


Thr 


Leu 


Phe 


Glu 


Asn 


Gly 


Arg 


Leu 


Leu 


Arg 


His 


Tyr 


Asp 


Gin 


Val 










165 










170 










175 




Tyr 


Leu 


Asp 


Lys 


Ser 


Glu 


Tyr 


Cys 


Leu 


Gin 


His 


Arg 


Thr 


Phe 


Gly 


Glu 








180 










185 










190 






Gly 


Asn 


Asn 


Asn 


Ser 


He 


Arg 


He 


He 


Pro 


His 


Asn 


Cys 


Leu 


He 


Leu 






195 










200 










205 








Pro 


Ser 


Arg 


Thr 


Gly 


Gin 


Thr 


Val 


Val 


Met 


He 


Thr 


Ser 


Leu 


He 


Cys 




210 










215 










220 
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Leu 


Val 


Leu 


Thr 


He 


Ala 


Val 


Tyr 


Leu 


Cys 


Val 


Lys 


Lys 


Leu 


Met 


Asn 


225 










230 










235 










240 


Leu 


Glu 


Gly 


Lys 


Cys 


Phe 


He 


Cys 


Tyr 


Met 


Met 


Cys 


Leu 


Phe 


Phe 


Gly 










245 










250 










255 




Tyr 


Leu 


Phe 


Leu 


Leu 


Leu 


Asp 


Leu 


Trp 


Glu 


Leu 


Ser 


Leu 


Asp 


Phe 


Cys 






260 










265 










270 






Lys 


Ala 


Ala 


Gly 


Phe 


Leu 


Gly 


Tyr 


Phe 


Phe 


Val 


Met 


Ala 


Ala 


Phe 


Phe 




275 










280 










285 








Trp 


Leu 


Ser 


He 


He 


Ser 


Arg 


His 


Tyr 


Trp 


Lys 


Cys 


Leu 


Thr 


Asn 


Pro 




290 










295 










300 










Cys 


Ala 


Ser 


Met 


Asn 


He 


Arg 


Ser 


Glu 


Arg 


Ala 


Phe 


Leu 


Leu 


Tyr 


Ser 


305 










310 










315 










320 


Cys 


Phe 


Ala 


Trp 


Ala 


Met 


Pro 


Leu 


Ala 


Leu 


Thr 


Gly 


Val 


Thr 


Tyr 


Leu 








325 










330 










335 




Ala 


Asp 


Asn 


Val 


Val 


Asn 


Asn 


Glu 


Glu 


Trp 


Gin 


Pro 


Arg 


Val 


Gly 


Asp 








340 










345 










350 






Glu 


Gly 


His 


Cys 


Trp 


He 


Tyr 


Thr 


Lys 


Ser 


Trp 


Ser 


Ala 


Met 


Val 


Tyr 






355 










360 










365 








Phe 


Tyr 


Gly 


Pro 


Met 


Val 


Leu 


Leu 


He 


Leu 


Phe 


Asn 


He 


Thr 


Met 


Phe 




370 










375 










380 










Val 


Leu 


Thr 


Ala 


Lys 


His 


He 


He 


Asp 


Ser 


Lys 


Arg 


Thr 


Leu 


Arg 


Lys 


385 










390 










395 










400 


He 


Ala 


Arg 


Asn 


Glu 


Gly 


Arg 


He 


Gin 


Lys 


Leu 


Asn 


Ser 


Asp 


Lys 


Gin 










405 










410 










415 




Asn 


Tyr 


Thr 


Gin 


Phe 


Leu 


Leu 


Leu 


Phe 


Thr 


Val 


Met 


Gly 


Met 


Ser 


Trp 








420 










425 










430 






Ser 


Phe 


Glu 


He 


Phe 


Ser 


Tyr 


Leu 


Val 


Gin 


Arg 


Glu 


Lys 


Leu 


Trp 


Val 






435 










440 










445 








Asn 


He 


Phe 


Leu 


Val 


Ala 


Asp 


Tyr 


Phe 


Asn 


Trp 


Ser 


Gin 


Gly 


Val 


He 




450 










455 










460 










He 


Phe 


Val 


Leu 


Phe 


He 


Leu 


Arg 


Arg 


Lys 


Thr 


Leu 


Val 


Leu 


Phe 


Lys 


465 










470 










475 










480 



Lys Gin 



<210> 31 
<211> 4469 
<212> DNA 
<213> Drosophila 



<400> 31 

acgttgtcag 

caatggaaat 

catgacaggc 

gctatatgtc 

tctataatcc 

tgctgatgag 

cagacgggcc 

gccagttgcc 

gtcagtgagc 

cgtcggctta 

aattcaatag 

ataatgttta 

ttattctgtt 

ttgagctgtt 

cgtgtaacct 

gcattccctt 

ttgaatgtta 

gcccctctgc 



tgtgtgtgac 
tcggtacaat 
gtggcttgcc 
tatatggcat 
ccagctaaac 
gcaacagagt 
agtgagtcct 
cagacagtta 
gagtgagtca 
agtcaaaggc 
cctgtggcaa 
ctgaaataat 
tttaattgca 
catttagcca 
tttttttttt 
aaacatttgc 
ttgcttgaat 
cgctccactg 



ggttgttaat 
ttgccgtcca 
accaattaag 
atagatgccc 
ttcgagacat 
ggacgcggcc 
ggatgtcctg 
gtccgccagg 
gtcagtcaat 
agtcggccaa 
tcgcacttgc 
gttatgccac 
ctttaaaatt 
tttgattttg 
tttgggctct 
tggggaaaca 
ggccccaatt 
acccctgacc 



ttcatttgac 
ttaaataagc 
tggaatacta 
acagcccctt 
cgcatccggc 
ggaaacaagg 
gtgaacgcag 
cagtcagtcg 
cgaacaatca 
gaagggatct 
aactttaata 
aattggcaca 
gatgattgta 
tctacgttgg 
cccaacaatt 
cattgattta 
tacttaaaag 
ccattataca 



ttgcgagggt 
acgcaattaa 
tatggctata 
ggggtcgatc 
accgcaatcc 
cactcctccc 
ctcagctccc 
gtctgtcagt 
atcatacgga 
tgtctaactt 
attccaagaa 
tacaagctca 
ttagccatca 
ttttttatca 
tgagctcctt 
tatgccggca 
ctacgctcct 
tgttgtgggc 



tttaatattg 
ttgtggcagc 
tcgctatatc 
gcctacgccg 
gaaatcggaa 
atcaatctgt 
agcgggtgag 
gagtcagtta 
atgacagcta 
acagtcgcaa 
cgaagcagtc 
taatagtgtt 
taggcatggc 
agtttcttgc 
tttcgttttc 
tcgccaaaaa 
ccgccccact 
tccactggct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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cctgattggt atttatgagc gtgtttattt 
acggaaatgg ttggcaattt tgcatttgat 
tcgcaaatga actcatctgc ccaatattcc 
acatataagt tagttaagta tgtatattag 
caaatggcat cacgaccacg gcgaccagga 
aattcgtgac cgtctgcagg gagcgctcgc 
cgaagagggt caggatgggg aacagaatca 
aaacgaagcg agttgtagtt acattatatc 
tgctgccacc caaattccct ggccaaagtt 
aacgggcacg taagtcaatt acacaggcag 
taattttaat ggcaacacaa gtaaggacta 
tttaatataa tctatctaat atagagtgat 
ttaaagttat ttcacatatt aaaaatggca 
aacttgaaat acctttatct tcattatgaa 
tttcttgaga ttgcacgaat aaatgagtta 
caaaactgat ttctaaatga acatctttat 
aatactccca tcttagaaat ctaaggaatt 
cgatgaaaat atattatact aaagaagttg 
tgatgataat gtttaaaggt atttaatgag 
tttaaagcct ttgaatgata ttcaataata 
tataat.gcgg ttttaagatg ttaagttgta 
cctcgtactc acccactctc agctcgccgc 
tgcagttcag ccggtagccg tcgaggtaca 
cgctgcccct gctccccgtg atggccactc 
tgctcccatt gtagttggga aaataatcgt 
tgccccctga tcctccagcg ccgcctacac 
ccacgcccac atccccagag ccgcctccat 
tgcccagcgt ctcgcgcgcc accgttgctg 
cgcccagcat gccatagtcc gtttccgcat 
cgctgtcgat cagcgcgatg ccagcatcgc 
ccgccagctg ttgttccaca aagctgcgca 
cactggagtt gctctgggct ggctgcgtgc 
cgttggcgcc cgtcaccgcc aagtttgtgc 
tgcccaccga acccaagagc cattcgctga 
ccgccgccgt ttccacatcc tcgtagagcg 
caacgcccgc ctgcgcatcg ctgccgttcg 
ggtggtgatt tatccacgtg gctcccgcgg 
cggctgtgct caaggtcgcc gatgtggttg 
ccgttgccgc cgcagttgtt gccgccgtgc 
cggttccgct gctgctgatg ccccgccgcc 
ggggcgtggc agcggaggcg agctgcaact 
ctgcagcgaa agagacgaag ttcggaggtg 
taatcaaagt tcttgaagat agttcttaac 
cctaattggt cgcgttaatg ttgcttaatt 
tcatggttct tttcgactta atatttttag 
tttcctactt acacattaca cattgcatgc 
acttcatgct tgaccacgcc cccccaacca 
cccatgccat tttggacgta cggacccttt 
actctttaat caatttattg tcaattattc 
ttaattaaat tagcaaaagg tttttgtgtc 
cccacttcca ttctcgcaac caaaatgcca 
acacattttg caattttcgc ttttgtattt 
tcgatgggcc ataattaaaa gatggcctaa 
gctgggggtc aaaggtaatc ggatcggagt 
cttccggaga tcgggttatt cgaactctga 
ttccggatcg gaggcagaga tccgagagct 
ttgggcacac ttccaggcga cgagaggca 

<210> 32 
<211> 1389 



attatatgac ccaacacttg ataaaattaa 1140 
tctgacaccc aaacacacag gcaggcaaat 1200 
gaccataaag catcccgacc aacggcactc 1260 
tggcttggca acttaccaga aagtagacag 1320 
gatcggcgat ggccaacgag actataaaat 1380 
ggcacacgga caggatgacc aggatgttgc 1440 
ggatgagcgc ccagtagttg tggtccactg 1500 
cgggatgcat ccatccgcag gctctcctgc 1560 
caccttgcag gaagatttca gggaaagctc 1620 
tccgtttaat caaggactct ggctgacaac 1680 
gcacacaaag acgtgaatct taaaggctgt 1740 
gcatgttacc aacagctgcg ttgttaaaaa 1800 
aaatatggct atgtaatatg aaagttatta 1860 
acaaattaaa caaacaaata tgtaggaact 1920 
tgtaacgcgt ttaaattaaa ttcttgctat 1980 
aaataaatta attatcggcg gtttcattca 2040 
ctgttctcaa aacctagtca cctcactatc 2100 
ctgttgcaag gtttaaatga tgctccaact 2160 
attcaaggct atctaatcta atttaataat 2220 
tctaatgagc tttaatgatt tttaaagatt 2280 
gtagtttgat gatttcgaat ggtaatcctg 2340 
aggagtcggt taggttgagc gtctcgttgg 2400 
gtcccgtcag gttctgctcg aggagcagtc 24 60 
ccggcgcaat ggtgcttgtt cccgaaaccg 2520 
taaagttttc gagcagcagc atgaaggtgc 2580 
ctccgcccgc cgtaccggcc agtccgccag 2640 
cgtaactggt cgtcgagggc gccgtt.cgat 2700 
tccgctggag cagttccgca tcaccgaagc 2760 
cagccacatt gtccagggct gcctcctccc 2820 
cgccgtcgcc cgctcctgct gcccctccgc 2880 
gcgagtaacg cccccgctca ccatcctccg 2940 
tgttcagtgc ctccagccag gccagggttc 3000 
tgttctcggg cccgcccact gccgcctcac 3060 
tgtcgtcgat cagcgcgtag ccggcgtcct 3120 
gcaagtagcc ggctggcatt gtggggccct 3180 
ccggttgaga gctgtccgct tccaccgcca 3240 
atggggctgc cgtggtcgag gtggaagcag 3300 
ctgccgttgc tgcagttgcc gcagttgctc 3360 
tgctcagtgg ctgggctgcc atggtgccgc 34 20 
aatcgaatgg gctcaacatc tagtggggcg 3480 
cctgcaacgc cagcggcatg tttccgtttc 3540 
agaaaacata aatcacaaaa atttccatta 3600 
gatacgcaca actaagatta caattttaag 3660 
tgataaattc ttgtaatccg caaatgagag 3720 
gatatctaaa aaacaaagtc gttctcaatt 3780 
cccttcgttt gcattttcaa cacaaggcgt 3840 
acacaacaac ataccatcca gcaccacgcc 3900 
gcccaactgc catggccgca tgtttaatta 3960 
tacaaaaagc gacagcgaaa attgtgtaat 4020 
tctctcgccc caccaaccat tccgcccact 4080 
cgcccccatg ctcgacgctg tcttgctttt 4140 
gccaattggc gctgattaaa gccgcccgag 4200 
tgagaggcga ggcgagtacg tctcgagttg 4260 
ggtcttaagt cccctgcctt ctgcctctgt 4320 
gcttgtggat tctggactct gaattctgga 4380 
gcacacccag acggctggca ctaatccaat 4 4 40 

4469 



48 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 PCT/US01/09341 



<212> DNA 

<213> Drosophila 

<400> 32 

atgttgagcc cattcgattg gcggcggggc atcagcagca gcggaaccgg cggcaccatg 60 

gcagcccagc cactgagcag cacggcggca acaactgcgg cggcaacggg agcaactgcg 120 

gcaactgcag caacggcagc aaccacatcg gcgaccttga gcacagccgc tgcttccacc 180 

tcgaccacgg cagccccatc cgcgggagcc acgtggataa atcaccacct ggcggtggaa 240 

gcggacagct ctcaaccggc gaacggcagc gatgcgcagg cgggcgttga gggccccaca 300 

atgccagccg gctacttgcc gctctacgag gatgtggaaa cggcggcgga ggacgccggc 360 

tacgcgctga tcgacgacat cagcgaatgg ctcttgggtt cggtgggcag tgaggcggca 420 

gtgggcgggc ccgagaacag cacaaacttg gcggtgacgg gcgccaacgg aaccctggcc 480 

tggctggagg cactgaacag cacgcagcca gcccagagca actccagtgc ggaggatggt 540 

gagcgggggc gttactcgct gcgcagcttt gtggaacaac agctggcggg cggaggggca 600 

gcaggagcgg gcgacggcgg cgatgctggc atcgcgctga tcgacagcgg ggaggaggca 660 

gccctggaca atgtggctga tgcggaaacg gactatggca tgctgggcgg cttcggtgat 720 

gcggaactgc tccagcggac agcaacggtg gcgcgcgaga cgctgggcaa tcgaacggcg 780 

ccctcgacga ccagttacga tggaggcggc tctggggatg tgggcgtggc tggcggactg 840 

gccggtacgg cgggcggagg tgtaggcggc gctggaggat cagggggcag caccttcatg 900 

ctgctgctcg aaaactttaa cgattatttt cccaactaca atgggagcac ggtttcggga 960 

acaagcacca ttgcgccggg agtggccatc acggggagca ggggcagcgg actgctcctc 1020 

gagcagaacc tgacgggact gtacctcgac ggctaccggc tgaactgcac caacgagacg 1080 

ctcaacctaa ccgactcctg cggcgagctg agagtggtgg accacaacta ctgggcgctc 1140 

atcctgattc tgttccccat cctgaccctc ttcggcaaca tcctggtcat cctgtccgtg 1200 

tgccgcgagc gctccctgca gacggtcacg aattatttta tagtctcgtt ggccatcgcc 1260 

gatctcctgg tcgccgtggt cgtgatgcca tttgctgtct actttctgga gccagtggag 1320 

cccacaacat gtataatggg gtcaggggtc agtggagcgg cagaggggca gtggggcgga 1380 

ggagcgtag 1389 



<210> 33 
<211> 462 
<212> PRT 
<213> Drosophila 



<400> 33 
Met Leu Ser Pro 
1 

Gly Gly Thr Met 
20 

Ala Ala Ala Thr 
35 

Thr Ser Ala Thr 
50 

Ala Pro Ser Ala 
65 

Ala Asp Ser Ser 

Glu Gly Pro Thr 
100 

Glu Thr Ala Ala 
115 

Glu Trp Leu Leu 
130 

Glu Asn Ser Thr 
145 

Trp Leu Glu Ala 

Ala Glu Asp Gly 
180 

Gin Gin Leu Ala 



Phe Asp Trp Arg 
5 

Ala Ala Gin Pro 

Gly Ala Thr Ala 
40 

Leu Ser Thr Ala 
55 

Gly Ala Thr Trp 
70 

Gin Pro Ala Asn 
85 

Met Pro Ala Gly 

Glu Asp Ala Gly 
120 

Gly Ser Val Gly 
135 

Asn Leu Ala Val 
150 

Leu Asn Ser Thr 
165 

Glu Arg Gly Arg 
Gly Gly Gly Ala 



Arg Gly lie Ser 
10 

Leu Ser Ser Thr 
25 

Ala Thr Ala Ala 

Ala Ala Ser Thr 
60 

He Asn His His 
75 

Gly Ser Asp Ala 
90 

Tyr Leu Pro Leu 
105 

Tyr Ala Leu He 

Ser Glu Ala Ala 
140 

Thr Gly Ala Asn 
155 

Gin Pro Ala Gin 
170 

Tyr Ser Leu Arg 
185 

Ala Gly Ala Gly 



Ser Ser Gly Thr 
15 

Ala Ala Thr Thr 
30 

Thr Ala Ala Thr 
45 

Ser Thr Thr Ala 

Leu Ala Val Glu 
80 

Gin Ala Gly Val 
95 

Tyr Glu Asp Val 
110 

Asp Asp He Ser 
125 

Val Gly Gly Pro 

Gly Thr Leu Ala 
160 

Ser Asn Ser Ser 
175 

Ser Phe Val Glu 
190 

Asp Gly Gly Asp 
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195 










inn 










205 








Ala 


Gly 


He 


Ala 


Leu 


He 


Asp 


Ser 


Gly 


Glu 


Glu 


Ala 


Ala 


Leu 


Asp 


Asn 




210 










215 










220 










Val 


Ala 


Asp Ala 


Glu 


Thr 


Asp 


Tyr 


Gly 


Met 


Leu 


Gly Gly 


Phe 


Gly 


Asp 


225 










n O A 

230 




















Z *i u 


Ala 


Glu 


Leu 


Leu 


Gin 


Arg 


Thr 


Ala 


Thr 


Val 


Ala 


Arg 


Glu 


Thr 


Leu 


Gly 










245 










250 










o c c 
ZOO 




Asn 


Arg 


Thr 


Ala 


Pro 


Ser 


Thr 


Thr 


Ser 


Tyr 


Asp 


Gly 


Gly 


Gly 


Ser 


Gly 








260 










265 










270 






Asp 


Val 


Gly Val 


Ala 


Gly 


Gly 


Leu 


Ala 


Gly 


Thr 


Ala 


Gly 


Gly 


Gly 


Val 






275 










280 










285 








Gly 


Gly 


Ala 


Gly 


Gly 


Ser 


Gly 


Gly 


Ser 


Thr 


Phe 


Met 


Leu 


Leu 


Leu 


Glu 




290 










295 










300 










Asn 


Phe 


Asn 


Asp 


Tyr 


Phe 


Pro 


Asn 


Tyr 


Asn 


Gly 


Ser 


Thr 


Val 


Ser 


Gly 


305 










310 










315 










320 


Thr 


Ser 


Thr 


He 


Ala 


Pro 


Gly 


Val 


Ala 


He 


Thr 


Gly 


Ser 


Arg 


Gly 


Ser 










325 










330 










335 




Gly 


Leu 


Leu 


Leu 


Glu 


Gin 


Asn 


Leu 


Thr 


Gly 


Leu 


Tyr 


Leu 


Asp 


Gly 


Tyr 








340 










345 










350 






Arg 


Leu 


Asn 


Cys 


Thr 


Asn 


Glu 


Thr 


Leu 


Asn 


Leu 


Thr 


Asp 


Ser 


Cys 


Gly 






355 










360 










365 








Glu 


Leu 


Arg 


Val 


Val 


Asp 


His 


Asn 


Tyr 


Trp 


Ala 


Leu 


He 


Leu 


He 


Leu 




370 










375 










380 










Phe 


Pro 


He 


Leu 


Thr 


Leu 


Phe 


Gly Asn 


He 


Leu 


Val 


He 


Leu 


Ser 


Val 












•3QA 










395 












Cys 


Arg 


Glu 


Arg 


Ser 


Leu 


Gin 


Thr 


Val 


Thr 


Asn 


Tyr 


Phe 


He 


Val 


Ser 










405 










410 










415 




Leu 


Ala 


He 


Ala 


Asp 


Leu 


Leu 


Val 


Ala 


Val 


Val 


Val 


Met 


Pro 


Phe 


Ala 








420 










425 










430 






Val 


Tyr 


Phe 


Leu 


Glu 


Pro 


Val 


Glu 


Pro 


Thr 


Thr 


Cys 


He 


Met 


Gly 


Ser 






435 










440 










445 








Gly 


Val 


Ser Gly 


Ala 


Ala 


Glu 


Gly Gin 


Trp Gly 


Gly 


Gly 


Ala 







450 455 460 



<210> 34 
<211> 11501 
<212> DNA 
<213> Drosophila 



<400> 34 

tttgtaccgc 

tttattaata 

aacatgaaag 

aaggatgtta 

aaattagttg 

caactcatat 

aaaattctca 

catggccagg 

atattaaaat 

ctcagcagac 

acctgaccca 

ccggcagacc 

aacaacacac 

gccaacttat 

aaaaaaggac 

aaaggatgcc 

ctcagtcgta 

atgccttaaa 

taccaagtaa 



agtcaagcgc 
ctgtgtaatt 
taatggtatc 
atgttaattt 
gtatttcctt 
attataaata 
caagctaacc 
cataaattcc 
aatatcacct 
caagtgaact 
cccatgccca 
aagaaacacc 
aagtggccgt 
gatggcttct 
ctgctctttg 
gacaagcaac 
cgcgagtcct 
tccccgagcc 
tagcttgtcc 



aaaaattcct 
attatttgtg 
tgtttttctg 
aataggtatt 
ttgaataagc 
taaataataa 
ttgtaatgat 
tgttactggc 
atctataggg 
tatcaatgaa 
gtggccagca 
cactaacatc 
aatgaaatgt 
gaatagccaa 
ctcgctcgtt 
aattttcatt 
cagtcagtca 
gtttggcggt 
aaagccacta 



cgaaaaacca 
atatatgtgt 
tattaaaata 
ccaaaactaa 
ttgaccttta 
taaatgagtg 
ggtacgccta 
accgcatcat 
ctgcggctct 
cacccgccac 
ccacccgaac 
tcgggagatt 
gaatagcctt 
ccgcatagta 
ggctttctct 
gataaaaaca 
gtttcgagct 
gttat tgaaa 
cgctgtcaaa 



tagaccaagg 
gggaaattgg 
acatatctga 
gagtgaactg 
acttacttca 
ctaataataa 
actgataccc 
catcatactg 
tttgacacgc 
acccacacgg 
tcgcagaagc 
cttcgtctgc 
ggcaaacatg 
cacggctaag 
ttcgcagcct 
ccagctgctg 
ctgggatgca 
attgcgtatt 
ggacgggctt 



tgtttatgag 
taatagatca 
taatttccta 
gcaaaatgta 
tagcctatga 
tttttaatca 
attatgttac 
gcattcggtc 
acacgtgctg 
tggcccaagc 
caatgaatgg 
cgagttcaag. 
agtccggaaa 
agcggggacc 
cggctcggaa 
ggtcctgaaa 
acggatgcgg 
cgccgcgtat 
aaaacgctac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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ttgcattttt acgtactata catagataaa ttttccgggt aatcggcgga aattgagtta 1200 

aagttttgtg tccggttggc gccagttggc aattggtgct ggccaacttt cggacccgaa 1260 

cgatgacaaa ctgaaaagta aactttgtaa agactcagca ccttacaaac aaattcaagt 1320 

gtttcgcgaa aaaaaataaa aaattttgaa ttgaagagga cacagttcgg tgaaaaaaaa 1380 

atatttgcaa tgcacaatgg aaaatcagca aagtgacctc caagtttacg aaaacagaaa 1440 

ctaactcaat aaccatccct gcaggtgagc agaggcaaca ttttttcttt aaaacatttt 1500 

ttaccatatt tccgtaaacg gtttgcgtcc tttggcaaat ttatggttgt cttggaataa 1560 

aatgaacttt atttacttgg caacaaaacg agtgtttatt cttgtcgcat agggaatcaa 1620 

gcagccaaca taaggactat attttgttta attcgattta atgtgagttg cagaattaaa 1680 

tttcttttta attcaacacg catctgtaca ccatttgtaa aattacaagc taaagctcag 1740 

caaacactaa attttaatca gtgaaacgaa ctgtattctg tcaattttaa tatcaaaagc 1800 

atatacattg ctcttaaaaa tattaaactg cgttagtaac gtaaacagaa aatgctatga 1860 

aatttgttat aatcaaaatt cacttgaaag agatttcaac aaaattacca atctcgtgtt 1920 

ctaaggaatt aatataataa ttataataat atcatagctt gttccctgtc acctaatata 1980 

taccatttaa ataaaagtaa tgcctaggta attgtttgtg agcgataagc gcagatttgt 2040 

ctgcataatt atgtgttaat ttgattgagg tctactgaag agggccttat ttttataagc 2100 

caaaagaaca aattcaaaca aaacacttga gggcggcctt agctggcaat aagcggcttt 2160 

tgcctaatgg aaaataagtg gaatgcctag gaattattgg ccaagtggac gggctgctct 2220 

ctttctggca atcctgagtc ctttcgactg ' attcgttttt ggctcctttg gccccatctc 2280 

cgcacgcttt gttcggcttg acacacctga gtggcgatgt tccacacagg cactttccca 2340 

aacaaatgga acccggcatc tcgcatttgg ttgcccgaac gtaagtccca tgaagtggtc 2400 

ttctccagtg gacagccagt cagccgcatt tcttgggggc gctcttctcg agcgcttggc 2460 

ctctgagatg gcctctttgg atgtcttgga gatatgcagc tcatcctggc cacccacttc 2520 

gccgagccat tccatttcat ttccctggcc aacatacccg tatgaaacga tgagtgctct 2580 

tcacagtttt tgcctacatt ccggccactg caccgactga tggttgatgg agacccattc 2640 

tccggttagt gttcgtcttg gtcgtcttgg tctgggtctc ggtcctggtc acaatgggaa 2700 

ttcgcagctt aaaggcgccg tggctttatt caggccagat tggatacttg acatgttcta 2760 

atgcggcgca gatctcagtt tctgcaacgt gctctcccta tctcagttac atgaattagc 2820 

attcatcgcc actgcactga gcaaaaaata ggaccacaaa tcgctctaac tgctgtaaat 2880 

tgatatccat tcaattattt tatttaagta ttataacaca ttgatagaac attaacatgt 2940 

catatatttt tgaacacaat tcagcattac actttaataa agtcgaattt gaatttatcc 3000 

aactgacgcg ttgggtgata tcataaattc aatttcgaga atgacaggtc attaaaatgt 3060 

tgtcacctac actgtcaaat atttaagtca aaatattaag ttataaaata tcaaaaagct 3120 

ttgatagctt tagagtatga tttacgttta aaaggattgt acacgaatac aattttgttt 3180 

ttttaagcaa tcagcggcgt gtttctgttt tgccatcata aatcgcagag tttcataatc 3240 

cctctaaatt aaaatgtcct ttgggatttt tctgaaattt ttttcaaacg cttccccctt 3300 

tttgtttcct gcatgtttag gccgttaaga gcctgttgat gatggcggca tggtagaact 3360 

ttctgttttg tcacagcctt gggctgctgg ctttacattt ggtggaaaaa cgagaccagc 3420 

cgagcacgtg cggtccgcgg ggttgggggg ggttgtggaa gctaaggacg tggcaggacg 3480 

aaagcgctga gtgcagatga ggactgcaat aaactgccat acataagtga aaattgtgaa 3540 

atgaaaactt gcgcgagcat aaatcaatac acccaagggg ggtcatgggc tcccatggca 3600 

ggcaacaaaa aaatatatat gtatatatat cagctgggaa atgtgctccc tactcaggtt 3660 

tagcgacttg tatttcatgt tcccaaatca gctggtgtta tcctttgatt gccgggcaca 3720 

cgacaaaagt ggtggaccag gacacatgtg atgattgctt gcgtgggtcg caggaatgag 3780 

ccaactcaga ttggtaatat tgtgtatgaa cgtgatttcg tgactttgca atgtttctat 3840 

ttaacaaaaa ctcttgttaa agaaatacac acaaaacatc acttataacc ctttttggat 3900 

atctatgagg atttaaggtt tgtaaactaa ggcaatggta taccttaaaa agtctatagt 3960 

aaatcgcata tattacatca attttaacgt tcatgctgat cttccagttc caaaatatat 4020 

attttgaaat agcttaca'tt taattaagtg cacttgctat aaaagttttg agcttgcctg 4080 

aagtttatta cataaaaaat tcgcagaaag gcggctaaag aatgaaacta atttaagccc 4140 

aaatgataaa ctatttgcca agcatgtttc gtttgaatta tgctcccgtt gctctgccta 4200 

ccaaaaatgg caaaaactcg caatatgatt attgtggcag cttcttccgg tgaattaaaa 4260 

atacaaaaaa ccagctcaaa ttgaaactta attttaataa attgttttca tgctcttttt 4320 

cgctgccgtt gtagcgtttt cccgcccatt tcacccgagt ggctggataa ttgtatccgc 4380 

gcaatcatcc cgcggcttcc attgccgcag gataaaaacc aaactgcaac gaacagaaca 4440 

gaactgagca ctttaagcca tttccactcg taattaaatt ttgagagcgg atgaaattgt 4500 

ggcggctaaa taaaaggaac aaaatgcatc tgtgcataca gcaaaaaaaa aacagcaaaa 4560 

ataaataaca aaatgcagaa gagcaaaaaa tcgcgcatat ccgctttgta aaaatacgag 4620 

tctaagtgat gggttccgtt tttgtttagt agtgctcact agtgaacggc cttgtcgcat 4680 

ttggggctga tagtggatgt agtgtggggt gtattgtggt gtgcatgggg gtgtttgaat 4740 
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cgcagaagtc 
gaatgggcac 
gctaacagct 
gaatgacact 
cgctaatttc 
ttgaaaagtt 
aaattgcaaa 
aatccatcgg 
ggataagagg 
tgccttagta 
tgaaatttaa 
cgttgcgttg 
acatttaaat 
aacatgccac 
tttagtaaaa 
caaacaaaaa 
cctgtccctt 
gtacttttaa 
tggcagccaa 
tttattgcca 
caacgaaatt 
acccttttgt 
aaatggcaag 
tgtaaatgta 
tccagttgct 
ttagtttctg 
caaaagttgg 
ccaaacccgg 
aattaaatac 
cattgcggcg 
gtctggctgc 
cttcgtatac 
agtattttta 
ttcgaacaac 
acatgcgaag 
ctcgcgcttt 
ccacttccat 
gagatatata 
gcttgatata 
aaaatatata 
aagctttatc 
tacaattatt 
ccgcacttgc 
ggtttgtaaa 
agttcggcat 
gcttttaata 
gaaaaaatga 
ttgctgcacg 
atttcttagt 
ggtgactaaa 
aagagcatcg 
acagctaatt 
gcaaccacat 
cagcaaggga 
cggaatcctt 
ttgcgggcac 
agtcagtcat 
acagcagctt 
acagccattc 
ctgtgaatgt 



ggcccagccc 
aaacaagtca 
cttcgtttct 
caattggcct 
gaatataaca 
tatacctata 
aatgatggat 
tgttgaaagt 
atttatcttg 
ttcaacattt 
taacattgcg 
aaactaatat 
tttgaattta 
ctaacaaaag 
gcgaatatct 
tgcgtgttat 
gttgatttca 
cgagcattcg 
aaaagctgac 
atggaataac 
tcggttggca 
gaaagccccc 
ttacgccgtt 
ataagcaaat 
caagttttgc 
gccaagaatt 
tatttgagtt 
cgggctcacg 
attctcgcag 
ttaattaaat 
cagtcgttaa 
acttgagatt 
aaatatatac 
tcatttgaca 
ggggaatcga 
tgcggagaag 
gaactgaaaa 
tatgatatgt 
aaaaaagaaa 
aagtccatta 
gaactttatc 
tatgcacgct 
tacatttatc 
atacattata 
tgtgttaaac 
actttccctt 
agaaaaaaaa 
caaatgcttc 
tgcgaataaa 
catattttaa 
attaagtcaa 
gcagctaacc 
tgcaccacta 
atccgcaccg 
gccaaatgcg 
gtaccaccac 
ggaggccatc 
ctggctaacc 
cactgggagc 
aggcaaggac 



gttggccgtc 
cgcctgcctg 
ctctctcttt 
aatcggtttc 
aaaagcaaaa 
tgtataatat 
gaacatctca 
gagcacttta 
agctcaaaaa 
ttgccagatt 
cagctttttg 
tattttaaat 
aaaaggggtg 
cgaaaacaat 
aagcaaataa 
taaaatcatg 
tgctgataaa 
aaggtatcct 
aaatttgtgc 
aggttggctt 
ttcagtcgga 
ggagaaaaga 
ttcaaagggc 
ttcgcagttc 
tccttggcag 
tgtctttggc 
tcgcagtgga 
tggccgcctt 
agatttacat 
ttgcagaccg 
caaatcatga 
aaaaattctt 
aattaatcta 
acatcaaaga 
atttccaata 
ccgtgatatg 
ttggttttca 
aatgtctttc 
gaaactttga 
gctgtagctt 
ttatagatac 
gcattgacat 
agcaattgaa 
atgtttaata 
tttaagcgaa 
tcgatgccca 
gtttggctca 
tcaaaaaaaa 
ttccgattta 
gcagaccgct 
ctgtcttttt 
ccattggcat 
ccaccagcag 
ctgaaactga 
gccaaatgcg 
tactacctcc 
atcatgacga 
ggtgccctct 
acaacctcga 
cacgacaagc 



aagaggacct 
cgcatttcat 
ctctgctcag 
tgaaaagcat 
acgcttttga 
ggaacctttc 
acattcaaca 
tacaaaaata 
ctgatgcagt 
tatcgaaacg 
gtgaaagcag 
tatacaggat 
tgacagattt 
tgtttaatta 
gtcgtgtaat 
attactgcca 
ggacatacac 
gtgcccattt 
cggaaaggaa 
cagttgcgta 
atgaaagtga 
caactttcaa 
caactatttt 
tccataggac 
ctgcacttgt 
caagtactgc 
aaagcacttg 
ctagttattt 
aataagtaat 
attgcatttc 
ttgctgaaat 
taataagggg 
gtcagcgctt 
atccaatatt 
atttaaaaag 
gtgactgtat 
ctgcctgttt 
tatatattgt 
acaaactcaa 
acaggagcat 
atatatctat 
atgccaaaca 
ttaatttcag 
atcagcagag 
tagttaacac 
cacaacctga 
agtttattgg 
aaaaaaaaaa 
agctggttga 
caaactcaaa 
atctgtccca 
ttagcaaacg 
catcagcgcc 
atctgaaggc 
aggcgctggc 
tgttcggagt 
ccctgcccaa 
cgctctccga 
cggctgggag 
acgtgaatga 



ggcctaatcg 
caaagaacgg 
ctgggtcctt 
agtatgaatt 
tatctaaatt 
taaatatagt 
ggttgcccag 
ctcatgagtg 
gatgcaatat 
taaataaata 
ttttaattta 
acaaaaaata 
actgaataat 
catttgtgca 
ttgatatgca 
tacgtataga 
ttttaaatct 
caaaaaagct 
acaacttcct 
tgtccctgct 
aaacaaaaag 
ttggccaatt 
gtttttgaag 
acctttacac 
tgttcctgtt 
tcacgctcgg 
cggcagatct 
aataagtaat 
ctacgccgaa 
ggaaacttga 
ggtcaacttt 
aaataacact 
ttgacgtaat 
cagcaatttt 
gaatcggaat 
ttatgaaatt 
ttggtcgata 
agcatttacg 
gtatataaaa 
aagttttctt 
tactaggata 
cttaaagtat 
gttgcggttc 
agcaggaaaa 
agtgttattg 
gttattttcc 
tttcgcgctg 
actaactctt 
aaaaaaccca 
ccgtgggtgg 
cttttagggt 
gaaaatgcat 
accacttcca 
atcccagtgg 
aagtaaataa 
agctcgtaaa 
cctgacaacg 
gatgttggcc 
ttcggccacc 
cagcgtttcc 



aatgaatgta 
cgaacaagag 
tgaagatcct 
tcatacggta 
gtatttgcca 
aattgatacg 
ttacttaatc 
gttctaaagc 
tcaagatgtg 
aaaatataca 
ataaagcaat 
atatacatat 
aataacttta 
acttgtggca 
atatagcaac 
tgtttcatgt 
tgctaacttg 
gtaagctctt 
atttgcatga 
attcgatttt 
tgctactact 
agtgctattt 
ttccgcgagt 
tgagttcctt 
tgccagaaag 
tttctttggg 
tcaaattctg 
taagttccct 
cgtcttcctc 
tttggtttgc 
gggcatcgca 
tatttataga 
ttatattgag 
aaacatactc 
tttacatttt 
tgcttttacg 
taataatgct 
aagggttata 
aggtttcaat 
gtttaagacg 
agtaagtaat 
tcccacccag 
agctgagttt 
atatgtatta 
gttacccgct 
ttaatggtct 
ctttcatcaa 
catttatggc 
tcctttcaac 
ttaattaaat 
cacccggtgg 
cgtgaccgct 
cctccagcag 
cggcgacaca 
gcgtaatcct 
tatttggcca 
gatgcaggtg 
aactcgagcc 
gaatcatccg 
acgggtctga 



4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
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ggtaagtgca 
agctcatgac 
tgcattcaac 
aatcagatta 
aagtatctat 
ttttggctta 
ctgaaaccat 
att taaacat 
tcataaactt 
ttgaataaat 
ctacattcac 
gctacagttc 
gatttcggtg 
atctctgcgg 
gctcatcaaa 
cattggtaag 
ttcggcttgc 
taaattggcc 
gattctcttt 
tagcctacac 
caccgagcct 
tcccacagcc 
taccctcacc 
gagacgctgc 
cctgttcgct 
cctcaccacc 
actgttcttt 
tatactgaag 
ggagcagaag 
accgtacgcc 
gggatcgatg 
tgcggccacc 
gctgcgaagg 
ccaccgattg 
ctatctgatg 
cgttgtggtg 
aaggctgacg 
taacataaac 
ttagtcaaac 
ctgtttaatt 
ataaaaaaac 
gcactgcact 
ggactcaagt 
aactgaaatc 
ttgttcatct 
taataataaa 
caacgacgac 
ggggagtagg 
caggcatatg 
gaaattagta 
agattataca 
gttaacaagc 
caaagcagcc 



tgtgcctagg 
tgcgggctca 
taatgtgctt 
cttagaacct 
cgaaactgaa 
tgaagttgca 
aaattattaa 
aagttacact 
agcttagagc 
tgaaacattt 
tacagggaca 
gagccaccta 
gtgggatgtg 
actcctgcca 
tgcccgattg 
cttaaaactt 
atttatgtca 
caaaatgctt 
tcccgaaagc 
atcttggttc 
tttgtttgcc 
tgtcgcctct 
gccatcgctt 
tcccgcctgc 
gtgatgccgg 
tgcagcttcg 
gtggctgcct 
gtggtcttca 
ttggccttca 
atcgttgcca 
atacccgcgc 
catccgcggt 
gtctccacga 
aggacgagca 
aacaacaatc 
gccgagatca 
attatgccag 
atttttaatc 
ctatgatcac 
tccacttttt 
ctttcctatc 
gcaaggcata 
ttcattgttg 
agcaatggca 
gagtgaatgc 
tagaaatgaa 
acagtggtga 
cgatcagagc 
tacatatgta 
caaagaaaga 
tcatttcagg 
ggatgtgcag 
tgcggacatc 



cttttccatt 
acaagtcatc 
agaattctac 
cagaacagtc 
aagaagagga 
aaaagaattc 
caattaaata 
actgcaagga 
aatctaaacc 
acttttccca 
agtacgacct 
agtcgagcac 
tgggcaatgc 
atatcctggt 
ccatttacaa 
ggggaaataa 
aaaatcgatc 
ggactgattt 
aattaataga 
atgaaaaaca 
ttgaaacggg 
atggatttgt 
tggatcggta 
gatcctacct 
ccctggatat 
attatctgaa 
actgcattcc 
cagcgagtcg 
ttgtggcggc 
tgatgggtgt 
tcttctgcaa 
ttcgtgtcga 
ccagatcatc 
ccaccggtga 
tgatgatggt 
acaattcgat 
ttaagagaac 
agttttaagt 
gatgatactg 
ccaattgaaa 
tcgaaaagta 
tcaaaatgga 
aactaatcta 
gcaaagtaat 
gttgtcattg 
gcagggaaat 
tagtaaacat 
agggaatgga 
taagacatcg 
gggtggcgtg 
gaaagtcact 
attgtttgca 
ttgtatttct 



tccatggcaa 
gcgataagct 
tattgtacat 
aatacaatcc 
tcattagctt 
ttaaaaaatt 
ttaaattatt 
ggaaagtatc 
aaattgaaaa 
t ttccccagc 
gagctacatt 
cttcctgatt 
gttcgtcatc 
gatgaatctg 
taacatcaaa 
cttactaaat 
aacaaaagca 
cagctgtttc 
atttaaatga 
tattccaaaa 
tggttcatta 
gggtggccta 
caatgtggtg 
gatcatcctg 
cggactatct 
caaggagatg 
actgacctcc 
aatacaatcg 
tatcattggt 
cttcggcctg 
gacagctgcc 
ggtgcgaatg 
gtacatgacc 
gggcggaatg 
gcccgaggag 
tagcagcgtc 
ggaaattctg 
tttacagacg 
aaaactttgc 
atttcaattt 
aactgttcgc 
agctatcaat 
atctaatttg 
ttcgctcggc 
atggacctac 
ccgccgagca 
atttaatgag 
ttagttagcc 
gaatagcagg 
ggtaggggcc 
cggacccaag 
ttgtagttta 
gaaatggagt 



cacacacaca 
cagcttcatt 
aactaacata 
acttaatata 
gggtacaaca 
caatacaatg 
tatattgatt 
agatgacttg 
ctcattgaaa 
aattacagta 
gccaaggtga 
atggccgccc 
tttatgtttg 
gccatctgcg 
gagggtccag 
atctatataa 
atctacataa 
ttgcagaaat 
aacaataaaa 
tgtaatggct 
aaaatcctac 
agtggcacct 
gtgcatccac 
ctgatctggt 
gtctatgtgc 
ccagctcgca 
attgtgtact 
aacaaggata 
ttgtggttcc 
gagaggcaca 
tgcgtggatc 
ctcttctacg 
cgatcacgat 
ggtgatcacc 
acggaggaga 
atggaacaga 
tcaatctgtt 
tgttttatat 
ttggaatatc 
gatttttgcc 
tgttaattga 
aatcgtaaat 
tatgtagaaa 
aatgatatat 
aattgaaatg 
gtgggaaatt 
cttatgtgat 
tatatacata 
ccaattagag 
gaagtggaaa 
gtcaagccgt 
atattggaat 
c 



gattccatta 
ttccaggaga 
tagggaatat 
aatgaattaa 
tttataagaa 
tcaaatagtt 
taattcatac 
agctatactt 
cccattttaa 
attaccccag 
atcccttttg 
tgtattgcct 
ccaaccgcaa 
actttctgat 
ctttaggaga 
atgaaattca 
taaataaaat 
tcatttgtgg 
gttcaatttg 
attaaaattt 
tgactttccg 
gtgccatcgg 
tgcaaccgct 
gctacagctt 
cagagggctt 
ttttcatggc 
cctacttcta 
aggccaagac 
tggcctggtc 
taacgccact 
cctacttgta 
gacgcggagt 
cgtcgttcac 
gcatggaaaa 
acgaggagat 
gtaagttctg 
tcgaaccaac 
aatttgaaaa 
tgaaactcgc 
caggtaattt 
aaatgtttgt 
tgaatggatc 
taatagaccg 
gccaatggca 
aaaataataa 
ggcaaaggca 
gtttagacag 
tatatataca 
aggcagggcc 
ttcaatagtc 
ggtattggat 
aactcacaat 



8400 

8460 

8520 

8580 

8640 . 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11501 



<210> 35 
<211> 2177 
<212> DNA 
<213> Drosophila 
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<400> 35 

ctgggatgca 

attgcgtatt 

ggacgggctt 

aatcggcgga 

ggccaacttt 

ccttacaaac 

cacagttcgg 

caagtttacg 

gctaattgca 

accacattgc 

caagggaatc 

aatccttgcc 

cagtcatgga 

gcagcttctg 

gccattccac 

tgaatgtagg 

attacagtaa 

ccaaggtgaa 

tggccgccct 

ttatgtttgc 

ccatctgcga 

agggtccagc 

gcacctgtgc 

atccactgca 

tctggtgcta 

atgtgccaga 

ctcgcatttt 

tgtactccta 

aggataaggc 

ggttcctggc 

ggcacataac 

tggatcccta 

tctacggacg 

cacgatcgtc 

atcaccgcat 

aggagaacga 

aacagagtaa 



acggatgcgg 
cgccgcgtat 
aaaacgctac 
aattgagtta 
cggacccgaa 
aaattcaagt 
tgaaaaaaaa 
aaaacagaaa 
gctaacccca 
accactacca 
cgcaccgctg 
aaatgcggcc 
ggccatcatc 
gctaaccggt 
tgggagcaca 
caaggaccac 
ttaccccagc 
tcccttttgg 
gtattgcctg 
caaccgcaaa 
ctttctgatg 
tttaggagac 
catcggtacc 
accgctgaga 
cagcttcctg 
gggcttcctc 
catggcactg 
cttctatata 
caagacggag 
ctggtcaccg 
gccactggga 
cttgtatgcg 
cggagtgctg 
gttcacccac 
ggaaaactat 
ggagatcgtt 
gttctga 



atgccttaaa 
taccaagtaa 
ttgcattttt 
aagttttgtg 
cgatgacaaa 
gtttcgcgaa 
atatttgcaa 
ctaactcaat 
ttggcattta 
ccagcagcat 
aaactgaatc 
aaatgcgagg 
atgacgaccc 
gccctctcgc 
acctcgacgg 
gacaagcacg 
tacattcact 
ctacagttcg 
atttcggtgg 
tctctgcgga 
ctcatcaaat 
attgcctgtc 
ctcaccgcca 
cgctgctccc 
ttcgctgtga 
accacctgca 
ttctttgtgg 
ctgaaggtgg 
cagaagttgg 
tacgccatcg 
tcgatgatac 
gccacccatc 
cgaagggtct 
cgattgagga 
ctgatgaaca 
gtggtggccg 



tccccgagcc 
tagcttgtcc 
acgtactata 
tccggttggc 
ctgaaaagta 
aaaaaataaa 
tgcacaatgg 
aaccatccct 
gcaaacggaa 
cagcgccacc 
tgaaggcatc 
cgctggcaac 
tgcccaacct 
tctccgagat 
ctgggagttc 
tgaatgacag 
acagggacaa 
agccacctaa 
tgggatgtgt 
ctcctgccaa 
gcccgattgc 
gcctctatgg 
tcgctttgga 
gcctgcgatc 
tgccggccct 
gcttcgatta 
ctgcctactg 
tcttcacagc 
ccttcattgt 
ttgccatgat 
ccgcgctctt 
cgcggtttcg 
ccacgaccag 
cgagcaccac 
acaatctgat 
agatcaacaa 



gtttggcggt 
aaagccacta 
catagataaa 
gccagttggc 
aactttgtaa 
aaattttgaa 
aaaatcagca 
gcagggtcac 
aatgcatcgt 
acttccacct 
ccagtggcgg 
tcgtaaatat 
gacaacggat 
gttggccaac 
ggccaccgaa 
cgtttccacg 
gtacgacctg 
gtcgagcacc 
gggcaatgcg 
tatcctggtg 
catttacaat 
atttgtgggt 
tcggtacaat 
ctacctgatc 
ggatatcgga 
tctgaacaag 
cattccactg 
gagtcgaata 
ggcggctatc 
gggtgtcttc 
ctgcaagaca 
tgtcgaggtg 
atcatcgtac 
cggtgagggc 
gatggtgccc 
ttcgattagc 



gttattgaaa 
cgctgtcaaa 
ttttccgggt 
aattggtgct 
agactcagca 
ttgaagagga 
aagtgacctc 
ccggtggaca 
gaccgctgca 
ccagcagcag 
cgacacacgg 
ttggccaagt 
gcaggtgaca 
tcgagccaca 
tcatccgctg 
ggtctgagca 
agctacattg 
ttcctgatta 
ttcgtcatct 
atgaatctgg 
aacatcaaag 
ggcctaagtg 
gtggtggtgc 
atcctgctga 
ctatctgtct 
gagatgccag 
acctccattg 
caatcgaaca 
attggtttgt 
ggcctggaga 
gctgcctgcg 
cgaatgctct 
atgacccgat 
ggaatgggtg 
gaggagacgg 
agcgtcatgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2177 



<210> 36 
<211> 551 
<212> PRT 
<213> Drosophila 

<400> 36 

Met His Arg Asp Arg Cys Asn His lie Ala Pro Leu Pro Pro Ala Ala 

15 10 15 

Ser Ala Pro Pro Leu Pro Pro Pro Ala Ala Ala Arg Glu Ser Ala Pro 

20 25 30 

Leu Lys Leu Asn Leu Lys Ala Ser Gin Trp Arg Arg His Thr Glu Ser 

35 40 45 

Leu Pro Asn Ala Ala Lys Cys Glu Ala Leu Ala Thr Arg Lys Tyr Leu 

50 55 60 

Ala Lys Ser Val Met Glu Ala lie lie Met Thr Thr Leu Pro Asn Leu 
65 70 75 80 

Thr Thr Asp Ala Gly Asp Ser Ser Phe Trp Leu Thr Gly Ala Leu Ser 

85 90 95 

Leu Ser Glu Met Leu Ala Asn Ser Ser His Ser His Ser Thr Gly Ser 
100 105 110 
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Thr 


Thr 


Ser 


Thr 


Ala 


Gly 


Ser 


Ser 


Ala 


Thr 


Glu 


Ser 


Ser 


Ala 


Val 


Asn 






115 










120 










125 








Val 


Gly 


Lys 


Asp 


His 


Asp 


Lys 


His 


Val 


Asn 


Asp 


Ser 


Val 


Ser 


Thr 


Gly 




130 










135 










140 










Leu 


Ser 


Asn 


Tyr 


Ser 


Asn 


Tyr 


Pro 


Ser 


Tyr 


He 


His 


Tyr 


Arg 


Asp 


Lys 


145 










150 










155 










160 


Tyr 


Asp 


Leu 


Ser 


Tyr 


He 


Ala 


Lys 


Val 


Asn 


Pro 


Phe 


Trp 


Leu 


Gin 


Phe 










165 










170 










175 




Glu 


Pro 


Pro 


Lys 


Ser 


Ser 


Thr 


Phe 


Leu 


He 


Met 


Ala 


Ala 


Leu 


Tyr 


Cys 








180 










185 










190 






Leu 


He 


Ser 


Val 


Val 


Gly 


Cys 


Val 


Gly 


Asn 


Ala 


Phe 


Val 


He 


Phe 


Met 






195 










200 










205 








Phe 


Ala 


Asn 


Arg 


Lys 


Ser 


Leu 


Arg 


Thr 


Pro 


Ala 


Asn 


He 


Leu 


Val 


Met 




210 










215 










220 










Asn 


Leu 


Ala 


He 


Cys 


Asp 


Phe 


Leu 


Met 


Leu 


He 


Lys 


Cys 


Pro 


He 


Ala 


225 










230 










235 










240 


lie 


Tyr 


Asn 


Asn 


He 


Lys 


Glu 


Gly 


Pro 


Ala 


Leu 


Gly 


Asp 


He 


Ala 


Cys 








245 










250 










255 




Arg 


Leu 


Tyr 


Gly 


Phe 


Val 


Gly 


Gly 


Leu 


Ser 


Gly 


Thr 


Cys 


Ala 


He 


Gly 








260 










265 










270 






Thr 


Leu 


Thr 


Ala 


lie 


Ala 


Leu 


Asp 


Arg 


Tyr 


Asn 


Val 


Val 


Val 


His 


Pro 






275 










280 










285 








Leu 


Gin 


Pro 


Leu 


Arg 


Arg 


Cys 


Ser 


Arg 


Leu 


Arg 


Ser 


Tyr 


Leu 


He 


He 




290 










295 










300 










Leu 


Leu 


He 


Trp 


Cys 


Tyr 


Ser 


Phe 


Leu 


Phe 


Ala 


Val 


Met 


Pro 


Ala 


Leu 


305 










310 










315 










320 


Asp 


He 


Gly 


Leu 


Ser 


Val 


Tyr 


Val 


Pro 


Glu 


Gly 


Phe 


Leu 


Thr 


Thr 


Cys 








325 










330 










335 




Ser 


Phe 


Asp 


Tyr 


Leu 


Asn 


Lys 


Glu 


Met 


Pro 


Ala 


Arg 


He 


Phe 


Met 


Ala 








340 










345 










350 






Leu 


Phe 


Phe 


Val 


Ala 


Ala 


Tyr 


Cys 


He 


Pro 


Leu 


Thr 


Ser 


He 


Val 


Tyr 






355 










360 










365 








Ser 


Tyr 


Phe 


Tyr 


He 


Leu 


Lys 


Val 


Val 


Phe 


Thr 


Ala 


Ser 


Arg 


He 


Gin 




370 










375 










380 










Ser 


Asn 


Lys 


Asp 


Lys 


Ala 


Lys 


Thr 


Glu 


Gin 


Lys 


Leu 


Ala 


Phe 


He 


Val 


385 










390 










395 










400 


Ala 


Ala 


He 


He 


Gly 


Leu 


Trp 


Phe 


Leu 


Ala 


Trp 


Ser 


Pro 


Tyr 


Ala 


He 










405 










410 










415 




Val 


Ala 


Met 


Met 


Gly Val 


Phe 


Gly 


Leu 


Glu 


Arg 


His 


He 


Thr 


Pro 


Leu 








420 










425 










430 






Gly 


Ser 


Met 


He 


Pro 


Ala 


Leu 


Phe 


Cys 


Lys 


Thr 


Ala 


Ala 


Cys 


Val 


Asp 




435 










440 










445 








Pro 


Tyr 


Leu 


Tyr 


Ala 


Ala 


Thr 


His 


Pro 


Arg 


Phe 


Arg 


Val 


Glu 


Val 


Arg 




450 










455 










4 60 










Met 


Leu 


Phe 


Tyr 


Gly 


Arg 


Gly 


Val 


Leu 


Arg 


Arg 


Val 


Ser 


Thr 


Thr 


Arg 


465 










470 










475 










480 


Ser 


Ser 


Tyr 


Met 


Thr 


Arg 


Ser 


Arg 


Ser 


Ser 


Phe 


Thr 


His 


Arg 


Leu 


Arg 










485 










4 90 










495 




Thr 


Ser 


Thr 


Thr 


Gly 


Glu 


Gly 


Gly 


Met 


Gly 


Asp 


His 


Arg 


Met 


Glu 


Asn 








500 










505 










510 






Tyr 


Leu 


Met 


Asn 


Asn 


Asn 


Leu 


Met 


Met 


Val 


Pro 


Glu 


Glu 


Thr 


Glu 


Glu 




515 










520 










525 








Asn 


Glu 


Glu 


He 


Val 


Val 


Val 


Ala 


Glu 


He 


Asn 


Asn 


Ser 


He 


Ser 


Ser 




530 










535 










540 










Val 


Met 


Glu 


Gin 


Ser 


Lys 


Phe 





















545 550 



<210> 37 
<211> 6472 
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<212> DNA 

<213> Drosophila 



<400> 37 

tatgcgcaac 

gctcgtctgc 

tggccacgag 

acctaattaa 

cccacgccag 

agccattaaa 

cctggggcat 

aatttatttg 

cgcatgcttt 

tagatctact 

ttcctcgctc 

ttgggttttg 

tcttggtgtt 

catgctaaac 

ctgcttcact 

catccgtcgc 

gcgccctgtc 

ctttattatg 

tgccgcgtcc 

cactccggga 

cgggccgaga 

aaacgggcaa 

aaaaggaggc 

gaaaaaaaac 

agggaggatt 

atatttaaac 

taagttcgcc 

ttggaatatt 

atactttttt 

aatgcaatgt 

agatacacgg 

atatatatgt 

ttatatccgg 

taatatgggc 

acaagcggtc 

cccagactgg 

tcctgcctgc 

accgtatcca 

gtttgctttc 

agtacaagct 

tcagcgactg 

tcgtgaccct 

ccatgctgcg 

tcctaatgct 

tcaccctgtg 

actacagctt 

ttctttacaa 

cagggcgatt 

ggcgcagagg 

caaatgccac 

gcgaggatat 

tggaggccat 

gtgatagccg 

gtttcgacga 

gtcgagaaaa 

aagcgagatg 



tcgacaaaca 
acgcccaccg 
atgtgtgagt 
ctgctgttct 
tgtgctgacc 
ggccggctac 
tgctgcgctc 
taaactagca 
ttggtgtttg 
tcgtttgtgg 
attaattata 
cacatttttc 
tggccttttg 
tgcgactgtc 
cgcttcatcg 
ctcaagttgc 
atctttaata 
gtgcgctccg 
ataacaatga 
gtcatggggt 
cccattaatt 
acatttttaa 
gtgtcaaata 
acggccatcg 
tgggctgggg 
ttgtctgttt 
atctactcac 
t tgagcttta 
aaaatgagtg 
tgatattata 
tatacatttt 
acctctccca 
ttttggttta 
ttcaacatat 
atgaaaatcg 
gttaaccttt 
tgaaacctct 
ttatccatta 
tctccccaat 
ggccgagtac 
gacgctggcc 
ggtggtgctg 
cgcccgtccc 
cggcgccgtt 
gatcatcctc 
gctgtacttc 
cctgatgtcg 
actgctggaa 
taccttgtca 
aggagccacg 
cagtcgcacc 
ggccatgctg 
tctcatgcct 
ggagtcactg 
gcttcctgga 
ggcgtgaaaa 



tcttcctaac 
tcctcgtgga 
ggcttcagct 
atcccaggca 
atcttggcca 
gtctgcacca 
tttacgaggt 
ttatacttta 
tagtcctaca 
cctatatccg 
agtgtcaatc 
cccgttttgt 
tgcgccatct 
gcctatctat 
ttgccattgc 
ggacggcttt 
agcaccgatg 
tccaaggacg 
aataaacagc 
catctatggt 
acgccacaca 
acacattcaa 
gctgacgcac 
cctgttgact 
ctgtgtttaa 
gctggatata 
ttatcccctt 
gttaaatgta 
gtttcatttt 
tttcaaattg 
ttttctagac 
tcttttagct 
aaaaccaatc 
attttaaatg 
cgtcctggcc 
tgcccatcgc 
tctttttccc 
tccacgcccc 
ctgcaatatc 
attgatggat 
ttcttcctga 
tacggcatca 
acgaagccgg 
gtgctgtcct 
agcacggacc 
tgtaggatca 
accaagtttc 
ttggtgacat 
ctgggcatgg 
agctccagca 
cgccttaaga 
cagcattcca 
ctgagaaatc 
gaggagaata 
atagcaagag 
atcgccttta 



caacctgagc 
ggtcaacact 
ccagtgctcc 
aggctgtgcc 
tctcgttcga 
aggggcgggc 
aagcgattag 
aatatcttaa 
tttcgtatgt 
tttattcatt 
gccgtcattt 
ttattttcca 
acgctggcgg 
tccatttcca 
catttccatt 
tgtccaagtt 
tccgctatca 
agcggcgacc 
acattgtcga 
tggcagctgg 
agaagcccga 
acagacgcgc 
gtaccaaaac 
gtcatggcat 
tcagcggctc 
tttacaaagt 
gtaagtgaaa 
ttttaagatg 
tattaaattt 
gcttaacgcc 
agaaatatac 
attaaatagt 
attgtttttt 
atgaacccaa 
agactggtac 
cattctctca 
gccactcttg 
cggtaacccc 
ccccgagcag 
cgtcggtggc 
tgaccatctc 
tcgcccggaa 
agcttagtct 
tcttcgtctg 
aaacactgca 
tgttgtacct 
gaaggggttt 
tgggaagaag 
gcaccaatac 
ttctctcgag 
tcgagatgca 
ccttgggcaa 
atcaaccacg 
agcgtagtga 
aaatagtaaa 
acccgtttac 



attgcggacc 
cgcccggaga 
tggtctattt 
gttcgtggag 
gcgctattac 
catcctcatc 
gctcgtactt 
ttagtttggt 
acatatggta 
tcgcccagtc 
taatgatttg 
ttatgacgtg 
cgtgtccatt 
ctcctgtcct 
cccaatccca 
ccggaagctt 
tggatttcat 
ctttttaatt 
gctggccacc 
gtccgcagga 
ggccaaaaaa 
cacacgacgc 
gcccggatca 
ttcgttttcg 
ggataactca 
caacagagta 
tgtaatcaca 
tatcgatata 
ctgttattat 
tacagattag 
tttaattcaa 
cgaggaactc 
tttaacgaca 
gaaatgaaaa 
attggaaact 
tcatcacaaa 
gccttttcct 
gccacctgat 
ccccattctg 
cgtgtgcctg 
ggtgttcttc 
tctggtctcc 
gaaggcccgc 
cctgctgccc 
cgatctggga 
caactcggcc 
caagcggctt 
gaaggaagac 
gaacacgaat 
aagctccaat 
gatgccatgt 
gggaattgcc 
acgccacaag 
agcgaaaatc 
tctgaccgaa 
acactcgctt 



tgctggtgct 
cctgggtact 
gttctcccca 
ctcacggtgg 
gcaatctgcg 
tgcgtcctgg 
gattttctta 
cgcttttcat 
ttatcctgtt 
aagcattaag 
tgttttgtta 
gaatttctgg 
tgtccattta 
accacttctc 
ttcccattga 
gccgttatca 
ttcattgaca 
ttgcaaatct 
gacccctcag 
cacccgccta 
aaaaaaaaaa 
ccatgggcgc 
tctttagtgg 
aggtctagcc 
aaataatcaa 
tatcccacca 
tcaatgaaat 
actcatacag 
aataatttta 
aatacactaa 
ctaaattaat 
gactatagta 
cagtactatt 
aaccataaaa 
tgaacttcat 
cagcttctct 
ccaaaatcca 
tacgtttctt 
tgggtggccg 
acccaggcca 
gtggtgccgt 
aacagggcgg 
aagcaggtgg 
ttccgcgtcc 
ttggtgcgct 
atgaatccga 
tgtcaggacg 
tcttctcgcg 
accaactcct 
cgcaggtgca 
ggcagcgacc 
aggcgagtga 
ccgcaaataa 
ccaacaaaat 
aataccctct 
cacttggctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 



56 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 PCT/US01/09341 

gaatgtgccg atgacgctga tcccgatccc gattcctatt cccgctcctg gtaaacacaa 3420 

atcgcttcct gaacgctttt ggcagccata atcgatttca gtcacccgct gacagctgcc 3480 

gagaatcccg aatcgtagtt gggctgccgc cttggacatt tccgtaccaa aaacaatcaa 3540 

tttttcgggc tattaaactc gaatgaatgt gccaaagata aataacagca acgacagccg 3600 

aacagcacta aaactaccac tactggtcag ggttaggagt ccagatatcg actggcagta 3660 

aggggtaagc cattaggtgg ccttcgagga gcggagctgg caaatggaaa tggcttaata 3720 

tgcgtttggg cgaaccatca ggtcggcgga taaacatgat ttaggtgtat tgaaatagtt 3780 

tccgtacgcc ggaccgtaca aaatggccgc cattacacgg gcggccatca ccggaagtcg 3840 

aaaaaactgt tgcctacttg ccggcagctc aaataaaaag cttagctcaa tatgcacaca 3900 

atgtccacat gtaaacacag ctgtacatac acatacatac ccccacacgc acacatagcc 3960 

aaatattaat atttatattt gtacgaacat atttatatac acatatttac atatatatag 4020 

tctgcttata aatatgtaca aggaaccaca cgcacacact cacacacaca cgagagtttg 4080 

taagtaaata aattttttat gacaactcat ggcgggactt ttccacatta ccccatceaa 4140 

tcggaatctg taatttatat aattctaagg aaatcggaca aatgttcgtc ctctgtaaat 4200 

atacgtattc gtaccatgcg cgtagctttt taatttgttt tcgtagccct atgacgtaga 4260 

ctcccacttt tgtagagtaa aaagtaataa gtagcttgaa cccacaccat cctgcgagta 4320 

tgatgagtat gagcagacgc tgccgtttcc gtttccgctg tgtgattaag ttgagtgctt 4380 

cgcccccgaa acttttccag tagttagccg ttgagtagct tttaaatatt tgtttgtttc 4440 

ctgcccgcag cgcgataaaa ctttgcagac agcgctagtc actctagtcc ccgctaaaac 4500 

tgataatatg attcttgcag tcccatcatc gccatatcca cctacagcgt ggaaccctac 4560 

ggggacggaa ccgatgctcc cgtttgcacc accgccgccg atggtttctg gtcgatcttc 4 620 

tacttcgtgg gctgcatcac ggtgtttttc ttcctgccct tcggcatcct ggttctteta 4 680 

tacgcggcca tcgcttacaa gctgctccgt cccaacaacg ccttccaccg accaacctcc 4740 

ccgcagccac agcagccgtc cggcggagcc accagtggct cctcacaggt gcccagcacc 4 800 

aagggtaaca gccatcaaca aagcaacggg atgaggaagc atcgcaagca ggtaatcttc 4 860 

atgctggtgg ccgtggtgtc tagctttttt gtttgccttc tacccttccg ggccttcacc 4 920 

ctctgggtga tcctggccag cgccgaggat gtcgagggtc tgggcattgc cggctactac 4980 

aacctgctgt acttctcgcg cttcatgctc tacctaaact cagccatgaa cccgatcctc 5040 

tacaacctga tgtcctccaa atttcgcagc ggcttctggc ggctgctgct cacttgtctg 5100 

ggccaacggc cacatcacca ccatcgccac cactaccatc agaggcagca tccaacggca 5160 

ggcggaagtg ggcgcaatgc gtccacgcga caggaacagg atgccgagga aggagctgcg 5220 

ctggcgggaa cgaccagcgc ccgacatcca cgtcgcacac tccgccgcga ggccaccttc 5280 

ttgatcaact ccatatccac ctcctcgggt acggatcgca ccacatcatc atcggcgtgg 5340 

cgcagcaaca gtctgtccat ttccggtctg agcgaacggg agcgcggcat actgggagcc 5400 

gctatcatcg gcacaacggc tgccaccgtt acaaccgcct gtctgcagga gcgacgcgcc 54 60 

agcaagatct gaatcccatc ctaaatatgc atccttcgag tccctgtcca ctgccagcaa 5520 

ccgttaaagt atttagttga attccaacag caaatagtca tagttttaga aaaattttgt 5580 

tgtaaagact gaagtacctc agaagaagtc aataaacgca agagaagatg tttttgaaaa 5640 

tcgaaactaa ttttcttcgg gcttcgaaaa attgaagttt gcaagattcg aaatagcttt 5700 

aacttgattt ttagaacaga aacgagtatg aacttttcat acgatcattt ttatcgcacc 5760 

tattgtgatt taatgggaga taggaggaga ccaattggga tatgcaaaaa taacgcaact 5820 

attattatct ttgtatgtac gattaatttt aatctctaaa gagtcaaagt acttcaataa 5880 

ttttacaaat ctgtcggagg gaatgtaagc ttgcccaaat caaatggaag ctacaatcgt 5940 

tgtcagatgg agatatttta tgttagaagg aatatctcgg atacgaaata tttaatttaa 6000 

aattcctttt catattattc tttttaaaaa ataataagtc ctgtatgtat acaatgcaat 6060 

acataaaggt agagtatgtc ctaaaaagaa aaatatcttt ccgaatatat gtatataatt 6120 

tataaaataa gtggatacat acatttttcg taagtatcct agaagatttg ccaacgtatc 6180 

aggtttacaa aaattgatta cctatgtgca gttaacttaa cggcagtgca ctaactaatc 6240 

tgatttgtga aataatgcag atataaaaca tttttaaata gaaatatatt tatacatctg 6300 

attcttcaac atctaagcac tcgtatagag tgctttgaat atattctaaa tacattggca 6360 

tgatccaagt tttaaatcag ccaaaaactt gctccaaaaa caggaagatt ctacacaaat 6420 

cgcctcgccc ttaaagtttc tgccaagtat ttgggtgcgt gcggatcagt tc 6472 



<210> 38 
<211> 2196 
<212> DNA 
<213> Drosophila 
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<400> 38 

atgcgcaact 

ctcgtctgca 

ggccacgaga 

ctgaccatct 

ggctacgtct 

gcgctcttta 

ggatcgtcgg 

ctgatgacca 

atcatcgccc 

ccggagctta 

tccttcttcg 

gaccaaacac 

atcatgttgt 

tttcgaaggg 

acat tgggaa 

atgggcacca 

agcattctct 

aagatcgaga 

tccaccttgg 

aatcatcaac 

aataagcgta 

tccacctaca 

gccgatggtt 

cccttcggca 

aacgccttcc 

ggctcctcac 

aagcatcgca 

cttctaccct 

ggtctgggca 

aactcagcca 

tggcggctgc 

catcagaggc 

caggatgccg 

acactccgcc 

cgcaccacat 

cgggagcgcg 

gcctgtctgc 



cgacaaacat 
cgcccaccgt 
tgtgcaaggc 
tggccatctc 
gcaccaaggg 
cgagccccat 
tggccgtgtg 
tctcggtgtt 
ggaatctggt 
gtctgaaggc 
tctgcctgct 
tgcacgatct 
acctcaactc 
gtttcaagcg 
gaaggaagga 
atacgaacac 
cgagaagctc 
tgcagatgcc 
gcaagggaat 
cacgacgcca 
ggttaggagt 
gcgtggaacc 
tctggtcgat 
tcctggttct 
accgaccaac 
aggtgcccag 
agcaggtaat 
tccgggcctt 
ttgccggcta 
tgaacccgat 
tgctcacttg 
agcatccaac 
aggaaggagc 
gcgaggccac 
catcatcggc 
gcatactggg 
aggagcgacg 



cttcctaacc 
cctcgtggag 
tgtgccgttc 
gttcgagcgc 
gcgggccatc 
tctgtgggtg 
cctgacccag 
cttcgtggtg 
ctccaacagg 
ccgcaagcag 
gcccttccgc 
gggattggtg 
ggccatgaat 
gctttgtcag 
agactcttct 
gaataccaac 
caatcgcagg 
atgtggcagc 
tgccaggcga 
caagccgcaa 
ccagatatcg 
ctacggggac 
cttctacttc 
tctatacgcg 
ctccccgcag 
caccaagggt 
cttcatgctg 
caccctctgg 
ctacaacctg 
cctctacaac 
tctgggccaa 
ggcaggcgga 
tgcgctggcg 
cttcttgatc 
gtggcgcagc 
agccgctatc 
cgccagcaag 



aacctgagca 
gtcaacactc 
gtggagctca 
tattacgcaa 
ctcatctgcg 
gccgagtaca 
gccatcagcg 
ccgttcgtga 
gcggccatgc 
gtggtcctaa 
gtcctcaccc 
cgctactaca 
ccgattcttt 
gacgcagggc 
cgcgggcgca 
tcctcaaatg 
tgcagcgagg 
gacctggagg 
gtgagtgata 
ataagtttcg 
actggcagta 
ggaaccgatg 
gtgggctgca 
gccatcgctt 
ccacagcagc 
aacagccatc 
gtggccgtgg 
gtgatcctgg 
ctgtacttct 
ctgatgtcct 
cggccacatc 
agtgggcgca 
ggaacgacca 
aactccatat 
aacagtctgt 
atcggcacaa 
atctga 



ttgcggacct 
gcccggagac 
cggtggccca 
tctgcgagcc 
tcctggcctg 
agctggccga 
actggacgct 
ccctggtggt 
tgcgcgcccg 
tgctcggcgc 
tgtggatcat 
gcttgctgta 
acaacctgat 
gattactgct 
gaggtacctt 
ccacaggagc 
atatcagtcg 
ccatggccat 
gccgtctcat 
acgaggagtc 
agggtcccat 
ctcccgtttg 
tcacggtgtt 
acaagctgct 
cgtccggcgg 
aacaaagcaa 
tgtctagctt 
ccagcgccga 
cgcgcttcat 
ccaaatttcg 
accaccatcg 
atgcgtccac 
gcgcccgaca 
ccacctcctc 
ccatttccgg 
cggctgccac 



gctggtgctg 
ctgggtactt 
cgccagtgtg 
attaaaggcc 
gggcattgct 
gtacattgat 
ggccttcttc 
gctgtacggc 
tcccacgaag 
cgttgtgctg 
cctcagcacg 
cttctgtagg 
gtcgaccaag 
ggaattggtg 
gtcactgggc 
cacgagctcc 
cacccgcctt 
gctgcagcat 
gcctctgaga 
actggaggag 
catcgccata 
caccaccgcc 
tttcttcctg 
ccgtcccaac 
agccaccagt 
cgggatgagg 
ttttgtttgc 
ggatgtcgag 
gctctaccta 
cagcggcttc 
ccaccactac 
gcgacaggaa 
tccacgtcgc 
gggtacggat 
tctgagcgaa 
cgttacaacc 



60 
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240 

300 

360 . 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2196 



<210> 39 
<211> 731 
<212> PRT 
<213> Drosophila 



<400> 39 



Met 


Arg 


Asn 


Ser 


Thr 


Asn 


He 


Phe 


Leu 


Thr 


Asn 


Leu 


Ser 


He 


Ala 


Asp 


1 






5 










10 










15 




Leu 


Leu 


Val 


Leu 


Leu 


Val 


Cys 


Thr 


Pro 


Thr 


Val 


Leu 


Val 


Glu 


Val 


Asn 








20 










25 










30 






Thr 


Arg 


Pro 


Glu 


Thr 


Trp 


Val 


Leu 


Gly 


His 


Glu 


Met 


Cys 


Lys 


Ala 


Val 






35 










40 










45 








Pro 


Phe 


Val 


Glu 


Leu 


Thr 


Val 


Ala 


His 


Ala 


Ser 


Val 


Leu 


Thr 


He 


Leu 




50 










55 










60 










Ala 


He 


Ser 


Phe 


Glu 


Arg 


Tyr 


Tyr 


Ala 


He 


Cys 


Glu 


Pro 


Leu 


Lys 


Ala 


65 










70 










75 










80 


Gly 


Tyr 


Val 


Cys 


Thr 


Lys 


Gly 


Arg 


Ala 


He 


Leu 


He 


Cys 


Val 


Leu 


Ala 










85 










90 










95 




Trp 


Gly 


He 


Ala 


Ala 


Leu 


Phe 


Thr 


Ser 


Pro 


He 


Leu 


Trp 


Val 


Ala 


Glu 








100 










105 










110 






Tyr 


Lys 


Leu 


Ala 


Glu 


Tyr 


He 


Asp 


Gly 


Ser 


Ser 


Val 


Ala 


Val 


Cys 


Leu 
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115 










120 




Thr 


Gin 


Ala 


He 


Ser 


Asp 


Trp 


Thr 


Leu 




130 










135 






Ser 


Val 


Phe 


Phe 


Val 


Val 


Pro 


Phe 


Val 


145 










150 








He 


He 


Ala 


Arg 


Asn 


Leu 


Val 


Ser 


Asn 










165 










Arg 


Pro 


Thr 


Lys 


Pro 


Glu 


Leu 


Ser 


Leu 








180 










185 


Leu 


Met 


Leu 


Gly Ala 


Val 


Val 


Leu 


Ser 






195 










200 




Phe 


Arg 


Val 


Leu 


Thr 


Leu 


Trp 


He 


He 




210 










215 






His 


Asp 


Leu 


Gly 


Leu 


Val 


Arg 


Tyr 


Tyr 


225 










230 








He 


Met 


Leu 


Tyr 


Leu 


Asn 


Ser 


Ala 


Met 










245 










Met 


Ser 


Thr 


Lys 


Phe 


Arg 


Arg 


Gly 


Phe 








260 










265 


Gly 


Arg 


Leu 


Leu 


Leu 


Glu 


Leu 


Val 


Thr 






275 










280 




Ser 


Ser 


Arg 


Gly Arg 


Arg 


Gly 


Thr 


Leu 




290 










295 






Thr 


Asn 


Thr 


Asn 


Thr 


Asn 


Ser 


Ser 


Asn 


305 










310 








Ser 


He 


Leu 


Ser 


Arg 


Ser 


Ser 


Asn 


Arg 










325 










Arg 


Thr 


Arg 


Leu 


Lys 


He 


Glu 


Met 


Gin 








340 










345 


Glu 


Ala 


Met 


Ala 


Met 


Leu 


Gin 


His 


Ser 






355 










360 




Arg 


Arg 


Val 


Ser 


Asp 


Ser 


Arg 


Leu 


Met 




370 










375 






Arg 


Arg 


His 


Lys 


Pro 


Gin 


He 


Ser 


Phe 


385 










390 








Asn 


Lys 


Arg 


Arg 


Leu 


Gly 


Val 


Gin 


He 










405 










He 


lie 


Ala 


He 


Ser 


Thr 


Tyr 


Ser 


Val 








420 










425 


Asp 


Ala 


Pro 


Val 


Cys 


Thr 


Thr 


Ala 


Ala 






435 










440 




Tyr 


Phe 


Val 


Gly 


Cys 


He 


Thr 


Val 


Phe 




450 










455 






Leu 


Val 


Leu 


Leu 


Tyr 


Ala 


Ala 


He 


Ala 


4 65 










470 








Asn 


Ala 


Phe 


His 


Arg 


Pro 


Thr 


Ser 


Pro 










485 










Gly Ala 


Thr 


Ser 


Gly 


Ser 


Ser 


Gin 


Val 








500 










505 


His 


Gin 


Gin 


Ser 


Asn 


Gly 


Met 


Arg 


Lys 






515 










520 




Met 


Leu 


Val 


Ala 


Val 


Val 


Ser 


Ser 


Phe 




530 










535 






Arg 


Ala 


Phe 


Thr 


Leu 


Trp 


Val 


He 


Leu 


545 










550 








Gly 


Leu 


Gly 


He 


Ala 


Gly 


Tyr 


Tyr 


Asn 










565 










Met 


Leu 


Tyr 


Leu 


Asn 


Ser 


Ala 


Met 


Asn 








580 










585 


Ser 


Ser 


Lys 


Phe 


Arg 


Ser 


Gly 


Phe 


Trp 
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125 



Ala 


Phe 


Phe 


Leu 


Met 


Thr 


He 






140 










Thr 


Leu 


Val 


Val 


Leu 


Tyr 


Gly 




155 










160 


Arg 


Ala 


Ala 


Met 


Leu 


Arg 


Ala 


170 










175 




Lys 


Ala 


Arg 


Lys 


Gin 


Val 


Val 










190 






Phe 


Phe 


Val 


Cys 


Leu 


Leu 


Pro 








205 








Leu 


Ser 


Thr 


Asp 


Gin 


Thr 


Leu 






220 










Ser 


Leu 


Leu 


Tyr 


Phe 


Cys 


Arg 




235 










240 


Asn 


Pro 


He 


Leu 


Tyr 


Asn 


Leu 


250 










255 




Lys 


Arg 


Leu 


Cys 


Gin 


Asp 


Ala 










270 






Leu 


Gly 


Arg 


Arg 


Lys 


Glu 


Asp 








285 








Ser 


Leu 


Gly 


Met 


Gly 


Thr 


Asn 






300 










Ala 


Thr 


Gly 


Ala 


Thr 


Ser 


Ser 




315 










320 


Arg 


Cys 


Ser 


Glu 


Asp 


He 


Ser 


330 










335 




Met 


Pro 


Cys 


Gly 


Ser 


Asp 


Leu 










350 






Thr 


Leu 


Gly 


Lys 


Gly 


He 


Ala 








365 








Pro 


Leu 


Arg 


Asn 


His 


Gin 


Pro 






380 










Asp 


Glu 


Glu 


Ser 


Leu 


Glu 


Glu 




395 










400 


Ser 


Thr 


Gly 


Ser 


Lys 


Gly 


Pro 


410 










415 




Glu 


Pro 


Tyr 


Gly 


Asp 


Gly 


Thr 










430 






Asp 


Gly 


Phe 


Trp 


Ser 


He 


Phe 








445 








Phe 


Phe 


Leu 


Pro 


Phe 


Gly 


He 






460 










Tyr 


Lys 


Leu 


Leu 


Arg 


Pro 


Asn 




475 










480 


Gin 


Pro 


Gin 


Gin 


Pro 


Ser 


Gly 


490 










495 




Pro 


Ser 


Thr 


Lys 


Gly 


Asn 


Ser 










510 






His 


Arg 


Lys 


Gin 


Val 


He 


Phe 








525 








Phe 


Val 


Cys 


Leu 


Leu 


Pro 


Phe 






540 










Ala 


Ser 


Ala 


Glu 


Asp 


Val 


Glu 




555 










560 


Leu 


Leu 


Tyr 


Phe 


Ser 


Arg 


Phe 


570 










575 




Pro 


He 


Leu 


Tyr 


Asn 


Leu 


Met 










590 






Arg 


Leu 


Leu 


Leu 


Thr 


Cys 


Leu 
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595 










600 




Gly 


Gin 
610 


Arg 


Pro 


His 


His 


His 
615 


His 


Arg 


His 


Pro 


Thr 


Ala 


Gly 


Gly 


Ser 


Gly 


Arg 


625 










630 








Gin 


Asp 


Ala 


Glu 


Glu 
645 


Gly 


Ala 


Ala 


Leu 


His 


Pro 


Arg 


Arg 
660 


Thr 


Leu 


Arg 


Arg 


Glu 
665 


He 


Ser. 


Thr 
675 


Ser 


Ser 


Gly 


Thr 


Asp 
680 


Arg 


Arg 


Ser 
690 


Asn 


Ser 


Leu 


Ser 


He 
695 


Ser 


Gly 


He 


Leu 


Gly 


Ala 


Ala 


He 


He 


Gly 


Thr 


705 










710 








Ala 


Cys 


Leu 


Gin 


Glu 
725 


Arg 


Arg 


Ala 


Ser 



t 
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605 

His His Tyr His Gin Arg Gin 
620 

Asn Ala Ser Thr Arg Gin Glu 
635 640 
Ala Gly Thr Thr Ser Ala Arg 
650 655 
Ala Thr Phe Leu He Asn Ser 
670 

Thr Thr Ser Ser Ser Ala Trp 
685 

Leu Ser Glu Arg Glu Arg Gly 
700 

Thr Ala Ala Thr Val Thr Thr 
715 720 
Lys He 
730 



<210> 40 
<211> 3821 
<212> DNA 
<213> Drosophila 



<400> 40 

cagaaagtgt 

aaatatcacc 

gagccttggc 

tcgctaccac 

cgcagccttt 

caccaaatgg 

ggccaactcg 

tgttttattg 

gtttaaataa 

ctggtccagc 

ggcaaacact 

taggcaggac 

tgtacccaat 

cccttctatt 

gcaatttggc 

atggtcataa 

gagtaaaagt 

agtgaagaaa 

ttgtctacgt 

gacggaaacg 

tgcccgagtt 

tatccgcaca 

cccttcggta 

gccttctaca 

attttggatt 

gttggactca 

tgaaactgtt 

aacagtgctt 

tatggactga 

ttcataataa 

ctgttatttg 

aatgatgtta 

tcgaccagtg 

attgatccaa 

accgagcatc 



agggaatgta 
agactcggct 
ggcatgagta 
cgcccaacgc 
ttacgatgcc 
tgtcctgtcg 
tgtgtcgtgt 
ccagctgggg 
atttaagaga 
gatccgttaa 
ggagatttgt 
acttgtcagg 
tagatgccca 
ttgcgcccgg 
gggaagtggc 
acaaaataat 
tatcacgggg 
agggaaccgg 
ttgcgaatgt 
gcgccataat 
aatccctttg 
tcctttcagc 
tccgcatgct 
ccgcccacct 
tgcaacagga 
gctttaagtt 
tatgtaccat 
aaattgtaat 
taagcatatg 
taaatcataa 
ataatattta 
ctagtgctga 
atgggaccac 
cctagcgttg 
tttcgtggcg 



gggaaatgca 
cggttctgcc 
ataatgaccg 
gttgctcatt 
gaaccgactg 
ctctttctcg 
gtgttttgac 
aatgccaaag 
gagagagaga 
gccatcgagg 
ataataaggg 
cttggcgcaa 
tcattatgag 
gaagtcatat 
agcaggagga 
aaatcaataa 
gggaaaaaaa 
aaaggaaagc 
gtgtggttgt 
tatgccataa 
ttgttccact 
ttactccacc 
ggtccacaag 
ggagctttat 
ggagctttca 
cactcggata 
aaatggagcg 
aaaaagctat 
ttagaagata 
aatttattcc 
caaggaatat 
ccattgaacg 
ctgggtaagt 
tggtattcct 
aactgatcaa 



aagcgagacc 
gctcaagtgg 
aaatcagagc 
ttattaaaat 
gcggcccatt 
tcctcgtcct 
cagagttatg 
tggcaagagg 
gagagagatc 
gtccttcgaa 
gtgggtggtt 
acaaaactga 
agcaggctat 
atcgaacatt 
cgcacatgat 
acaacacaca 
tgaaggaaaa 
cattcatttg 
gtgcgtggca 
acccaaccat 
cgctaccgca 
gccgccctgc 
gatcgaggac 
ctgggcaacg 
agggtcgaag 
tactaaaaat 
taatgtgctt 
ttgttttagg 
atatttcata 
gttgggatca 
tatcccattt 
ttacgtgtcc 
cgcattgaaa 
cacctgcttg 
atgcatcctt 



ctttggctgc 
cagtaaataa 
accatcacca 
aacaaaaat.a 
tgttatttgg 
cgtccttgtc 
cagctgcgga 
cagattgccc 
tggcacgaag 
tgaaatgcca 
gtgccgaaat 
catttatatt 
ctctgctctc 
aggctctgtt 
attagtaata 
cgcacacgca 
atgggaggca 
gccccgcgac 
ggacttcctg 
aaaggacatt 
cccatctctc 
tggccatcgt 
agtgggagga 
gctgtttggg 
ctcttctaaa 
atatataaaa 
gatttaaagg 
agctagccag 
catttagata 
tttaatcctt 
gcactcctag 
gtttgccatc 
gtctgtagat 
gccacggtaa 
ggatcgagtg 



tcactgactg 
aagttgtgag 
tcaccaacca 
ttttgccgct 
cggtcattgc 
cgttccactg 
tgggcaaaaa 
gatcttagct 
cactccatca 
gcgctgacaa 
ttggcactga 
gatgtgtccc 
gtgtccttcc 
tgtgtgcaga 
atgtaattat 
ggcggcacac 
gagcagagga 
tcaagtgccg 
ttgccacatg 
tttatagatg 
atgtcctttt 
gttcgccatt 
gttcggccca 
tgagtctagg 
aggttcaaaa 
caacacgtta 
aacagatgtc 
acattattag 
cgcctataat 
gaacagatat 
gcgttggcgt 
caggatttgc 
ggtctattta 
tcgtctatct 
acgtatatgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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atatttgcga 
catgctggag 
catcatgatg 
gcatgcccag 
acatggacac 
gttcctgctg 
tctccacatg 
cagcgccaat 
cagcgaggat 
gttctgccac 
agcggtagcc 
gactctcaca 
aggtggacag 
gtagcattta 
gagatccgaa 
ccaactccaa 
tgccctttct 
aaaatgatat 
gcgtatataa 
tttttttttt 
ccgtaattgg 
ttgcgccgaa 
tcctcaagcc 
cgtattgata 
gcgacaggac 
gaacaatagc 
cttatatata 
tgtatacacg 
atagcattta 



cgcgacaaca 
gtgatcttta 
gtctacaggc 
ggacatggtc 
ggctatttga 
ctcggcagca 
acgatcgcct 
ctgctggagt 
ttccgcaaca 
caggccgagc 
ggaaccggaa 
cccgcagaac 
gggccatcaa 
gacaatgcct 
ctccgaacac 
aaccgtcatc 
ctcgcttgag 
acgataaaca 
gtgtctgtgt 
tttagcacta 
tgaaataatt 
tgatttcacc 
acacgtatag 
accaaagagc 
aacgacagac 
gagagagaga 
cacttgcata 
cgtaacttta 
agtgcgcgca 



ctatctacca 
aactggtgcc 
ggacctgcga 
acggacatgg 
aggacgacga 
catcgatttt 
cggaggtcta 
tgattaacta 
ccctggtgag 
acgaggtgag 
atggccacgt 
ccgacgaacg 
aagcaccaca 
accacccaac 
cgagaaccct 
accacttcca 
atcctgcgac 
actaacgatt 
gtgttcctta 
gaattaggta 
gatatcgata 
acaatgtagc 
atgcagatat 
tcgtttactc 
agacatattg 
gagaaagagt 
tgcctatatg 
agcgaatata 
agaggaatcc 



gcagaccatc 
cactttggtg 
gcgccgccgc 
gcacggacat 
tccgcgaaag 
gttcctggtc 
tccgagcttc 
ctcgctgacc 
gacgatcaag 
tgccagtcca 
ttctattttt 
gccccggtgt 
taactaccac 
aacaacacca 
cccagcccag 
cttacccccc 
atttgaacgt 
aacgataatt 
aacatatata 
cttaaaaacc 
ggtaaaacca 
ttgcggaatg 
gatagtttta 
tagcattaga 
acatatagac 
tttacgatga 
tgaatgatat 
tttatatatg 
ggattccaac 



ttctatcgcg 
atcggtggcc 
aagatggtgc 
ggtcatggac 
ttcgccgagg 
tgcgtctcgc 
cccttccagg 
ttctacatct 
tggccctggt 
ccggccacag 
catccggcaa 
gccaatggcg 
ctcgggcctg 
ccacaaccca 
agtaacccca 
aggtcgtcga 
gttcaagtat 
gatattcagc 
tttaaatata 
aaacttaatc 
gttcgacttt 
tgtatgtcgc 
tcaatggata 
tgcagacaga 
aggtatagac 
taatatgcag 
ttatgtaaac 
catatatgta 
c 



tctacaagat 
tcaacatgcg 
tcagtcgtcc 
atggacatgc 
agcggcgctt 
cgatggccat 
tgttccgtgc 
actgcctgtt 
tgaagggcaa 
ccggaaccgt 
taccggcgct 
tgcttcacta 
ggacgatcga 
tccgctactc 
ccaccaccac 
ggttaaaggg 
cgatgaacga 
gccacgcatt 
tggataaaca 
gaaaacacat 
cgatcagcat 
aggatcaaga 
gattgcttgc 
aacaggaaca 
aggtgagatg 
acatacaata 
gacgtattta 
tctattgatt 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3821 



<210> 41 
<211> 1158 
<212> DNA 
<213> Drosophila 



<400> 41 

atgggaggca 

gccccgcgac 

ttcgccattc 

ttcggcccag 

gttggcgtaa 

ggatttgctc 

acggtaatcg 

tcgagtgacg 

tatcgcgtct 

ggtggcctca 

atggtgctca 

catggacatg 

gccgaggagc 

gtctcgccga 

ttccaggtgt 

tacatctact 

ccctggttga 

gccacagccg 

ccggcaatac 

aatggcgtgc 

<210> 42 
<211> 385 
<212> PRT 



gagcagagga 
tcaagtgccg 
ccttcggtat 
ccttctacac 
tgatgttact 
gaccagtgat 
tctatctacc 
tatatgtata 
acaagatcat 
acatgcgcat 
gtcgtccgca 
gacatgcaca 
ggcgcttgtt 
tggccattct 
tccgtgccag 
gcctgttcag 
agggcaagtt 
gaaccgtagc 
cggcgctgac 
ttcactaa 



agtgaagaaa 
ttgtctacct 
ccgcatgctg 
cgcccacctg 
agtgctgacc 
gggaccacct 
gagcatcttt 
tttgcgacgc 
gctggaggtg 
catgatggtc 
tgcccaggga 
tggacacggc 
cctgctgctc 
ccacatgacg 
cgccaatctg 
cgaggatttc 
ctgccaccag 
ggtagccgga 
tctcacaccc 



agggaaccgg 
tactccaccg 
gtccacaagg 
gagctttatc 
attgaacgtt 
ggcgttgtgg 
cgtggcgaac 
gacaacacta 
atctttaaac 
tacaggcgga 
catggtcacg 
tatttgaagg 
ggcagcacat 
atcgcctcgg 
ctggagttga 
cgcaacaccc 
gccgagcacg 
accggaaatg 
gcagaacccg 



aaaggaaagc 
ccgccctgct 
atcgaggaca 
tgggcaacgg 
acgtgtccgt 
tattcctcac 
tgatcaaatg 
tctaccagca 
tggtgcccac 
cctgcgagcg 
gacatgggca 
acgacgatcc 
cgattttgtt 
aggtctatcc 
ttaactactc 
tggtgaggac 
aggtgagtgc 
gccacgtttc 
acgaacggcc 



cattcatttg 
ggccatcgtg 
gtgggaggag 
ctgtttgggc 
ttgccatcca 
ctgcttggcc 
catccttgga 
gaccatcttc 
tttggtgatc 
ccgccgcaag 
cggacatggt 
gcgaaagttc 
cctggtctgc 
gagcttcccc 
gctgaccttc 
gatcaagtgg 
cagtccaccg 
tatttttcat 
ccggtgtgcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1158 
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<213> Drosophila 
<400> 42 

Met Gly Gly Arg Ala Glu Glu Val Lys Lys Arg Glu Pro Glu Arg Lys 

15 10 15 

Ala lie His Leu Ala Pro Arg Leu Lys Cys Arg Cys Leu Pro Tyr Ser 

20 25 30 

Thr Ala Ala Leu Leu Ala lie Val Phe Ala lie Pro Phe Gly He Arg 

35 40 45 

Met Leu Val His- Lys Asp Arg Gly Gin Trp Glu Glu Phe Gly Pro Ala 

50 55 60 

Phe Tyr Thr Ala His Leu Glu Leu Tyr Leu Gly Asn Gly Cys Leu Gly 
65 70 75 80 

Val Gly Val Met Met Leu Leu Val Leu Thr He Glu Arg Tyr Val Ser 

85 90 95 

Val Cys His Pro Gly Phe Ala Arg Pro Val Met Gly Pro Pro Gly Val 

100 105 110 

Val Val Phe Leu Thr Cys Leu Ala Thr Val He Val Tyr Leu Pro Ser 

115 120 125 

He Phe Arg Gly Glu Leu He Lys Cys He Leu Gly Ser Ser Asp Val 

130 135 140 

Tyr Val Tyr Leu Arg Arg Asp Asn Thr He Tyr Gin Gin Thr He Phe 
145 150 155 160 

Tyr Arg Val Tyr Lys He Met Leu Glu Val He Phe Lys Leu Val Pro 

165 170 175 

Thr Leu Val He Gly Gly Leu Asn Met Arg He Met Met Val Tyr Arg 

180 185 190 

Arg Thr Cys Glu Arg Arg Arg Lys Met Val Leu Ser Arg Pro His Ala 

195 200 205 

Gin Gly His Gly His Gly His Gly His Gly His Gly His Gly His Gly 

210 215 220 

His Ala His Gly His Gly Tyr Leu Lys Asp Asp Asp Pro Arg Lys Phe 
225 230 235 240 

Ala Glu Glu Arg Arg Leu Phe Leu Leu Leu Gly Ser Thr Ser He Leu 

245 250 255 

Phe Leu Val Cys Val Ser Pro Met Ala He Leu His Met Thr He Ala 

260 265 270 

Ser Glu Val Tyr Pro Ser Phe Pro Phe Gin Val Phe Arg Ala Ser Ala 

275 280 285 

Asn Leu Leu Glu Leu He Asn Tyr Ser Leu Thr Phe Tyr He Tyr Cys 

290 295 300 

Leu Phe Ser Glu Asp Phe Arg Asn Thr Leu Val Arg Thr He Lys Trp 
305 310 315 320 

Pro Trp Leu Lys Gly Lys Phe Cys His Gin Ala Glu His Glu Val Ser 

325 330 335 

Ala Ser Pro Pro Ala Thr Ala Gly Thr Val Ala Val Ala Gly Thr Gly 

340 345 350 

Asn Gly His Val Ser He Phe His Pro Ala He Pro Ala Leu Thr Leu 

355 360 365 

Thr Pro Ala Glu Pro Asp Glu Arg Pro Arg Cys Ala Asn Gly Val Leu 
370 375 380 

His 
385 



<210> 43 
<211> 14111 
<212> DNA 
<213> Drosophila 
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<400>' 43 

aataatccac 

aacccctaag 

ggtatctatc 

cgcattcctg 

aatggaaacg 

gtggagcagg 

atggcgcctt 

cttctgccgc 

ggaaacgcta 

ctccatgtag 

ctgctggcag 

gaggaactga 

gtcctgcttg 

cgcgttgccc 

ccgggtgctg 

tggaaaaagg 

aaggccttca 

cgcagcagct 

agtagagtat 

ccttcttgag 

agatgacata 

tgatgacctt 

cggggatgga 

acatgggcag 

tgcccggggg 

cgtcggagaa 

tcaccagcgc 

gcttcagggg 

tttaccaagt 

cttcagatgc 

gtcgagtcga 

atgcaaatat 

taagcctttt 

cctgctcctg 

gcttgccata 

ggggccaaca 

cccgctcagt 

ggcccgaaac 

tcgcctgcgt 

gcggccatgc 

taacaatgtg 

caaacaaagc 

aaccaattgc 

cttaattgag 

gctggtgcag 

attaactaca 

tatgagttct 

ttcaattgct 

gcccatcaat 

ttagcatatt 

gcgcttttaa 

tgtgtttctg 

cccatacgca 

cgtaaagaca 

gcaatctagt 

ggcataagta 

gggcattacg 

ttaaatttcg 

gataaacgga 



cttcgaaggt 
gtagttcatc 
taccagagtt 
cgaggagaag 
tggtgggtgg 
ccgcgtcgct 
tgctctggtc 
ttatcgagca 
tccctgtcca 
gacacaattg 
aggggtcgcc 
cgcgatccgt 
agcatcatgt 
ttgctgctcg 
gtggtgctgg 
gtttttatcg 
catattttat 
gcgcgacttg 
ggggttgatg 
gatctcgaac 
gagcggcagc 
caccttgctc 
cctcgtggag 
caggtagcag 
ccacacctcc 
gacctcctcg 
gattacccat 
ccaccagatg 
agtcaaagaa 
cactgacggc 
ggcgacgatg 
ctccgctgat 
tctgctgagt 
ctcctccgcc 
atgaattgta 
acgagttcag 
ggtcatagac 
catgtaatta 
ctgtccgcca 
caggccacta 
gccgagaccc 
tgtccgcaaa 
atttgcataa 
ttctttttgg 
actaattcaa 
cttttgatga 
cttatacgat 
gtcggaaaat 
tccgttgagt 
tattaagaaa 
agcatttaat 
tctagatcca 
ctgttgacat 
caactatgtc 
tatatacata 
taaatataag 
ggctctaaga 
attcagttta 
catcaatgac 



aaaccatctt 
ccacttgttc 
aggagctcgt 
tcgaactgcc 
gtgcatcacc 
ggtcggcgaa 
gctggtccac 
gagagctcgg 
ccagtgtgtc 
agtcctgctt 
ggatcccaga 
gagagtcctg 
tgtggcgccg 
gactcttcct 
aggcgatggc 
aagggaacac 
caatatattt 
atgatggccg 
cagctgttcg 
tcctcctgcg 
caggacaaaa 
ttctgctgca 
accttgatcc 
gccaccaggt 
tggcagagga 
gcgggcacca 
atgccgatta 
gcaatgaacc 
ggagtcttcg 
tttgagccgt 
acgatgacga 
taattcgcac 
gttcgctttc 
ccaaaaaagc 
atttgaataa 
catttttcat 
aagacttgtc 
cgtgcattgt 
aaacatttgg 
ccccagaatc 
tcatctctgc 
gccaataatc 
tttgcccatg 
gtaaaaacag 
actaaacctc 
tgtaaaccat 
ttatgtatga 
taaattatta 
accatatgcg 
aatcttgcca 
aagccgcaag 
cgtggtttta 
ttgcgacgca 
tataagcgag 
tacgtattca 
tagaagtaga 
ccgtggagag 
cgatggctcg 
gctggactcg 



aattatgctt 
aactgaaatc 
ttaacccaat 
gtagctgagc 
cgtccggttg 
ggagatgacc 
aaacgagatc 
tgtggcgccc 
gttgagaccg 
ggacagggag 
agcatcactg 
cttaagcagc 
cagcgagttg 
cgagctgttc 
cacgttgtcg 
gatttcttac 
gaggttagaa 
cgaatccgcg 
aggagcccag 
agatgtccga 
caaacaggat 
tgcggtccat 
agatgagcac 
tggctagcag 
actgcggctg 
ggtcgaagaa 
tcatgatgcg 
tgcaagaggg 
taggagcagc 
caaggataat 
caccgtttca 
acgcaggatg 
caatctcgac 
ggcgctgcgg 
gagctgcgag 
ttagcttagt 
ccggacgaac 
gttcgtattt 
cagataacac 
ctattcctca 
gaaaaaaaaa 
aaattgtcat 
cgacttctat 
cggagtcgga 
taatttgatg 
tttatgagct 
tttgtttcac 
atattatttt 
taaatctcta 
caatcaactt 
tgtattaatt 
tggctcctat 
aagaagacac 
tgtgctttct 
gacaagaaaa 
agtataagca 
atttaagttt 
taaaaggaga 
gacttcgact 



taaaattgct 
gacccacaca 
attgcctatt 
tgcgagtcgg 
gccatcagcg 
gagtcctgct 
accgagtcct 
tgtcgccggc 
tgtcctgccg 
accttggagc 
caggagctct 
atctgctgcg 
tgctcgtaga 
tggtggtagt 
tagtacctag 
aaatcttcat 
attaactaac 
tcggtacttc 
ccactgggcc 
gccgaatttg 
gaccacggcc 
ttgcgcgtcc 
gtagcagagc 
gaagtagagg 
cgagtaggcc 
gagcagccaa 
ggcacgtcgc 
gagtcccaac 
agcagcagca 
ggcaatcatc 
actaggagga 
ccgatgccag 
tattgaaccc 
atcctgtcat 
gtagtgagac 
gcagccggct 
gaaactggtt 
tcgacttgga 
agcggtaatt 
ccgactaaaa 
aggtgggccg 
aacgtttatt 
ggtcataaat 
aaaagtaatc 
ttgaaatttt 
ttaaaaacta 
aaataaactc 
ggtaatcacg 
aggacgtcag 
aaacaaagta 
taatatttgc 
ttcgcttcct 
aaaagccgca 
tatagacttt 
gaccctaatt 
agtgataagc 
catgttacag 
gaataacagc 
gggacttgga 



ggattatata 
aatggtggtt 
cgtagagatc 
tcttcttcag 
agtcctgttt 
tcaccagttg 
gctttacgaa 
ccacatccat 
acctgcgtgc 
cattgctgtc 
cctgcttgat 
acaggttgct 
tgtaggacac 
gcgatgactt 
gtggttaagg 
atcaattttg 
ctgaggcgtc 
ttgttcactg 
accggcatca 
atccgcgcaa 
accaacatct 
ttggactcgc 
gtgatcaggg 
ttcccatccg 
gagaccaggg 
ggaatggtgg 
tttgtcatct 
aaaggggatt 
ggagctccgg 
aagctgtcga 
gcagctcggc 
cagccatcct 
gcctgctact 
tttttattct 
gagtcggtat 
cattgtggcg 
gtctgctcct 
ccttgggttg 
ccattagagg 
atagccagct 
cctgggcgac 
tataggagca 
caggcgagag 
agcagttttc 
gttaaatcgt 
aaatttctct 
tccttataga 
ggtaattgtt 
ttcagtattg 
gttaatactc 
cacacaatcc 
actgcggtta 
atgccagcac 
atgattataa 
aaatgacagt 
taagtcgaag 
ataataaatc 
aaaacaactg 
agtggaagtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 



63 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 PCT/US01/09341 



gaattggaag tggatgagga tgtggatgtg 
gtgggcggag gctgacatgc gggcctgtgg 
ctttgtcccg ctggctgaaa aatgacagtg 
tcctctgatt gtggccagaa attgatataa 
tattgtcctt gcggctatgg ctagcacaat 
tgttgaccgc gacatttgcc gaatttattg 
agggctgaca catcgattcc tggcggccaa 
cggccaacaa cattctaagg caattgaaat 
gtaagcagcg cctaatacgg acaaggtgag 
agacgactgc tccactgact ctcctacata 
cagttttccc cgcagctgat tgcattgagt 
caaatccact ataaaagccg cactcggagc 
cattagccta gctacctttc ctacttcagt 
ttcaaagtag tgagtatcat tgtgcagaag 
tatcattatg atattatatc agtttgctct 
agtttatatt acaactcata tgagtcacct 
tcagaatgag taatatatgc tttaaagtct 
gcagccacct gctgctccgt ctaatgccac 
ccgcagctgt tcgtgctcgc cgccttcgtc 
gtggccgtgg cacccgtggt ggcgcacccg 
ggagcccact cggtgcactc gtaaggccat 
aactcacccc tttcattttc agccatgttg 
cacccgtcgt ccacaagccc gtggtggcgg 
ttccagtgca tcatgccgcc cccgccgtcg 
tcactttcaa tacctttccc gctcccgatc 
atcccttcct gagcaccata accacgatcc 
agcgagtgtt ttcaacttac ctgtccaggg 
ccacggagac accttgtatg tagggcacaa 
ctgcgagttc cagcagttcg cgaaacatga 
attggtcata aacaacgcac aaattatgct 
gtgtcctagc catacactga aggaaatgtg 
tgcagatttt tctaaaatac cccaaggaca 
ccacgaaaat caaagcaaat taatgtggtt 
gggaaatatt cgaaatagta gatggtgtat 
actgcttagc taagaaaaga gtgaaaatat 
ataatatatt tcatatctta cttttattct 
attgagttct ttatagcttt actgcttttc 
ggttgtggtg atggtggtgg tggtgatagt 
ggttagcgtt caatgcttct tgtcattgcc 
ctgagattta tgtgctgcat atcattcata 
ccatcgtagt gggtggggca gaatgtggag 
cccagactca aagccgcggt ggaccgcagg 
tccattttgt ggctggatga tgatggtaaa 
tgagcgttat gaattttatg ctaagaggca 
catacataca tgggcgtgca atgggagggg 
gagaaaggcg cccggttcat gacatatcga 
acacatcatt agcagttagc tgtttggcag 
ttcaatgctc gaccaggaga cgcctcctgc 
cgcctgtctt atctctattt gttgtcgctt 
aggcagccca ataaaagcta actttaatat 
ggcgctgtgg cgcttcgtgc aagtgaactc 
aggatcatgt gtgtctgctg ggaggacata 
gtgtgtacac aggacagcag accgaattgg 
ggctttcggg cagacctgca agtgcaacag 
tttggccact cacttgatat tatgctgata 
ggagctcttg actcggcaaa ctttatgctt 
attcggttcc gaaatgcttc atcacttgcg 
actttttttt ttttttaaat acatgttctt 
attgtgtatg caattctatt aggacaggca 
aagtttgcat cagtgcactt tggcagtcaa 



gatttggact cggttgggtt ggcgttaagc 3600 
gtgaccaaac gatcaagtga ggcgctatgc 3660 
gcaaaaacag aagcttctgg ccaaaactcc 3720 
atgaaagtgc acataaagcg acgggcgtct 3780 
ttgagggact tcagcatcac ttgataaatc 3840 
cttttttata ctttatgggc caggaaacgc 3900 
tcatcaatgg tatcaatatt cgcggtatcg 3960 
gcacataaaa tgagaaacat gaaattgatt 4020 
ccatagaagc atatgatgat cttgatcttg 4080 
actgcctaac tggctaactg ccttactgac 4140 
ggcgctcact ttcggttcgg tcgcctggtg 4200 
tcaagttgtc accagttgac aatttgaaat 4260 
tatcccgcca aaatccaacg cctcaaaatg 4320 
aagttgtaac cagtttgaat ccaagcatcg 4380 
atagcattta aaaataaatg caataaatta 4440 
agtttgttgt taggattggt cttatcttac 4500 
gcgacaacta accaaattca ctgtggagtg 4560 
taatcaggct aattcgtgaa ctctgatccg 4620 
gcctctcagg ctatcgccca tcccggcgtg 4680 
gcggtggtcc acacgcccat catccatcat 4740 
tcatgtccaa tctgcttatc accaccgaac 4800 
ttcatcatcc ggcagccgtc aaggtcatca 4860 
tgcatgctgt caggccggtg gttccattgg 4920 
tcgtgcatca ttaagtgcag cgaatacggt 4980 
ccgttcccga tccccaccag acccagacca 5040 
cacaataaaa gttcatagcc acgtaattgg 5100 
aaacggcaat caggctgtag accgaggctg 5160 
acttgcacat cagccatccg agcatccagg 5220 
aagggggaaa agtgcgggag gtgagtaaat 5280 
gagtaacttt cggtcacatt gtccttgctg 5340 
agggcctatg gcaacaaaaa aagaagaaga 5400 
ctttgctcga aaactttata aatatcctaa 54 60 
aaccgaccga aggatgctta ttgttctcat 5520 
cactttaaat acaaaccgta agtcatcttg 5580 
tataaaacta ataattctcc gctaagtatt 5640 
tgaataagta cagttttaca gctggggatc 5700 
cccgagtgca ctggtagtgg ttatgctcat 5760 
ggtagtcatg gacagttggg ttcgagatct 5820 
gccgcccacg ttcccgtcgt tgtttgctga 5880 
atttggtgac aagtctcaga ttgcgcccag 5940 
tcctgggggg cagagcattg tttgcccaga 6000 
actgcctaac ttaatttgta ttccttttac 6060 
tggaatacac gggccctact tgcgcatagt 6120 
taactcaccc agacacaaaa ggttataaag 6180 
caacttccca gggttgatga cagtcgggcg 6240 
atggcaatta aacaagtgca agtgcggccc 6300 
tttggcaatt tgcagaagga gtttaccgag 6360 
gtcatggggc tcctaatttc ctgtctcgtt 6420 
tttattatgc agacgcaccg taagccgcca 6480 
gatttggcac ataaaaatat aaatattatt 6540 
tgtggctgtg gacaggtttt ggttctggac 6600 
cacatcctcc cgcctggcaa acaatcaaca 6660 
tttttcggtt ttattcctcc gatttgaaat 6720 
actggatccg aggctattaa tgcgagtgta 6780 
agggtgaact ttgatatata gctgtctgcg 6840 
tcaagttgtt tgaaaacaat ttaaattcaa 6900 
ggacaccgga ttagcagtgc cactgggtga 6960 
ccaacttccc ttcggcaaat aaactatcta 7020 
aacgtgactc ggttggtttg tctgggatta 7080 
gtgtcttaac caagtttaca ttgtggttag 7140 
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cttagcgcag 
ggaaagtatt 
cgctgatttg 
aagctgacat 
attaatgaac 
ccagtagtta 
gagtcattcg 
caaacataat 
cctccttgtg 
cggccaacat 
agtgcgtttg 
tgctcaatat 
cttcgattgt 
aagtgcgccc 
tttttcggtt 
atttgataat 
atttggaaac 
taaacatttt 
tttttatccc 
cgggcattgc 
taatgaaata 
ggattgcagt 
tggcaaatgc 
ggcagctctt 
cggagtggga 
tgcgggagta 
cgacggcacg 
ggatagggaa 
ccggcacaaa 
caacgggaga 
gcacaacaat 
cttcgtctca 
caaagcacac 
cccaccaccc 
tacaggtgct 
gcacactcgc 
taggtcaatt 
gaggcttgaa 
gccagctgcg 
tcattttggt 
ctatcagaca 
gcactgtgct 
gactttacac 
ctgacgaagt 
tggattgttt 
tgtttagttg 
tgaaaccgtt 
ggggttgaga 
atgccagcgg 
agctgaagct 
tccaaacaac 
cacccccagt 
ggtcagtttt 
cacatgcaca 
gagagtgcgc 
tgcaggcgga 
gcactcggca 
ttggtatggg 
aaatctgata 
gttaataaat 



tcaagtgacc 
ttgacaggct 
gatatttaat 
ttaaaattgg 
cgcaaaaacc 
tttatagata 
aaaccaatct 
tgtagctacc 
gcattaccca 
ccagcctccc 
acatggcaac 
tgatcaacta 
gggtggtggg 
taagcctttt 
tcggtatata 
ttgcctgcag 
aaatttttct 
ctctactgtg 
caataccgtg 
ggattcgttt 
atcaggcgtg 
ggagtggagt 
gttattgtta 
cgggaaaatt 
ataggaattg 
aagaagctgc 
tagagactaa 
acagaaatgc 
tcacgtggcc 
tgctgatcta 
ggccataaag 
aggcggcatc 
acataaataa 
acaggccacc 
cccacacccc 
aagtgtgtgt 
catccagccg 
ggtgtgctta 
actgcggcgg 
ggcaaaacaa 
gcgatgtgaa 
acacagggag 
tgctaatttg 
tcaaaaacat 
aatgaaaatg 
gcaacacatg 
gccttcgttt 
gttgtgggga 
gagggctgca 
aagacatgcc 
catacaacca 
gacccagtaa 
gtgtcgatga 
gtagcgagca 
agtgacacgg 
ggtaaactca 
gccttcgttt 
gtgggatcga 
agtcaactgc 
aatctgaggg 



acttgataat 
gaggcattca 
ataaaaatgg 
tttttgttaa 
gcattatata 
ccacaagctt 
ttaattaacc 
•gaaataacct 
tttgaggact 
cgtcaaggtg 
atatggcaaa 
gaggagataa 
tggtgggctt 
attttttagc 
tgacttttag 
aattaaattt 
caaggacaac 
ctttatgagg 
ggtgattgtt 
cggtggcggc 
acaaatccgc 
ggagtggagt 
tgattatcgc 
gaataccaca 
gaatctggga 
agatcttagg 
attaaaatac 
gggagagtct 
ccctaatgaa 
aaaagggccg 
cgccatctga 
gttgctggga 
gggccagcgc 
agccaggaac 
acgcatacac 
gtgcataacc 
caaagagcct 
gataatggcg 
cgacgtcgtc 
cccgcccact 
atgctcgatg 
aaaaggtttt 
attatgaaga 
ttaagtttaa 
acgtgttaaa 
ttttttcccc 
taacttcagc 
tgaagaggtg 
atttgttgtg 
aagcgaaaat 
tctagcccac 
gagctgtcgg 
tcctttgggc 
aacacatggg 
acacaaaaac 
ttcacttttc 
actacatttc 
gtgcagtgcc 
atctgcatat 
gaagaggaac 



cccgaacaaa 
gcacggttca 
aaaaaccaca 
cataaaacga 
taaaaaaaat 
aggtaagaca 
ccaaccttaa 
aattgccttg 
caagtgaaac 
agaatgggat 
cagacgcctc 
taccaatcga 
gtgcgcccca 
ctcttaccct 
aaatgtgatt 
aacgcggtta 
tttaaaacgc 
ctttaatagc 
cctatgtagg 
attgtcatgc 
aggtttggac 
ggccatggaa 
cgcggcgtca 
gctagttcgc 
atcggtggtc 
tggagggctc 
aattggcgag 
gcctccaagt 
caaactggca 
taatgataac 
gacaattgcc 
aatcaattca 
cacattccga 
cacccacaaa 
acacacacac 
ctgactatta 
tgaatgcaaa 
ctgccagcgg 
tttgacggct 
ttgccactcg 
tacaaatatg 
gttagaggag 
atattcgtaa 
agccaaaaag 
cttttgtttg 
tgcattagtg 
aggttgtcga 
aagctgaaat 
actggcgttt 
tgcggagtgg 
ccaatctacc 
ccctgcgccc 
caccccgaaa 
cgtgagagcg 
ggagacacgg 
gccactcgct 
ccccacagtg 
gagtctaaga 
ttgttaccag 
taaaactatt 



caatggtttt 
acttattaaa 
ttttacagaa 
ttgagaagga 
caaataggtc 
tgattatttc 
ggccacatta 
cgtgacgggg 
caaacactcc 
gggtgtttga 
catttatacg 
tactcaggct 
tcagccctta 
ccttacaacg 
caactgaaaa 
tttatgaata 
gttatctttt 
gccaacaaag 
caaagtggat 
acggagtttg 
ttgggagtgg 
aaggcataac 
cgtcctgcgg 
agagaaatgc 
ctcgatgtga 
agaggttgga 
ctgaaaaatg 
tacgcatacg 
acgtgactga 
gattccccat 
gttgctgttg 
tttcgcattt 
ccaacaacca 
gagtacacgg 
accctcaacc 
ggtttgccaa 
caaccttcca 
cgatggccgt 
ggaatttgat 
ctgggggcga 
ctaaacttcc 
agcaaaggat 
gagttatagc 
tgtatatttc 
taaataatac 
ctatttcacc 
aagtgaatag 
gagccacccc 
ttgctaggca 
gcggccaaaa 
atccaccgcc 
agcttgaatt 
agcagccacc 
ggcgagagag 
acaccgatac 
tcattcgttt 
ggcttctctg 
gtagagctgc 
tttgtaatat 
tttattatta 



cagttca'tat 
ttgtccaact 
ccaacgagaa 
ataatacata 
aagggttatg 
aaggaacacc 
atgtgttgta 
ctaacccgaa 
acagtttgct 
tccgacttgg 
cgaaagctgg 
tatcgccagc 
agaaaatgtc 
agagcctttg 
tatttacctt 
atgtcccctt 
acagcaacaa 
ttttgttaca 
tggattattc 
atcgtaaagt 
gaacgggcgt 
ataacacttc 
cgacttctgt 
gcctgggacg 
gatacgggat 
tgccatttgc 
acttcatcag 
ccgcgttcgc 
aagatttagc 
tgataaaata 
caaggatatc 
tattggattt 
ccacccacca 
cccaacccaa 
acacacactg 
gcgaaatatt 
ttgcaattct 
gggcgtcgct 
ttcatataat 
gtagtgggaa 
gaactcccac 
taagtggtat 
atttcatcac 
tgaaaacgtg 
gctacacatg 
gttgccgctg 
gaatgtaagt 
aaaagcccca 
tcggcggctc 
gggtgcccca 
agccacccac 
tgtgaaatta 
aatgcaccta 
agagagcgga 
agatacacaa 
ccagcgattc 
ggggagtgta 
attggaaaat 
taatattcag 
ttattattaa 



7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 
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tttaatttaa 
tacttgtggc 
tattgtttta 
aagaaaagga 
cgagcctacg 
ccatcgaagg 
ttgatttacc 
ccaacctgct 
gttcgtttcg 
gacggcagac 
gaaatttatt 
ggtcgtctct 
catgcggatt 
aattgagtgc 
atgaattctt 
aataactggc 
tattaaattt 
ttatgttctt 
aatgcagctt 
atgaataaaa 
atttatgccg 
attacgcata 
ggctttatct 
caaaaaaaaa 
cgtcggcatc 
gctggcaggc 
gaatgtaaat 
ccagattcac 
cgatcacgaa 
agaccatgga 
cgtccagctg 
agatcaagct 
tggcgtctgc 
tggccgccaa 
tgagaaccgt 
atgagcccaa 
aactcaagag 
gggcggcggc 
gcatgggtac 
gcttgcgaat 
tggcactttc 
gaatctggta 
ttcgttggct 
taattaacac 
cgaagttgca 
cgttttcgtt 
tgcgaatccg 
gccttggctg 
aggccagatg 
gcattgggca 
agggacatgg 
gttctgccat 
tgacctcaaa 
gcgcataaaa 
ccaaaaaatt 
ttttgtgcgg 
cacacaattc 



tttaatatta 
aaagaaatgt 
cgagtatgtt 
gagttctcac 
acctggcatt 
caaattgcag 
aaggatttcg 
caccttgtcg 
aatgcaagcg 
agggacagat 
taaattaaag 
gggatttggt 
gtcgtttcca 
gaacatccgt 
ctttttttct 
ttttgtcttg 
tgatactaaa 
cttgtgtatg 
acaaataaac 
tcccataatt 
acattatgtc 
cgcagcgtaa 
aaagcggcga 
cttcggcttc 
ttgcggaatt 
agaagacaat 
tttatttcaa 
aatgaagtag 
gctattcccg 
catggccagc 
ctcccaccgg 
ggtggtcgcc 
gatggccagc 
ggtggctagc 
gcccagcaga 
ggagctgaca 
gttctgcaaa 
ttcctccatt 
actagtcgtc 
ccgcccgctg 
actggcgcct 
tctcgcttgt 
tgtgtaggat 
ccatgtccat 
caataaatgc 
tttttttttt 
agtgcggaat 
gcattacgaa 
ctgggtgcct 
ttgggcattg 
tcaccgccac 
tgtttggctg 
aaatcgacac 
attccattaa 
gccagtaagc 
ttccatggga 
a 



ttattattaa 
agtaacacaa 
gacaactctt 
agggctgagc 
gggatttggg 
tgaaggatgg 
atcatttttc 
tccaggtctg 
gggaagttgc 
agatagatag 
catacaatta 
ggaattggcc 
agaaggtaag 
tgctacggaa 
ttgctgcgta 
gcagtctatt 
gcttggcatt 
tagtaaaaat 
ggaaacacag 
tattattttc 
taatgagggt 
gcacgcagga 
ttaattgcag 
cccctagcag 
aaagagatac 
gaccaggatg 
attaatgggg 
ttcgtcaccg 
atgagtccca 
gagtgacggt 
agggcactgt 
ggcgataggt 
gacttgcccg 
agcgaactgg 
gtgtccgtaa 
gcagtcccac 
tcgacgaaat 
gtctccgcga 
gtactgggaa 
gctgccatca 
cgctttcgct 
tttagaaaag 
atgcaggaag 
acaatatctc 
agttttctag 
ttgattgaaa 
gcaatgcatt 
aaagtcggcg 
ggataaatgc 
ggcatcggac 
agttgcaaca 
ttgtttttgg 
caaacatcga 
gcacaactgc 
attgaaacaa 
gtgtccacat 



ttattttaaa 
tttgttaaac 
attttgccag 
cacttaattg 
actagggacc 
agcttgcatg 
catgtcagcg 
cgactatcaa 
ggaggggaaa 
atagatggac 
tgcgtagcct 
tctaattaaa 
ggctcccatg 
atctgacaca 
ctgtgtgaga 
ttaaaagtta 
ttccagaaca 
tcggaagaag 
tatgtctgtc 
caccagaaat 
caattacact 
gcaacaactt 
tacataaatc 
ctaacgtggc 
ttacgcacaa 
tccgcaatgg 
ttcatccacg 
tccgcatgcg 
ccaggaagac 
agagtgggtc 
cgttgtacga 
tcagcaagtt 
ccagggatcc 
ccgccggagc 
tggcgctgtt 
cgctaccact 
tatgcgtgaa 
ccattttccc 
cgtcgccact 
tcaaatcatc 
atcactatct 
gtctataaaa 
tttcgtggct 
gtttatccgg 
aaactcgctt 
tcacctacaa 
gcctgggtcg 
gcccaaggcc 
agctagaggg 
atcgggcaaa 
catgcgcgtc 
cacaattgaa 
acaaacgagc 
caagcgcacg 
ttggtaacaa 
cagacaaact 



taattttaat 
tatttcctat 
cgtactcgct 
gcaacacttg 
tggcactggg 
gcgtgctcca 
gcgatttgca 
tttggatgtc 
ccaaccaatt 
ggatggatag 
cgtttatcgg 
tggacacttg 
gagagagcac 
tttctggtaa 
tgtagttcct 
tcttgaccat 
actttaatga 
ccagggagtc 
aaaagtaaat 
atgagcactt 
tgcccaaaca 
tcgtcagtgc 
gggctaatat 
ccccagtggc 
agatgttgcc 
ccttttcgtt 
agatttacgg 
aggagcccgc 
gacgatgtag 
gaatccgaag 
cttggtggcc 
ggaatagtag 
tctcgcagat 
caccgtggcc 
attcagttta 
agtgccattc 
attattgtaa 
cggcgccatg 
tgtatggctt 
cggcgtcttg 
gtagacgtat 
atggcatcgt 
ttgggttaga 
ggattgtttt 
ctagcgccca 
aagtttgttg 
ttcgtcagct 
tcgtgtgcat 
gattggggat 
ggagccgaga 
gctaattaat 
aagccagcga 
aattgagcaa 
cagcactctt 
acgaagccag 
ggcgactaag 



tattttcata 
taatcgaaca 
gtatagcaac 
gcacaacctg 
attggggtag 
caatcggaat 
tagacatttg 
agtccacaaa 
gggggtgtaa 
gcatgtcagg 
gtcgtttgtg 
ttctttggtc 
aaattaattt 
agtacgctga 
tgatttaatg 
tgcagaaagc 
tgctgcatcg 
tggtatataa 
ataaccgagt 
tcgtttggac 
tatcgggcac 
catttggcag 
gcagaacaaa 
gaataaatct 
aatcagtgtg 
ggagaggggg 
ggtccaccca 
aggacgacgg 
gccacgcagt 
tccacgctcc 
gcagccgccg 
gtggatgaag 
gctgttgtgg 
gtggccgtag 
atggcactcg 
aattcgttga 
tccccggcga 
gtcggaatgg 
ttatgctttc 
attgggcgga 
ctcgaatctt 
cacttggcat 
gaacagtttt 
gcgtttgccc 
aggacccttc 
ccgtttcggc 
ggttgacgtt 
atttcagata 
cgggcattgg 
gcatcggctc 
tttaatgccc 
gcgtgcactt 
ctcaatagca 
cgattgagtg 
tgcgacattc 
ggcccctcgg 



10800 
10860 
10920 
1098 0 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14111 
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<210> 44 
<211> 1525 
<212> DNA 
<213> Drosophila 



<400> 44 

atgatggcag 

gttcccagta 

gtcgcggaga 

aatttcgtcg 

ggtgggactg 

attacggaca 

gccagttcgc 

gcgggcaagt 

aacctatcgc 

gacagtgccc 

taccgtcact 

gtgggactca 

acggtgacga 

tgcctgccag 

agatgccgac 

gaagcagatg 

ggtgaccacc 

cgacgccctg 

gggcacggat 

catgtccctg 

ccccggcgag 

catcaagatg 

catctttgcg 

gaaggtgatg 

ctactcagtg 

ctgcggacgc 



ccagcgggcg 
cgacgactag 
caatggagga 
atttgcagaa 
ctgtcagctc 
ctctgctggg 
tgctagccac 
cgctggccat 
cggcgaccac 
tccggtggga 
cgctggccat 
tcgggaatag 
actacttcat 
ccacactgat 
gagatttatt 
acaaagcgac 
attccttggc 
gtctcggcct 
gggaacctct 
atcacgctct 
tccaaggacg 
ttggtggccg 
cggatcaaat 
ccggtggccc 
aacaagaagt 
ctcaggttag 



gattcgcaag 
tgtacccatg 
agccgccgcc 
cctcttgagt 
cttgggctca 
cacggttctc 
cttggcggcc 
cgcagacgcc 
cagcttgatc 
gcagctggac 
gtccatggtc 
cttcgtgatc 
tgtgaatctg 
tggcaacatc 
cgccactggg 
gtgcccgcat 
tgctcttctt 
actcgcagcc 
acttcctgct 
gctacgtgct 
cgcaaatgga 
tggtcatcct 
tcggctcgga 
agtggctggg 
accgacgcgg 
ttaat 



cgaaagcata 
cccattccga 
ctcgccgggg 
ttcaacgaat 
tcgagtgcca 
actacggcca 
accacaacag 
acttcatcca 
tcggcggctg 
gggagcgtgg 
tactgcgtgg 
gccgtcgtcc 
gccattgcgg 
tttgtgcgta 
ggccacgttc 
catgataatc 
cgacctggtg 
gcagttcctc 
agccaacctg 
catctggatc 
ccgcatgcag 
gtttgttttg 
catctcgcag 
ctcctcgaac 
attcgcggcc 



aaagccatac 
ccatggcgcc 
attacaataa 
tgaatggcac 
ttaaactgaa 
cggccacggt 
catctgcgag 
cctactattc 
cggccaccaa 
acttcggatt 
cctacatcgt 
tgcgggctcc 
acatcctggt 
agtatctctt 
attgccatct 
ggcatatggg 
cccgccgagg 
tgccaggagg 
gtggcctgct 
aaggtctcca 
cagaagagca 
tcctggctgc 
gaggagttcg 
agctgcatca 
atcatcaagt 



aagtggcgac 
ggggaaaatg 
tttcacgcat 
tagtggtagc 
taacagcgcc 
ggctccggcg 
aggatccctg 
caacttgctg 
gtcgtacaac 
cgacccactc 
cgtcttcctg 
tcgcatgcgg 
cattgtcttc 
taattccgca 
ggtggcccct 
taatcgcgct 
aggtcttctc 
tgtggccccc 
acctgctgcc 
cgaggtccat 
aggtgaaggt 
cgctctatgt 
agatcctcaa 
accccatact 
cgcgcagctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1525 



<210> 45 
<211> 507 
<212> PRT 
<213> Drosophila 

<400> 45 

Met Met Ala Ala Ser Gly Arg He Arg Lys Arg 

15 10 
Thr Ser Gly Asp Val Pro Ser Thr Thr Thr Ser 

20 25 
Pro Thr Met Ala Pro Gly Lys Met Val Ala Glu 

35 40 
Ala Ala Leu Ala Gly Asp Tyr Asn Asn Phe Thr 

50 55 
Leu Gin Asn Leu Leu Ser Phe Asn Glu Leu Asn 
65 70 75 

Gly Gly Thr Ala Val Ser Ser Leu Gly Ser Ser 

85 90 
Asn Asn Ser Ala He Thr Asp Thr Leu Leu Gly 

100 105 
Ala Thr Ala Thr Val Ala Pro Ala Ala Ser Ser 

115 120 
Ala Ala Thr Thr Thr Ala Ser Ala Arg Gly Ser 

130 135 
Leu Ala He Ala Asp Ala Thr Ser Ser Thr Tyr 
145 150 155 

Asn Leu Ser Pro Ala Thr Thr Ser Leu He Ser 



Lys His 

Val Pro 

Thr Met 
45 

His Asn 
60 

Gly Thr 

Ser Ala 

Thr Val 

Leu Leu 
125 
Leu Ala 
140 

Tyr Ser 
Ala Ala 



Lys Ser His 
15 

Met Pro He 
30 

Glu Glu Ala 

Phe Val Asp 

Ser Gly Ser 
80 

He Lys Leu 
95 

Leu Thr Thr 
110 

Ala Thr Leu 

Gly Lys Ser 

Asn Leu Leu 
160 

Ala Ala Thr 
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165 










170 










175 




Lys 


Ser 


Tyr 


Asn 


Asp 


Ser 


Ala 


Leu 


Arg 


Trp 


Glu 


Gin 


Leu 


Asp 


Gly 


Ser 








180 










185 










190 






Val 


Asp 


Phe 


Gly 


Phe 


Asp 


Pro 


Leu 


Tyr 


Arg 


His 


Ser 


Leu 


Ala 


Met 


Ser 






195 










200 










205 








Met 


Val 


Tyr 


Cys 


Val 


Ala 


Tyr 


He 


Val 


Val 


Phe 


Leu 


Val 


Gly 


Leu 


He 




210 










215 










220 










Gly 


Asn 


Ser 


Phe 


Val 


He 


Ala 


Val 


Val 


Leu 


Arg 


Ala 


Pro 


Arg 


Met 


Arg 


225 










230 










235 










240 


Thr 


Val 


Thr 


Asn 


Tyr 


Phe 


He 


Val 


Asn 


Leu 


Ala 


He 


Ala 


Asp 


He 


Leu 










245 










250 










255 




Val 


lie 


Val 


Phe 


Cys 


Leu 


Pro 


Ala 


Thr 


Leu 


He 


Gly 


Asn 


He 


Phe 


Val 








260 










265 










270 






Arg 


Lys 


Tyr 


Leu 


Phe 


Asn 


Ser 


Ala 


Arg 


Cys 


Arg 


Arg 


Asp 


Leu 


Phe 


Ala 






275 










280 










285 








Thr 


Gly 


Gly 


His 


Val 


His 


Cys 


His 


Leu 


Val 


Ala 


Pro 


Glu 


Ala 


Asp 


Asp 




290 










295 










300 










Lys 


Ala 


Thr 


Cys 


Pro 


His 


His 


Asp 


Asn 


Arg 


His 


Met 


Gly 


Asn 


Arg 


Ala 


305 










310 










315 










320 


Gly 


Asp 


His 


His 


Ser 


Leu 


Ala 


Ala 


Leu 


Leu 


Arg 


Pro 


Gly 


Ala 


Arg 


Arg 










325 










330 










335 




Gly 


Gly 


Leu 


Leu 


Arg 


Arg 


Pro 


Gly 


Leu 


Gly 


Leu 


Leu 


Ala 


Ala 


Ala 


Val 








340 










345 










350 






Pro 


Leu 


Pro 


Gly 


Gly 


Val 


Ala 


Pro 


Gly 


His 


Gly 


Trp 


Glu 


Pro 


Leu 


Leu 






355 










360 










365 








Pro 


Ala 


Ser 


Gin 


Pro 


Gly 


Gly 


Leu 


Leu 


Pro 


Ala 


Ala 


His 


Val 


Pro 


Asp 




370 










375 










380 










His 


Ala 


Leu 


Leu 


Arg 


Ala 


His 


Leu 


Asp 


Gin 


Gly 


Leu 


His 


Glu 


Val 


His 


385 










390 










395 










400 


Pro 


Arg 


Arg 


Val 


Gin 


Gly Arg 


Ala 


Asn 


Gly 


Pro 


His 


Ala 


Ala 


Glu 


Glu 










405 










410 










415 




Gin 


Gly 


Glu 


Gly 


His 


Gin 


Asp 


Val 


Gly 


Gly 


Arg 


Gly 


His 


Pro 


Val 


Cys 








420 










425 










430 






Phe 


Val 


Leu 


Ala 


Ala 


Ala 


Leu 


Cys 


His 


Leu 


Cys 


Ala 


Asp 


Gin 


He 


Arg 






435 










440 










445 








Leu 


Gly 


His 


Leu 


Ala 


Gly 


Gly 


Val 


Arg 


Asp 


Pro 


Gin 


Glu 


Gly 


Asp 


Ala 




450 










455 










460 










Gly 


Gly 


Pro 


Val 


Ala 


Gly 


Leu 


Leu 


Glu 


Gin 


Leu 


His 


Gin 


Pro 


His 


Thr 


465 










470 










475 










480 


Leu 


Leu 


Ser 


Glu 


Gin 


Glu 


Val 


Pro 


Thr 


Arg 


He 


Arg 


Gly 


His 


His 


Gin 










485 










490 










495 




Val 


Ala 


Gin 


Leu 


Leu 


Arg 


Thr 


Pro 


Gin 


Val 


Ser 













500 



505 



<210> 46 
<211> 3688 
<212> DNA 
<213> Drosophila 



<400> 46 

tgtttgcaga 

ttatttgtga 

catcgcaaca 

ggggcgaact 

cattcttgaa 

ttaaaccaaa 

tgggttttgg 

ggatgagttg 

aaatttctga 



gcccacagca 
tgttgacttt 
attataaatg 
gaaaagtgga 
gtcgttgcct 
ttgactacaa 
tttaagatgc 
gaatttaaat 
taaatggccg 



accaatatgt 
ccaacagtga 
gtgtggtggc 
ttttccagcg 
ttattctgct 
cgttatgctc 
ttgggcatgt 
ggccttaata 
cgattcacat 



tttcattgtc 
cgtcagtgga 
gaaaaaacgt 
cgtgccagga 
ttcacacata 
tattaactct 
ttatccacca 
gctagttagg 
acaagtaaaa 



tgcccggttt 
ggccacagac 
ccttgtgccc 
aaatgaatga 
attatgaaat 
tatggtctac 
ttacgagtgg 
actggaaaga 
tcagtctgtt 



gtcgctcaat 
aataccacat 
attgttcagt 
aaatgcggga 
aaatattagc 
ttatgcttaa 
ctatatacac 
aatgcatttt 
taagagaata 



60 

120 

180 

240 

300 

360 

420 

480 
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taactaaacc 
taatagtttt 
tttaaagttg 
aaacctttgt 
aaatcttgaa 
ttaaatgccg 
tcgtagcgta 
ggtataacta 
cctttagaga 
tttccggctg 
ggtgggacgt 
cgtcaacgaa 
cgtgtggcag 
cttaaaaggc 
taaagcataa 
gaaagacctg 
ccagactttg 
agacctgcag 
tggtcaatgt 
ccctcctggc 
aaataaaatt 
ccgtgggtac 
cagatatgat 
gccaggcgat 
cctcgtacaa 
ccaggtgacg 
tatttctaat 
caagcagtgt 
tagaaaactt 
gatggcatcg 
gcagacctgc 
gatgcgccat 
tggactgggc 
tgccagctgt 
gaacagaaca 
ctgttcggtc 
aagttcattg 
atccaagcac 
atagagctat 
ttcttccagt 
tggacgctca 
ggaaattgct 
gcagtttcac 
cattcgaaat 
ctagtagata 
ttttaaaaat 
ctagcttatt 
agaatatcgc 
tgctgcaaat 
agtttgaatt 
ttacttaagg 
tcgtccttgg 
ataccaaact 



gagctactaa 
cgaaacactt 
attgcaaaaa 
accaatatca 
tagattcaat 
tataaatgtt 
gacttggcgt 
tcgaacttat 
aactccgcgt 
gtgtgggcca 
tcgcagatta 
gtggtcagtg 
cggttttgct 
ggaaagcgac 
gaaaatgtgt 
cgatgagaag 
gatgaaggtg 
cagatcgaag 
catcttgacc 
aggtgcagct 
atttttggga 
aaaaatggaa 
cagcgcagga 
cggtgaaccc 
aaccaggatg 
ggctcttttc 
ttaacttaaa 
aagcagaagc 
ttcatacgac 
tatctgtcga 
gagaagcgga 
atgatgtcgg 
attagaagca 
ttaatgaagt 
gctgaattgc 
tggaagagtg 
tcaacaaagt 
acacacacac 
gaattcaagc 
tcagtactaa 
cccacacacc 
gccaagatga 
agctcaaaat 
tcaattcata 
aagaataatg 
attttaaaca 
gccgaagaaa 
atatatgcta 
taaatttgct 
atggcctcgg 
gcaggccaag 
aactgcgctc 
tgcagtattc 



tggattgaaa 
atttcattag 
tgcctaatta 
aattataatg 
tacccatttt 
atcttaacta 
acttacgact 
tgtctaagcc 
cgctaaagga 
gctatcatac 
ccacacgggc 
tcctcatcga 
gcgagttgtt 
tggaaatata 
aagatttttt 
aggcaataga 
gcgatggcaa 
atgatatacg 
gttttgacct 
ccaaaaccag 
actccgattt 
ccctttgccc 
atggcaaata 
aactcaatcc 
ggaagcgaaa 
cctaaaagca 
tttgtatatt 
cacaaacagc 
gacttaactt 
tgctcatggc 
tggccttgca 
tgaggacgtt 
agggaagagg 
agttcatccg 
ccagaacgat 
caaagaacaa 
gttaagcacg 
acacgccttg 
tggttggtgc 
caagactagg 
cagacgtgat 
aggacctgca 
gcgtggcttg 
gcactcaaat 
aacccttatt 
gtgtgaagtg 
atgaaatatt 
atatgactaa 
ctcaaggcat 
ggtcttgcag 
taaacttgtc 
agactgaaga 
actttttg 



cctaggttct 
gataccatat 
aatgtttcag 
ggttctttag 
gacactttag 
tagattgcga 
agaaacatac 
tctggtgcct 
cgtgggacca 
ctccgacatg 
gttcgtcttg 
atcgcaactg 
gcggtatgac 
ataaaaataa 
aacatgaaat 
ttagtggatt 
tgttggtctg 
gcgaccagca 
ttgcccgtgg 
cacgttctgt 
ggcaatatgc 
aaatcgtctt 
gagtggccga 
agcattgcgg 
acaacgccga 
taaaatttaa 
ttttttggtt 
gctcatcttt 
acacgacttt 
caccagcacg 
ggccaggttg 
gagcagtccc 
tcagtcccaa 
cgacttgcga 
gacggtgaac 
agtgtggtca 
gaaatcacgc 
tttatggctc 
ggcgggcagg 
acaacacgac 
ggaagacaga 
cttcgaaaaa 
gtatttaatt 
ctaaaattgt 
acttttactt 
cagatatgac 
ctttatacgc 
tgatctcttc 
ctccccctat 
tctgacgcca 
ccgaaaaaaa 
cgccccacct 



gagtggtgga 
acttaaatta 
gtaagttttt 
ttaaaattta 
atttttaatt 
agtagaaata 
atttggcttt 
tcacaggatg 
tgaatcgaga 
gctggcacgg 
ttggtggacc 
ttgtgaagcc 
ttgcagcagc 
aaatatcaaa 
cttaacatac 
cgccgccgag 
tgagtgtgga 
gatgatgaac 
aataatgccc 
taaaatataa 
atagccagaa 
tacgatgatc 
caccaggctc 
atgtccttgg 
gatgagccat 
atatatattt 
aattcccgaa 
ggctaacaac 
gagaagttca 
taggtggacg 
cctgcccgcc 
acgcacagat 
tcccaaaact 
ttcttgttga 
aggatccaga 
catttcatta 
ataaaactca 
tttaaatagc 
atgtgccggc 
atcctgcact 
cagccaccgt 
tatttctttc 
tatagaactt 
aacaattctt 
ttttaactga 
agtgagtagt 
ccgcatgcag 
atggtatcta 
gcccacgtct 
acaccgatgt 
agctgaaatc 
tattctacaa 



caaaaaatct 
atttctccag 
tggaaatata 
ataaaaatca 
ttagagttaa 
ggtttgaaat 
acccatgcct 
tccgtgttcc 
acgcggatct 
taaccacctt 
ttcgggaaac 
gacgaccaac 
atgtgaacca 
gtaagtaaat 
cttaatgtgc 
ttcagcggtg 
atctggccaa 
acaaacacga 
ctcgagctgg 
ttaacaaata 
gatatcttac 
gcatagcagg 
atgtacacct 
atgagcttct 
gcgccagcca 
aagtgaaata 
ctaaattatc 
atttggcaat 

tggggtgtgt 
agtatgtgac 
acgaaatcgt 
ctggaaataa 
cacctgccaa 
tgaacatcac 
gcacagcaaa 
aacgcctatc 
atttaatgcc 
cataaaaatg 
ggaagtttgc 
caaacacccg 
tattccagca 
gacttttaag 
atactceagg 
atatagaaaa 
actgttatta 
tctgctagtt 
tttatatagc 
caaagttttt 
attgaagtta 
tgatgcggac 
ctgacaggct 
ctttttgtcc 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3688 



<210> 47 
<211> 1191 
<212> DNA 
<213> Drosophila 



<400> 47 
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atgaaatgtg accacacttt 
tggatcctgt tcaccgtcat 
aacaagaatc gcaagtcgcg 
tgcgtgggac tgctcaacgt 
gcaggcaacc tggcctgcaa 
acctacgtgc tggtggccat 
ttctcaaagt cgtggaaaag 
ttgttttcgc ttcccatcct 
tgctggattg agttgggttc 
actctatttg ccattcctgc 
atttgggcaa agggttccat 
gccaggaggg ccagctcgag 
acattgacca tcgtgtttgt 
ctgcaggtct ttggccagat 
caaagtctgg caccgctgaa 
caggtctttc gcacattaag 
tcataccgca acaactcgca 
agttgcgatt cgatgaggac 
aagacgaacg cccgtgtggt 
atgtcggagc gacgcggagt 

<210> 48 
<211> 396 
<212> PRT 
<213> Drosophila 



<400> 48 



Met 


Lys 


Cys 


Asp 


His 


Thr 


Leu 


Phe 


Phe 


Ala 


Leu 


Phe 


Gin 


Thr 


Glu 


Gin 


1 








5 










10 










15 




Phe 


Ala 


Val 


Leu 


Trp 


He 


Leu 


Phe 


Thr 


Val 


He 


Val 


Leu 


Gly 


Asn 


Ser 








20 










25 










30 






Ala 


Val 


Leu 


Phe 


Val 


Met 


Phe 


He 


Asn 


Lys 


Asn 


Arg 


Lys 


Ser 


Arg 


Met 






35 










40 










45 








Asn 


Tyr 


Phe 


He 


Lys 


Gin 


Leu 


Ala 


Leu 


Ala 


Asp 


Leu 


Cys 


Val 


Gly 


Leu 




50 










55 










60 










Leu 


Asn 


Val 


Leu 


Thr 


Asp 


He 


He 


Trp 


Arg 


He 


Thr 


He 


Ser 


Trp 


Arg 


65 










70 










75 










80 


Ala 


Gly 


Asn 


Leu 


Ala 


Cys 


Lys 


Ala 


He 


Arg 


Phe 


Ser 


Gin 


Val 


Cys 


Val 










85 










90 










95 




Thr 


Tyr 


Ser 


Ser 


Thr 


Tyr 


Val 


Leu 


Val 


Ala 


Met 


Ser 


He 


Asp 


Arg 


Tyr 








100 










105 










110 






Asp 


Ala 


He 


Thr 


His 


Pro 


Met 


Asn 


Phe 


Ser 


Lys 


Ser 


Trp 


Lys 


Arg 


Ala 






115 










120 










125 








Arg 


His 


Leu 


Val 


Ala 


Gly 


Ala 


Trp 


Leu 


He 


Ser 


Ala 


Leu 


Phe 


Ser 


Leu 




130 










135 










140 










Pro 


He 


Leu 


Val 


Leu 


Tyr 


Glu 


Glu 


Lys 


Leu 


He 


Gin 


Gly 


His 


Pro 


Gin 


145 










150 










155 










160 


Cys 


Trp 


He 


Glu 


Leu 


Gly 


Ser 


Pro 


He 


Ala 


Trp 


Gin 


Val 


Tyr 


Met 


Ser 










165 










170 










175 




Leu 


Val 


Ser 


Ala 


Thr 


Leu 


Phe 


Ala 


He 


Pro 


Ala 


Leu 


He 


He 


Ser 


Ala 








180 










185 










190 






Cys 


Tyr 


Ala 


He 


He 


Val 


Lys 


Thr 


He 


Trp 


Ala 


Lys 


Gly 


Ser 


He 


Phe 






195 










200 










205 








Val 


Pro 


Thr 


Glu 


Arg 


Ala 


Gly 


Phe 


Gly 


Ala 


Ala 


Pro 


Ala 


Arg 


Arg 


Ala 




210 










215 










220 










Ser 


Ser 


Arg 


Gly 


He 


He 


Pro 


Arg 


Ala 


Lys 


Val 


Lys 


Thr 


Val 


Lys 


Met 


225 










230 










235 










240 


Thr 


Leu 


Thr 


He 


Val 


Phe 


Val 


Phe 


He 


He 


Cys 


Trp 


Ser 


Pro 


Tyr 


He 










245 










250 










255 




He 


Phe 


Asp 


Leu 


Leu 


Gin 


Val 


Phe 


Gly 


Gin 


He 


Pro 


His 


Ser 


Gin 


Thr 



70 
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gttctttgca ctcttccaga ccgaacagtt tgctgtgctc 60 
cgttctgggc aattcagctg ttctgttcgt gatgttcatc 120 
gatgaactac ttcattaaac agctggcatt ggcagatctg 180 
cctcaccgac atcatatggc gcatcacgat ttcgtggcgg 240 
ggccatccgc ttctcgcagg tctgcgtcac atactcgtcc 300 
gagcatcgac agatacgatg ccatcacaca ccccatgaac 360 
agcccgtcac ctggtggctg gcgcatggct catctcggcg 420 
ggttttgtac gaggagaagc tcatccaagg acatccgcaa 480 
accgatcgcc tggcaggtgt acatgagcct ggtgtcggcc 540 
gctgatcata tctgcctgct atgcgatcat cgtaaagacg 600 
ttttgtaccc acggaacgtg ctggttttgg agctgcacct 660 
gggcattatt ccacgggcaa aggtcaaaac ggtcaagatg 720 
gttcatcatc tgctggtcgc cgtatatcat cttcgatctg 780 
tccacactca cagaccaaca ttgccatcgc caccttcatc 840 
ctcggcggcg aatccactaa tctattgcct cttctcatcg 900 
tcgctttccg ccttttaagt ggttcacatg ctgctgcaag 960 
gcaaaaccgc tgccacacgg ttggtcgtcg gcttcacaac 1020 
actgaccact tcgttgacgg tttcccgaag gtccaccaac 1080 
aatctgcgaa cgtcccacca aggtggttac cgtgccagcc 1140 
ttctctaaag gggaacacgg acatcctgtg a 1191 
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Asn lie 

Ala Ala 
290 
Thr Leu 
305 

Ser Tyr 

Arg Leu 

Thr Val 

Cys Glu 
370 
Arg Gly 
385 



Ala 
275 
Asn 

Ser 

Arg 

His 

Ser 
355 
Arg 

Val 



260 
He 

Pro 

Arg 

Asn 

Asn 
340 
Arg 

Pro 

Ser 



Ala Thr Phe 

Leu He Tyr 
295 

Phe Pro Pro 

310 
Asn Ser Gin 
325 

Ser Cys Asp 

Arg Ser Thr 

Thr Lys Val 
375 

Leu Lys Gly 
390 



265 
He Gin 
280 

Cys Leu 

Phe Lys 

Gin Asn 

Ser Met 
345 
Asn Lys 
360 

Val Thr 
Asn Thr 



Ser Leu Ala 

Phe Ser Ser 
300 

Trp Phe Thr 

315 
Arg Cys His 
330 

Arg Thr Leu 

Thr Asn Ala 

Val Pro Ala 
380 

Asp He Leu 
395 



Pro 
285 
Gin 

Cys 

Thr 

Thr 

Arg 
365 
Met 



270 
Leu 

Val 

Cys 

Val 

Thr 
350 
Val 

Ser 



Asn Ser 

Phe Arg 

Cys Lys 
320 
Gly Arg 
335 

Ser Leu 
Val He 
Glu Arg 



<210> 49 
<211> 5051 
<212> DNA 
<213> Drosophila 



<400> 49 

ttccaatgcg 

ccagtggaaa 

atatgttcat 

ctgagtcatt 

ccagcgttta 

ccttatctta 

ctctctcgct 

ttgttcattt 

agtgcgcgaa 

tcttggaaaa 

agtgatgtaa 

attatacaat 

gcaacaacaa 

acataatttc 

aactgcagca 

cgcgctggcg 

ggcgttgggg 

tcgtccgatc 

gtcgcaagac 

cttgagagca 

caagaaacaa 

aaatatttgg 

tgtgtcataa 

gactgtgaaa 

acctcctgat 

ccttttggat 

ccatcaatct 

agtctgcgga 

accatcaggt 

gaaggaacat 

actgtcgcga 

agacacacga 

cgcccgtcca 

tactgctcgt 

gcatttcggg 



atttaaggta 
aaaaaaactg 
tcagggagcg 
gtcgagcggg 
tttctatttg 
cggacatatc 
cttttcttgc 
cacagcagca 
cgccgttgtg 
tggaaaagca 
tacagagcaa 
ttgtgttgct 
ccactagtga 
aaagtcggag 
acactcgagc 
tttctccgtt 
cgaacgctca 
ggtgcgcgtg 
aattgacaag 
aaaaagaaat 
accgcataca 
ctgcaaatgt 
gcagagccaa 
ctgtaaagtg 
cctggccctg 
aacgcctggc 
cattggttgc 
ggcatccact 
gatacttcgt 
cagcgaaaac 
gttgaccatc 
agagaatctc 
actgagatgt 
ggtgggggac 
cgtgaaaacc 



attgactata 
gatttatttc 
tgccgatttt 
ttttggtagc 
cttttaattt 
tccccaagcg 
agctggcgcc 
gtcggcctaa 
ttgcacattg 
cgagcagcag 
aagtgaagcg 
gccgcactga 
taaagaccgg 
cgtgggaatc 
actcgtggct 
tcgctcgcac 
cgacagttgt 
ataaatacac 
acacgaaaaa 
tgttttttac 
atactgaaca 
gaagagacga 
aacaaaaccg 
atgtgtggta 
ctgctcgtct 
ttgaggacca 
tgcatcttgg 
tccaacgccg 
cgcaatggcc 
ggcgatacgg 
gatgagcgcg 
tcggcctttg 
tggtccatag 
acctggtgtg 
tggatcttca 



agtacccatg 
ttatcgcaaa 
ttagtttgcg 
cggggattaa 
cacaatcaaa 
tcgtctcgct 
cgacggtggg 
taaaaatgca 
gaaattgtgc 
tagcaccagc 
gcggcaattg 
actaggacaa 
ctggcaacag 
gcaaatcacg 
ctccggatcg 
ccgcctgtgg 
gcttggaacg 
ggcgaacaaa 
tcataataaa 
aaatctcgca 
caaaaaaaag 
gagacagcgg 
aaaagccaaa 
atcgaaaaac 
cgttcctgat 
ccgccaatcg 
cccccacttt 
agaccctgga 
agctggtgga 
tggagacctt 
gagatggtgg 
agagcttacc 
acatgactgc 
ccgttacgga 
tagcactcac 



acgaacttct 
acttgcgata 
aaattgagtt 
atctgctgta 
acagcaataa 
ctaatttgga 
aatttcatca 
atttttatgg 
agaagccaga 
agcagctata 
gcattgcaaa 
cagcagcaac 
tgcagcggcc 
gcagtggcag 
ctcatttcgc 
agcagtcgcg 
cttcggagtg 
attgagtcaa 
cacaagtgga 
cgtaaattag 
ggtattcaac 
atatactacc 
ttcaataaaa 
agcataaaga 
caacctgctg 
ctttacgatc 
gttcctggga 
gttcttctcg 
acaggacggc 
cttcaagtgc 
ctttgtcatc 
cacggaggct 
tgcgctggga 
tcccctgcgc 
ctgggtggtg 



ggacagttcc 
accaatttgt 
gttatgtttt 
tcatcttggt 
taaaaccgtg 
tttttcctct 
gctgttgttg 
ttaagaaaat 
agaaaatgtg 
acattgagcc 
aaagaaacag 
aacagcagca 
acttgcagca 
caacaacagc 
tcgccgttct 
tatatttgga 
aagatcgtgc 
gagtcagacg 
aaagcaactg 
caagcgttcg 
aaaaaaccaa 
gagtgaaatt 
ggccataagt 
atggccatag 
gcgctgtgtg 
aacctgctgg 
ctgccgggaa 
aagccgggaa 
gtggtggtgc 
aatgccacct 
acggaaaccg 
cccattttgc 
gctctggcag 
taccacagtc 
ggcatactct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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ttggagccct ttccgccttc cgagttttgg actttgaggc ggacgccctg ttcagcagac 2160 

aacgccgcct ggcagtcacg tacttcaaca tcagcagcac caatagtatt ttcggagtgg 2220 

tgtacgccag tgtctacttc atcgtgatca tcctcctgcc cttcggcttc gtgtgcggaa 2280 

tgtactggag gatattcagt gaggcgcgcg gaaatgggct gcgcatgcgc cagaatggct 2340 

cttcgccgct gctgcagagc gccttgaatc tgacggctgg ccagcaggcg gctcaagcca 2400 

accagttctc caatagcttg tgtgttcatc ggcattccat atcctcggcc agttcgcatg 2460 

gcggcaacag cagcttgggc ttgggcggtc tgcaaatgca aattgatcag cgccaacagc 2520 

cgaggagttc gcccagttgt ctccgtcggg actcggccgc gaaagtactt ttgcccacca 2580 

tctcggatga tggtggctca gatgcggaaa gtggagccgg agttcaactg atgcccgtac 2640 

aggagcactc actgtcagat agaaatcaga acattatgct tactctgcaa acagccagtg 2700 

gtgaaataaa gaggaactac tccgccagac agctgcccct gttgggcact tcttcgcagg 2760 

atctgaggga gactaatcgt ctgcagggca tacgccaggt gcacagttcg cccaatctcc 2820 

acaagtacac ggaactgcgc caggactcgc tcagtgagga gtgtggttct ccgcatcttt 2880 

tgggtcacgc ccagcggcag cagcagcagc aattgcacct gcaacatcag cagcaacagc 2940 

atcatcacca ccagcaacat catccgcact tcagttcgcc gcgccaccag caacacggcc 3000 

acgccctcca gatacccgcc attcatgcct cccccaaggc tctgagttac atgagctcct 3060 

tgcgacaccg tttgagcaac gccagttcgc tattcaaata ccgtgaggag tcgagagcag 3120 

cccgcatcag catcctggtg gtggtgatgt ttgtggtctc gtacctgccc ttcggactgc 3180 

tggtactcct gcaatcgcga ctgtcggctg ctaactttgg cggatcctcc cagctggcca 3240 

tcttcatgat cctgctggcc aatctcagtt cgccattcat cttcgcctac aggaacaaga 3300 

gggtgaggcg aggagtcaag cggctgtttg gtctggattc gtcgtcgggc ctgcagcgca 3360 

actgcagcag cagtgtaaag accaatggca ccgcaggtcc tgctgcaagt ggtgcacaat 3420 

tgcagcgcaa cagcagcaag ttgtcgcagt acagcagcaa cagctgcaag tatctcaccc 3480 

cccagagttc cctggtcagt caagtgccgg tgcatacgac cttgacgtta aggcccaaca 3540 

gcagctgcag caccatcatc aatttcggtg gcgccagggg ctcagcggac tccgatgagc 3600 

agccaccggc aacaccacca cctactgtgg cagttgctcc accgcccaca cgaccgcagc 3660 

gccccaagct gcagaggggc atcaccatcg ttgagcacat agctataaca ccaaccatgc 3720 

cccaaaagtt ccagaatcgg cgcgccaggc tcttcgacat gttcttccgc agctccaaga 3780 

aactgcaggc gggctgccag agccagagtt tgcccactga ggtctaaacg gggggttatg 3840 

ataactatac atataggaag gtaaacaaac actaaacttg cttgctgaag aattctaaga 3900 

acagtattcg taatttgtaa gtctagcaat taacaaagac aactgtttga atttgagttt 3960 

taagaccaga tcatgctgtc ctagaagctt ctcatgccaa aaaacaaaac cacaaagaaa 4020 

gtcaaactag aaagtttgag atgctttaaa ttaacaaaaa aagaaaagaa aaaaacaaaa 4080 

aacaaaacaa agtacctttt aaaaaataga aaactgtgta caagtcaaat taacaaagat 4140 

tgagagtgat gggagttata atcttagggt ggatatcaaa ctgaaaagtg gaaccaagcc 4200 

aagccaagtt gagatcgtaa tcaatgtacc taagcaactt gcatatgaac atttaatgtt 4260 

tttactcaga tgaaatcaca ggaaaatagc aaagacagca cagcttattc gaacttcccc 4320 

aactagttta ttatttattg aaacaattgt aaatatttat tagacatcca cgccgcaaag 4380 

ctttaactcc aaaaacattc tggctaaaca agaagagacg catacattcg attcataagt 4440 

taattaagtc aatgtgcaaa gcatatgtgt attctagagt ttaagttgta gagccgagta 4500 

gtcttacacc gagcacctag tgccgagcag tgctttttca cctgcccaac tggagcgtct 4560 

gttgcaggcg gcattttcat attgaaaatg cattttggcc agtgcacagt tggttttcca 4620 

gttggccaga atgtcagtca gcagttagtt agcctttggt tagccaacgg ctattgggcc 4 680 

attaagtcaa tttgcttttt agaacaccgc gggcattgcg tgcagctaaa tacctcatta 4740 

gttataattt gaagtgcgag atggatggcg gcaataggtc aaaggtcgga gccatcggca 4800 

cggcccatgc ataatgcatg gaaaatgttc gattgcccct ggcatttgta tcatttgctt 4860 

tactatgctg caggccagaa agctttatac cttaagttct cctgctttga tttcccttgg 4920 

ctgcacaaaa aggcaacaaa agcaacacac actcacacaa cacaaaaaaa cctcgaattt 4 980 

atatttctaa agtttatatt ttataaagag tgtgtgcata ggaaatatct ataaatattc 5040 

atgcatgtgt g 5051 



<210> 50 
<211> 3051 
<212> DNA 
<213> Drosophila 

<400> 50 

cttcggagtg aagatcgtgc tcgtccgatc ggtgcgcgtg ataaatacac ggcgaacaaa 60 
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attgagtcaa gagtcagacg gtcgcaagac aattgacaag acacgaaaaa tcataataaa 120 

cacaagtgga aaagcaactg cttgagagca aaaaagaaat tgttttttac aaatctcgca 180 

cgtaaattag caagcgttcg caagaaacaa accgcataca atactgaaca caaaaaaaag 240 

ggtattcaac aaaaaaccaa aaatatttgg ctgcaaatgt gaagagacga gagacagcgg 300 

atataccacc gagtgaaatt tgtgtcataa gcagagccaa aacaaaaccg aaaagccaaa 360 

ttcaataaaa ggccataagt gactgtgaaa ctgtaaagtg atgtgtggta atcgaaaaac 420 

agcataaaga atggccatag acctcctgat cctggccctg ctgctcgtct cgttcctgat 480 

caacctgctg gcgctgtgtg ccttttggat aacgcctggc ttgaggacca ccgccaatcg 540 

ctttacgatc aacctgctgg ccatcaatct cattggttgc tgcatcttgg cccccacttt 600 

gttcctggga ctgccgggaa agtctgcgga ggcatccact tccaacgccg agaccctgga 660 

gttcttctcg aagccgggaa accatcaggt gatacttcgt cgcaatggcc agctggtgga 720 

acaggacggc gtggtggtgc gaaggaacat cagcgaaaac ggcgatacgg tggagacctt 780 

cttcaagtgc aatgccacct actgtcgcga gttgaccatc gatgagcgcg gagatggtgg 840 

ctttgtcatc acggaaaccg agacacacga agagaatctc tcggcctttg agagcttacc 900 

cacggaggct cccattttgc cgcccgtcca actgagatgt tggtccatag acatgactgc 960 

tgcgctggga gctctggcag tactgctcgt ggtgggggac acctggtgtg ccgttacgga 1020 

tcccctgcgc taccacagtc gcatttcggg cgtgaaaacc tggatcttca tagcactcac 1080 

ctgggtggtg ggcatactct ttggagccct ttccgccttc cgagttttgg actttgaggc 1140 

ggacgccctg ttcagcagac aacgccgcct ggcagtcacg tacttcaaca tcagcagcac 1200 

caatagtatt ttcggagtgg tgtacgccag tgtctacttc atcgtgatca tcctcctgcc 1260 

cttcggcttc gtgtgcggaa tgtactggag gatattcagt gaggcgcgcg gaaatgggct 1320 

gcgcatgcgc cagaatggct cttcgccgct gctgcagagc gccttgaatc tgacggctgg 1380 

ccagcaggcg gctcaagcca accagttctc caatagcttg tgtgttcatc ggcattccat 1440 

atcctcggcc agttcgcatg gcggcaacag cagcttgggc ttgggcggtc tgcaaatgca 1500 

aattgatcag cgccaacagc cgaggagttc gcccagttgt ctccgtcggg actcggccgc 1560 

gaaagtactt ttgcccacca tctcggatga tggtggctca gatgcggaaa gtggagccgg 1620 

agttcaactg atgcccgtac aggagcactc actgtcagat agaaatcaga acattatgct 1680 

tactctgcaa acagccagtg gtgaaataaa gaggaactac tccgccagac agctgcccct 1740 

gttgggcact tcttcgcagg atctgaggga gactaatcgt ctgcagggca tacgccaggt 1800 

gcacagttcg cccaatctcc acaagtacac ggaactgcgc caggactcgc tcagtgagga 1860 

gtgtggttct ccgcatcttt tgggtcacgc ccagcggcag cagcagcagc aattgcacct 1920 

gcaacatcag cagcaacagc atcatcacca ccagcaacat catccgcact tcagttcgcc 1980 

gcgccaccag caacacggcc acgccctcca gatacccgcc attcatgcct cccccaaggc 2040 

tctgagttac atgagctcct tgcgacaccg tttgagcaac gccagttcgc tattcaaata 2100 

ccgtgaggag tcgagagcag cccgcatcag catcctggtg gtggtgatgt ttgtggtctc 2160 

gtacctgccc ttcggactgc tggtactcct gcaatcgcga ctgtcggctg ctaactttgg 2220 

cggatcctcc cagctggcca tcttcatgat cctgctggcc aatctcagtt cgccattcat 2280 

cttcgcctac aggaacaaga gggtgaggcg aggagtcaag cggctgtttg gtctggattc 2340 

gtcgtcgggc ctgcagcgca actgcagcag cagtgtaaag accaatggca ccgcaggtcc 2400 

tgctgcaagt ggtgcacaat tgcagcgcaa cagcagcaag ttgtcgcagt acagcagcaa 2460 

cagctgcaag tatctcaccc cccagagttc cctggtcagt caagtgccgg tgcatacgac 2520 

cttgacgtta aggcccaaca gcagctgcag caccatcatc aatttcggtg gcgccagggg 2580 

ctcagcggac tccgatgagc agccaccggc aacaccacca cctactgtgg cagttgctcc 2640 

accgcccaca cgaccgcagc gccccaagct gcagaggggc atcaccatcg ttgagcacat 2700 

agctataaca ccaaccatgc cccaaaagtt ccagaatcgg cgcgccaggc tcttcgacat 2760 

gttcttccgc agctccaaga aactgcaggc gggctgccag agccagagtt tgcccactga 2820 

ggtctaaacg gggggttatg ataactatac atataggaag gtaaacaaac actaaacttg 2880 

cttgctgaag aattctaaga acagtattcg taatttgtaa gtctagcaat taacaaagac 2940 

aactgtttga atttgagttt taagaccaga tcatgctgtc ctagaagctt ctcatgccaa 3000 

aaaacaaaac cacaaagaaa gtcaaactag aaagtttgag atgctttaaa t 3051 

<210> 51 
<211> 798 
<212> PRT 
<213> Drosophila 

<400> 51 

Met Ala lie Asp Leu Leu lie Leu Ala Leu Leu Leu Val Ser Phe Leu 

15 10 15 

He Asn Leu Leu Ala Leu Cys Ala Phe Trp He Thr Pro Gly Leu Arg 
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20 




















JU 






Thr 


Thr 


Ala 


Asn 


Arg 


Phe 


Thr 


lie 


Asn 


Leu 


Leu 


Ala 


lie 


Asn 


Leu 


lie 






3 3 










a n 










A C 








Gly Cys 


Cys 


He 


Leu 


Ala 


Pro 


Thr 


Leu 


Phe 


Leu 


Gly 


Leu 


Pro 


Gly 


Lys 




50 










55 










oil 










Ser 


Ala 


Glu 


Ala 


Ser 


Thr 


Ser 


Asn 


Ala 


Glu 


Thr 


Leu 


Glu 


Phe 


Phe 


Ser 


65 










70 










75 










O A 
OU 


Lys 


Pro 


Gly 


Asn 


His 


Gin 


Val 


He 


Leu 


Arg 


Arg 


Asn 


Gly 


Gin 


Leu 


Val 










85 










90 










95 




Glu 


Gin 


Asp 


Gly 


Val 


Val 


Val 


Arg 


Arg 


Asn 


He 


Ser 


Glu 


Asn 


Gly 


Asp 








100 










105 










110 






Thr 


Val 


Glu 


Thr 


Phe 


Phe 


Lys 


Cys 


Asn 


Ala 


Thr 


Tyr 


Cys 


Arg 


Glu 


Leu 






115 










120 










125 








Thr 


He 


Asp 


Glu 


Arg 


Gly 


Asp 


Gly 


Gly 


Phe 


Val 


He 


Thr 


Glu 


Thr 


Glu 




130 










135 










140 










Thr 


His 


Glu 


Glu 


Asn 


Leu 


Ser 


Ala 


Phe 


Glu 


Ser 


Leu 


Pro 


Thr 


Glu 


Ala 


145 










150 










155 










160 


Pro 


He 


Leu 


Pro 


Pro 


Val 


Gin 


Leu 


Arg 


Cys 


Trp 


Ser 


He Asp 


Met 


Thr 










165 










170 










175 




Ala 


Ala 


Leu 


Gly 


Ala 


Leu 


Ala 


Val 


Leu 


Leu 


Val 


Val 


Gly Asp 


Thr 


Trp 








180 










185 










190 






Cys 


Ala 


Val 


Thr 


Asp 


Pro 


Leu 


Arg 


Tyr 


His 


Ser 


Arg 


He 


Ser 


Gly 


Val 






195 










200 










205 








Lys 


Thr 


Trp 


He 


Phe 


He 


Ala 


Leu 


Thr 


Trp 


Val 


Val 


Gly 


He 


Leu 


Phe 




210 










215 










220 










Gly Ala 


Leu 


Ser 


Ala 


Phe 


Arg 


Val 


Leu 


Asp 


Phe 


Glu 


Ala 


Asp 


Ala 


Leu 


225 










230 










235 










240 


Phe 


Ser 


Arg 


Gin 


Arg 


Arg 


Leu 


Ala 


Val 


Thr 


Tyr 


Phe 


Asn 


He 


Ser 


Ser 










245 










250 










255 




Thr 


Asn 


Ser 


He 


Phe 


Gly 


Val 


Val 


Tyr 


Ala 


Ser 


Val 


Tyr 


Phe 


He 


Val 








260 










265 










270 






He 


He 


Leu 


Leu 


Pro 


Phe 


Gly 


Phe 


Val 


Cys 


Gly Met 


Tyr 


Trp 


Arg 


He 






275 










280 










285 








Phe 


Ser 


Glu 


Ala 


Arg 


Gly 


Asn 


Gly 


Leu 


Arg 


Met 


Arg 


Gin 


Asn 


Gly 


Ser 




290 










295 










300 










Ser 


Pro 


Leu 


Leu 


Gin 


Ser 


Ala 


Leu 


Asn 


Leu 


Thr 


Ala 


Gly 


Gin 


Gin 


Ala 


305 










310 










315 










320 


Ala 


Gin 


Ala 


Asn 


Gin 


Phe 


Ser 


Asn 


Ser 


Leu 


Cys 


Val 


His 


Arg 


His 


Ser 










325 










330 










335 




He 


Ser 


Ser 


Ala 


Ser 


Ser 


His 


Gly 


Gly 


Asn 


Ser 


Ser 


Leu 


Gly 


Leu 


Gly 








340 










345 










350 






Gly 


Leu 


Gin 


Met 


Gin 


He 


Asp 


Gin 


Arg 


Gin 


Gin 


Pro 


Arg 


Ser 


Ser 


Pro 






355 










360 










365 








Ser 


Cys 


Leu 


Arg 


Arg 


Asp 


Ser 


Ala 


Ala 


Lys 


Val 


Leu 


Leu 


Pro 


Thr 


He 




370 










375 










380 










Ser 


Asp 


Asp 


Gly 


Gly 


Ser 


Asp 


Ala 


Glu 


Ser 


Gly Ala 


Gly Val 


Gin 


Leu 


385 










390 










395 










4 UU 


Met 


Pro 


Val 


Gin 


Glu 


His 


Ser 


Leu 


Ser 


Asp Arg 


Asn 


Gin 


Asn 


He 


Met 










405 










410 










4 lt> 




Leu 


Thr 


Leu 


Gin 


Thr 


Ala 


Ser 


Gly 


Glu 


He 


Lys 


Arg 


Asn 


Tyr 


Ser 


Ala 








420 










425 










430 






Arg 


Gin 


Leu 


Pro 


Leu 


Leu 


Gly 


Thr 


Ser 


Ser 


Gin 


Asp 


Leu 


Arg 


Glu 


Thr 






435 










440 










445 








Asn 


Arg 


Leu 


Gin 


Gly 


He 


Arg 


Gin 


Val 


His 


Ser 


Ser 


Pro 


Asn 


Leu 


His 




450 










455 










4 60 










Lys 


Tyr 


Thr 


Glu 


Leu 


Arg 


Gin 


Asp 


Ser 


Leu 


Ser 


Glu 


Glu 


Cys 


Gly 


Ser 


465 










470 










475 










480 


Pro 


His 


Leu 


Leu 


Gly 


His 


Ala 


Gin 


Arg 


Gin 


Gin 


Gin 


Gin 


Gin 


Leu 


His 










485 










490 










495 




Leu 


Gin 


His 


Gin 


Gin 


Gin 


Gin 


His 


His 


His 


His 


Gin 


Gin 


His 


His 


Pro 
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500 










505 










510 






His 


Phe 


Ser 


Ser 


Pro 


Arg 


His 


Gin 


Gin 


His 


Gly 


His 


Ala 


Leu 


Gin 


He 






515 










520 










525 








Pro 


Ala 


He 


His 


Ala 


Ser 


Pro 


Lys 


Ala 


Leu 


Ser 


Tyr 


Met 


Ser 


Ser 


Leu 




530 










535 










540 










Arg 


His 


Arg 


Leu 


Ser 


Asn 


Ala 


Ser 


Ser 


Leu 


Phe 


Lys 


Tyr 


Arg 


Glu 


Glu 


545 










550 










555 










560 


Ser 


Arg 


Ala 


Ala 


Arg 


He 


Ser 


He 


Leu 


Val 


Val 


Val 


Met 


Phe 


Val 


Val 










565 










570 










575 




Ser 


Tyr 


Leu 


Pro 


Phe 


Gly 


Leu 


Leu 


Val 


Leu 


Leu 


Gin 


Ser 


Arg 


Leu 


Ser 








580 










585 










590 






Ala 


Ala 


Asn 


Phe 


Gly 


Gly 


Ser 


Ser 


Gin 


Leu 


Ala 


He 


Phe 


Met 


He 


Leu 






595 










600 










605 








Leu 


Ala 


Asn 


Leu 


Ser 


Ser 


Pro 


Phe 


He 


Phe 


Ala 


Tyr 


Arg 


Asn 


Lys 


Arg 




610 










615 










620 










Val 


Arg 


Arg 


Gly 


Val 


Lys 


Arg 


Leu 


Phe 


Gly 


Leu 


Asp 


Ser 


Ser 


Ser 


Gly 


625 










630 










635 










640 


Leu 


Gin 


Arg 


Asn 


Cys 


Ser 


Ser 


Ser 


Val 


Lys 


Thr 


Asn 


Gly 


Thr 


Ala 


Gly 










645 










650 










655 




Pro 


Ala 


Ala 


Ser 


Gly 


Ala 


Gin 


Leu 


Gin 


Arg 


Asn 


Ser 


Ser 


Lys 


Leu 


Ser 








660 










665 










670 






Gin 


Tyr 


Ser 


Ser 


Asn 


Ser 


Cys 


Lys 


Tyr 


Leu 


Thr 


Pro 


Gin 


Ser 


Ser 


Leu 






675 










680 










685 








Val 


Ser 


Gin 


Val 


Pro 


Val 


His 


Thr 


Thr 


Leu 


Thr 


Leu 


Arg 


Pro 


Asn 


Ser 




690 










695 










700 










Ser 


Cys 


Ser 


Thr 


He 


He 


Asn 


Phe 


Gly 


Gly 


Ala 


Arg 


Gly 


Ser 


Ala 


Asp 


705 










710 










715 










720 


Ser 


Asp 


Glu 


Gin 


Pro 


Pro 


Ala 


Thr 


Pro 


Pro 


Pro 


Thr 


Val 


Ala 


Val 


Ala 










725 










730 










735 




Pro 


Pro 


Pro 


Thr 


Arg 


Pro 


Gin 


Arg 


Pro 


Lys 


Leu 


Gin 


Arg 


Gly 


He 


Thr 








740 










745 










750 






He 


Val 


Glu 


His 


He 


Ala 


He 


Thr 


Pro 


Thr 


Met 


Pro 


Gin 


Lys 


Phe 


Gin 






755 










760 










765 








Asn 


Arg 


Arg 


Ala 


Arg 


Leu 


Phe 


Asp 


Met 


Phe 


Phe 


Arg 


Ser 


Ser 


Lys 


Lys 




770 










775 










780 










Leu 


Gin 


Ala 


Gly 


Cys 


Gin 


Ser 


Gin 


Ser 


Leu 


Pro 


Thr 


Glu 


Val 







785 790 795 



<210> 52 
<211> 3919 
<212> DNA 
<213> Drosophila 



<400> 52 

attctattgt 

aatgtaatta 

ggtcggtttg 

gtcgcctaat 

aatttatagt 

aaaacagata 

attaagacag 

gtatatgtat 

cgaatctagt 

gagaaaatta 

gcacacaatt 

tgtagatttc 

aacaaacaac 

ttttatatgt 

ttctatggaa 



cgtaggcgtg 
atatgaaaag 
ttggcgtaag 
tttgatcaaa 
caatttatct 
cgagcttcct 
acgaacatta 
ttactttaaa 
aaaccctttt 
attgtataat 
tttttttttt 
ttggaaatct 
aaaaacaaaa 
atgtatgtac 
tttccaattt 



gcaaaaggtt 
gtaacaaaaa 
ggcattatcg 
ctataactaa 
ttttattatt 
tagaagcgga 
ggacaaggcc 
ggggcgtaaa 
aatttacaag 
ataatattat 
ttttggaaac 
caatgaattt 
cacacaaaag 
atacatatac 
aagctgcaat 



ttttggtaaa 
attttgcaaa 
gtattttcct 
aattgtatat 
tcttatctaa 
cggaaataat 
aggtctactc 
cttgtatttt 
taacgaatat 
ctttgttaat 
tgaatcagtt 
tatggtcaag 
caaacaaatt 
aaagaataat 
acatgtacat 



tcgatagaaa 
agtgtgggca 
attttcagct 
tgtcagaaac 
agttatatgc 
gtctcgagat 
agctagggat 
acatgttaca 
aaaaagttga 
ccagagtctt 
aattgtgcca 
ttccaagaag 
atgcagaatt 
aaatgggtgg 
tttaacgtct 



tgtacaagag 
tggcagtttt 
atatgatgca 
aacgagcatg 
atatagaata 
ctggaccttc 
cctgatcaat 
tactttcaag 
atggtagcgg 
ggcaacttgt 
aaatttagag 
tgaaatgctt 
atcggatgta 
gtctgttggc 
ttctgtacgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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tctcaaataa 
tcccatatga 
tggatcgttc 
ttgcgacttc 
cctctacgaa 
ggccgacgat 
gccatgcatt 
cgacatgtcg 
cgggtcggtg 
gcccggatgt 
gtttgaggtt 
ctgtagaacg 
ctgcgtccag 
actttcgtct 
ctttaaggtt 
atcctaacga 
ttcatctgct 
aaatactcct 
acgaccaatt 
ccgcgttctt 
ccagctggtt 
ggggcatggc 
agaagttgag 
tcctataata 
ctgtcatgat 
ttgtcctgac 
agcaaaaaac 
tatacagttc 
cgccctattc 
gtttttgttg 
ttggtcacag 
actgaaggaa 
atctggctga 
tcacttcaga 
cagtttatta 
aatgatcgtg 
atgggctacg 
taagcatatc 
tgcaataata 
tactacatgg 
gtctattatt 
gtatgctcac 
aattcaaaga 
aaaaatttct 
aacacagttt 
ttaaaaaaaa 
tagaactgaa 
gatactcatt 
tggcaagaaa 
ctgatttggt 
gaaatatata 



aaaaacacaa 
agaaaaatat 
accgcatttc 
ttcgacaccg 
ggcttgctga 
tcgaaggaga 
cggttttgtt 
gaggacgagc 
gtcaggagac 
ccccggatgt 
actttcctgt 
gctctcttct 
catctttcat 
gagcactcca 
tttagttaat 
tcagcgtgta 
acttggcctc 
ccggcttctg 
tcacatttcg 
ttggctttct 
gaaccgtttc 
gctgctcctg 
accccgcgtg 
acaattgcgg 
ctacttctat 
ggcttttcgt 
aaccaatagg 
taaatccaaa 
ctgcgactct 
agcaaaaatc 
ggcaccgtca 
cggtaagtta 
gcaggattaa 
atacgaaaat 
gttcatttgg 
accaaaggct 
aatatttgct 
taagttttaa 
atctattcca 
ggatatgtac 
atggatggag 
tcgtgataat 
actacaaatg 
gaatatttat 
taatttatca 
aatggaatat 
cgatagagaa 
tctcagctag 
aaataataaa 
cggttagtgg 
tatatatat 



ttttaggatt 
tcatatgatt 
ttttgctgtt 
tagatatttc 
tccccgccca 
aggtggcgag 
gcccccagta 
tgaacaagca 
acttcaagga 
actttctgaa 
gtttctgtca 
gcgtcactgg 
ttaaggatga 
gaacgtggaa 
gtccattttc 
cctctatgtt 
cttgttcctg 
cgttacagca 
atatcatgat 
gtcatcagtc 
ttgccacaga 
actgcaatca 
ggcgtcggca 
tgtaactcgt 
ggcccgatgc 
ataatgaaag 
ctcaattcgg 
agtttaaatt 
ttatcatcat 
aggcttgggc 
tatttctgct 
tcggttcttc 
gggtgggagg 
tgaccccagt 
tgttggaaac 
gctttgcttg 
acttatcact 
ttaaatgttg 
atccatccta 
gaatcttata 
gataaaaact 
aatttcgtaa 
attatattat 
aaacaacttt 
agcacatcgt 
tggtttacat 
aattacaaaa 
ttctttcatt 
atacaacaaa 
acgtgtcaat 



aataaaaaaa 
ttttgtttgg 
attgcaaaac 
aaaagcgcca 
tttgacagct 
ccacgtacga 
ccaaaagatg 
cgatcccttc 
ggatctgatc 
tcacgaactg 
cttctgaaga 
gacaaagtgg 
tagtatccga 
gaccgttggt 
acagcgatag 
gagaagcttc 
ggatatttct 
ggtttgcttg 
cagggttcct 
ttaccctctg 
atcggttcct 
cctatatagc 
aaaattgttg 
ttttttttta 
tactgataat 
tgaagaagga 
acaagcagac 
ttaaattatt 
gggcttgtcc 
caaggctttt 
atttgttctg 
cataggaggt 
gatgaagccg 
gttttttaat 
cttctatttt 
ctcattattg 
tttcaagtat 
aatgtaattg 
gacgtgccat 
ggcatgaata 
caggaaaata 
taattcgtac 
gactttttat 
aaaatgttat 
taaaaaacaa 
taacggtatg 
aaacgatcta 
taaaaatttc 
ttaatgcatt 
attatgaagc 



cagatatgta 
caggaaatgc 
tcaaatgccg 
agattctcga 
gaatatgact 
ggatgtgcct 
caaaagagca 
gtgaacgtga 
gtgcagtcgg 
ccgggcaatg 
tcgtaacctt 
aactgagcaa 
attgcacccc 
aagctgaatt 
tgatatcctt 
gcaacctgca 
tcctggtcct 
tggcgattac 
gggctacttc 
gaattcattt 
ttcttacaat 
tgaccaggtg 
gatctacagt 
accaacagct 
tgttttcaac 
agcacaaaac 
gtaataattt 
ttctatggct 
tggagcttgg 
atggtggctg 
aggcccagca 
ccaattaacc 
gcgccagcga 
atatgggtaa 
gaatcactaa 
ctcacatatc 
cttgatttct 
tcctttaagt 
caaagcgaga 
ctcggcattt 
cttttcgcat 
tgtttttact 
gacaatgact 
taacttagga 
taaattactt 
caaaaaatca 
gtcgagttcc 
tctttcgttt 
tttgaaaatt 
aaacgtggac 



cgtcttctat 
gaatagttat 
aaattcccgg 
acggatcgta 
acaagctcct 
gccacctcag 
agtgctacgg 
cgctcagcga 
atctggccaa 
aattcactct 
ttttgaatct 
gccgggagta 
acttctgtcc 
ccctgcttga 
gatatgcata 
cggaaagtgt 
caacgtgtgg 
aagataaagg 
tccgtcatcg 
agcggcaact 
ttatacgcct 
gttaagaacg 
aagttactat 
ggggatatga 
ataacaatgt 
tttactcaac 
taccatattt 
tttaaagtta 
aaataatctc 
attacttcaa 
cactgaagct 
tggttctatc 
tgaacatatc 
tgtgaaaata 
caggcagcct 
gtatgctcat 
gcacggtgtc 
tataactttt 
tcaaagcaac 
atataccctg 
cagcacttat 
gcctcaactt 
aactaaaaat 
taataataat 
gaatataaat 
taaaaagtca 
tcgactataa 
cgatagtaat 
atgggcgtga 
actacaaact 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3919 



<210> 53 
<211> 1547 
<212> DNA 
<213> Drosophila 



<400> 53 

caggaaatgc gaatagttat tggatcgttc accgcatttc ttttgctgtt attgcaaaac 60 
tcaaatgccg aaattcccgg ttgcgacttc ttcgacaccg tagatatttc aaaagcgcca 120 
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cctctacgaa ggcttgctga tccccgccca tttgacagct 180 

ggccgacgat tcgaaggaga aggtggcgag ccacgtacga 240 

gccatgcatt cggttttgtt gcccccagta ccaaaagatg 300 

cgacatgtcg gaggacgagc tgaacaagca cgatcccttc 360 

cgggtcggtg gtcaggagac acttcaagga ggatctgatc 420 

gcccggatgt ccccggatgt actttctgaa tcacgaactg 480 

gtttgagaac ggctctcttc tgcgtcactg ggacaaagtg 540 

actgcgtcca gcatctttca tttaaggatg atagtatccg 600 

cactttcgtc tgagcactcc agaattaatg tccattttca 660 

atatgcataa tcctaacgat cagcgtgtac ctctatgttg 720 

ggaaagtgtt tcatctgcta cttggcctcc ttgttcctgg 780 

aacgtgtgga aatactcctc cggcttctgc gttacagcag 840 

gtcatcgccg cgttcttttg gctttctgtc atcagtctta 900 

ggcaactcca gctggttgaa ccgtttcttg ccacagaatc 960 

tacgcctggg gcatggcgct gctcctgact gcaatcacct 1020 

aagaacgaga agttgagacc ccgcgtgggc gtcggcaaaa 1080 

gatatgactg tcatgatcta cttctatggc ccgatgctac 1140 

acaatgtttg tcctgacggc ttttcgtata atgaaagtga 1200 

actcaacagc aaaaaacaac caataggctc aattcggaca 1260 

ctgcgactct ttatcatcat gggcttgtcc tggagcttgg 1320 

agcaaaaatc aggcttgggc caaggctttt atggtggctg 1380 

ggcaccgtca tatttctgct atttgttctg aggcccagca 1440 

cggattaagg gtgggaggga tgaagccggc gccagcgatg 1500 

acgaaaattg accccagtgt tttttaa 1547 



<400> 54 



Met 


Arg 


He 


Val 


He 


Gly 


Ser 


Phe 


Thr 


Ala 


Phe 


Leu 


Leu 


Leu 


Leu 


Leu 


1 






5 










10 










15 




Gin 


Asn 


Ser 


Asn 
20 


Ala 


Glu 


He 


Pro 


Gly 
25 


Cys 


Asp 


Phe 


Phe 


Asp 
30 


Thr 


Val 


Asp 


lie 


Ser 
35 


Lys 


Ala 


Pro 


Arg 


Phe 
40 


Ser 


Asn 


Gly 


Ser 


Tyr 
45 


Leu 


Tyr 


Glu 


Gly 


Leu 


Leu 


He 


Pro 


Ala 


His 


Leu 


Thr 


Ala 


Glu 


Tyr 


Asp 


Tyr 


Lys 


Leu 


50 










55 










60 










Leu 


Ala 


Asp 


Asp 


Ser 


Lys 


Glu 


Lys 


Val 


Ala 


Ser 


His 


Val 


Arg 


Gly 


Cys 


65 










70 










75 










80 


Ala 


Cys 


His 


Leu 


Arg 


Pro 


Cys 


He 


Arg 


Phe 


Cys 


Cys 


Pro 


Gin 


Tyr 


Gin 








85 










90 










95 




Lys 


Met 


Gin 


Lys 
100 


Ser 


Lys 


Cys 


Tyr 


Gly 
105 


Asp 


Met 


Ser 


Glu 


Asp 
110 


Glu 


Leu 


Asn 


Lys 


His 


Asp 


Pro 


Phe 


Val 


Asn 


Val 


Thr 


Leu 


Ser 


Asp 


Gly 


Ser 


Val 




115 










120 










125 








Val 


Arg 
130 


Arg 


His 


Phe 


Lys 


Glu 
135 


Asp 


Leu 


He 


Val 


Gin 
140 


Ser 


Asp 


Leu 


Ala 


Lys 


Pro 


Gly 


Cys 


Pro 


Arg 


Met 


Tyr 


Phe 


Leu 


Asn 


His 


Glu 


Leu 


Pro 


Gly 


145 










150 










155 










160 


Asn 


Glu 


Phe 


Thr 


Leu 
165 


Phe 


Glu 


Asn 


Gly 


Ser 
170 


Leu 


Leu 


Arg 


His 


Trp 

175 


Asp 


Lys 


Val 


Glu 


Leu 
180 


Ser 


Lys 


Pro 


Gly 


Val 
185 


Leu 


Arg 


Pro 


Ala 


Ser 
190 


Phe 


He 



agattctcga 
gaatatgact 
ggatgtgcct 
caaaagagca 
gtgaacgtga 
gtgcagtcgg 
ccgggcaatg 
gaactgagca 
aattgcaccc 
cagcgatagt 
agaagcttcg 
gatatttctt 
ggttcctggg 
ccctctggaa 
ggttcctttc 
atatagctga 
attgttggat 
tgataattgt 
agaaggaagc 
agcagactta 
aaataatctc 
attacttcaa 
cactgaagct 
aacatatctc 

<210> 54 
<211> 192 
<212> PRT 
<213> Drosi 



acggatcgta 
acaagctcct 
gccacctcag 
agtgctacgg 
cgctcagcga 
atctggccaa 
aattcactct 
agccgggagt 
cacttctgtc 
gatatccttg 
caacctgcac 
cctggtcctc 
ctacttctcc 
ttcatttagc 
ttacaattta 
ccaggtggtt 
ctacactggg 
tttcaacata 
acaaaacttt 
cgccctattc 
gtttttgttg 
ttggtcacag 
actgaaggaa 
acttcagaat 



iphila 



<210> 55 
<211> 3640 
<212> DNA 
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<213> Drosophila 
<400> 55 

aattaatgca tttttgaaaa ttatgggcgt gactgatttg gtcggttagt ggacgtgtca 60 
atattatgaa gcaaacgtgg acactacaaa ctgaaatata tatatatata tcatttcatt 120 
gcccttaagt ctgaaaaacg gctttaaaat tgagaagtta ctttaagaga tattcgaggt 180 
attcggtgac agcagcgcac attatcgact ggccaatggg aatattagat ttgtagaata 240 
gggtaatatg taaacgtaaa cctttcgatt gtataaagca ttcaattcct aatat'ggata 300 
tgtccgttca ccagcctgta ttaactttgc gtgttattac ttaaatttat ctttctatgt 360 
acatgccaga aactgataga atggcacggt taccgccgat agtcgctgag tatggggtat 420 
cttatagtcg agcacattct cgttctccct tgacatcgca tatatcggtt gagttgttcg 480 
cattttggga tcagtcatct taaaactctg gtcgcataaa catgtttttt gtgttagtcg 540 
ttccatttga cccttgcgta agtatttata ttcaaagttt ttacgatttt ggatcgttaa 600 
gcataagaag gaccactcga gtacgatttg accgttttta tacaatttcg tcctcttaga 660 
cgaataaata ttgtgcctgc cttgtttcta aaaatgaact tgcttcagct tcgtagagaa 720 
tgcttagtca taacagatgg atttgttgaa tttattttta acagttgata agttatactt 780 
ttataaaaaa aaaaaaacaa cttactcaga atgctatagc acatgactca attttttgtt 840 
tttcataagt atttagatca gttttcagtt ttgcggaggt tattttgcta ctttatttct 900 
aattgtgaac acgtaacttg gtttaaaacc gtatatactt acagaatagc atacatcatt 960 
tgaattcagt tgcctatcat attaatttcc tttgcagaaa atgcggatat tgttaattgc 1020 
agttcttttt ctgttgatgc caaaatcaaa tgccgaaatt cccggttgcg acttcttcga 1080 
caccgtagat atttcaaaag cgccaagatt ctcgaacgga tcgtacctct acgaaggctt 1140 
gctgatcccc gcccatttga cagctgaata tgactacaag ctcctggccg acgattcgaa 1200 
ggagaaggtg gcgagccacg tacgaggatg tgcctgccac ctcaggccat gcatccggtt 1260 
ttgttgcccc cagtaccaaa agatgcaaaa gagcaagtgc tatggcgaca tgtcggagga 1320 
cgagctgaac aagcacgatc ccttcgtgaa cgtgacgctc agcgacgggt cggtggtcag 1380 
gagacacttc aaggaggatc tgatcgtgca gtcggatctg gccaagcccg gatgtccccg 1440 
gatgtacttt ttgaatcacg aactgccggg caatgaattc actctgtttg aggttacttt 1500 
cctgtgtttc tgtcacttct gaagatcgta accttttttg aatctctgta gaacggctct 1560 
cttctgcgtc actgggacaa agtggaactg agcaagcggg agtactgcgt ccagcatctt 1620 
tcatttaagg atgatagtat ccgaattgca ccccacttct gtccactttc gtctgagcac 1680 
tccagaacgt ggaagaccgt tggtaagctg aattccctgc ttgactttaa ggtttttagt 1740 
taatgtcctt tctcacagcg atagtgatat ccttgatatg cataatccta acgatcagcg 1800 
tgtacctcta tgtcgagaag cttcgcaacc tgcacggaaa gtgtttcatc tgctacttgg 1860 
cctctttgtt cttgggctat ttcttcctgg tcctcaacgt gtggaaatac tcctccggct 1920 
tctgcgttac agcaggtttg cttgtggcga ttacaagata aaggacgact aatttcacat 1980 
ttcgatatca tgatcagggt tcctgggcta cttctccgtc atggccgcgt tcttttggct 2040 
ttctgtgatt ggaatccact tgcggattaa gtttagtcta gcctctaact gtttgcaccg 2100 
attgctgcca gaaaatccgt tccgtgcata caacttatac gcctggggta taccgctaat 2160 
catgactgca atcacctata cagctgacca ggtggttaag aacgagaagt tgagaccccg 2220 
cgtgggcgtc ggcaaaaatt gttggatcta cagtaagtta ctattcctat aattacaatt 2280 
acggtgtaac tcgttttgtt aaccaacagc tggcgatatg actgtcatga tctacttcta 2340 
tggcccgatg ctgttgctaa tcgctttcaa cataataatg tttgtccttt cggcgattta 2400 
catatataac ataaagaaaa atgtgaaagg ccttgtccac aagcaacaaa caaaccaaca 24 60 
gataaatgac caacaaatgt aattatatta tattataaga taataactat aatgtcaaat 2520 
ggcacttttt gtaatctttt ataggtttgc tatattcctg cgacttttca tcttaatggg 2580 
tttgtcgtgg agctttgaga tattatcctt tttgttaacc aaacagcaag cttgggctag 2640 
ggctttaatg gtggctgact actttaattg gtcccagggt accatcatat tcgtgctttt 2700 
tattttaaag cctagcattc taaaacttat aatagcaggg taagtaaagt attacaaata 2760 
accttgaaat ttgaccgtct catacgtatg aacaatcttt tgatttaggg gacgtcaaaa 2820 
cctcccagga agtcaccata attcgagatc aaaagcagct cgatataatt caactcatac 2880 
ggcttgtgaa ggatcaattg cggatccaaa cgcttactgc taaaaaaatg gaattcaata 2940 
agatgcaacg tctctgttat tgatcacatt attttaattt tctatggagc aaaagtttaa 3000 
aaaaaatatt tgttatcaag acaataaata atgtcttgga acatataaag caaaatatat 3060 
taaagggtac aaaaaacagg aaaagtatgg ggcaatacag caaaatattt ttcggatcat 3120 
caactttttt gtaagacaaa atgtagaaat atttctggca ccgttgtttc aaaccagggt 3180 
ttgagtcaac aatgctggcc caccactggc aagtaccaaa cgattacaga aaactttttc 3240 
taatcgatta agcatcgggg gtcatcgatg tttctccacc actacaattt catgttgttt 3300 
ttttagtcgc ggctgtttga cgtttgtcgc tttcctccgt tgcgaatata atatattacg 3360 
tagctcattt ttatacaaac ggaattacga gcgcaacgac gacagcaaca ctagtagcac 3420 
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taatcgtaag cgcaggggcc aaaaattaaa ttgcgtttgc ggccgcaaag atttgatgac 3480 

gtcgcatacg ccgtcttcta gggcgtaaaa agcaaagcaa agcaaacaaa cgcgaaagcg 3540 

aaacgtgtaa acggcgtaga agcgataaac gcgactcaaa tacgcagcag ataaaataca 3600 

atacgcgaga agagaaaagt cacgggaaat attgttcata 3640 



<210> 56 
<211> 1089 
<212> DNA 
<213> Drosophila 



<400> 56 

atgcggatat 

cccggttgcg 

tcgtacctct 

ctcctggccg 

ctcaggccat 

tatggcgaca 

agcgacgggt 

gccaagcccg 

actctgtttg 

gagtactgcg 

tgtccacttt 

atatgcataa 

ggaaagtgtt 

aacgtgtgga 

gtcatggccg 

ctagcctcta 

tacgcctggg 

aagaacgaga 

ttgggctag 



tgttaattgc 
acttct tcga 
acgaaggctt 
acgattcgaa 
gcatccggtt 
tgtcggagga 
cggtggtcag 
gatgtccccg 
agaacggctc 
tccagcatct 
cgtctgagca 
tcctaacgat 
tcatctgcta 
aatactcctc 
cgttcttttg 
actgtttgca 
gtataccgct 
agttgagacc 



agttcttttt 
caccgtagat 
gctgatcccc 
ggagaaggtg 
ttgttgcccc 
cgagctgaac 
gagacacttc 
gatgtacttt 
tcttctgcgt 
ttcatttaag 
ctccagaacg 
cagcgtgtac 
cttggcctct 
cggcttctgc 
gctttctgtg 
ccgattgctg 
aatcatgact 
ccgcgtgggc 



ctgttgatgc 
atttcaaaag 
gcccatttga 
gcgagccacg 
cagtaccaaa 
aagcacgatc 
aaggaggatc 
ttgaatcacg 
cactgggaca 
gatgatagta 
tggaagaccg 
ctctatgtcg 
ttgttcttgg 
gttacagcag 
attggaatcc 
ccagaaaatc 
gcaatcacct 
gtcggcaaaa 



caaaatcaaa 
cgccaagatt 
cagctgaata 
tacgaggatg 
agatgcaaaa 
ccttcgtgaa 
tgatcgtgca 
aactgccggg 
aagtggaact 
tccgaattgc 
ttgcgatagt 
agaagcttcg 
gctatttctt 
ggttcctggg 
acttgcggat 
cgttccgtgc 
atacagctga 
attgttggat 



tgccgaaatt 
ctcgaacgga 
tgactacaag 
tgcctgccac 
gagcaagtgc 
cgtgacgctc 
gtcggatctg 
caatgaattc 
gagcaagcgg 
accccacttc 
gatatccttg 
caacctgcac 
cctggtcctc 
ctacttctcc 
taagtttagt 
atacaactta 
ccaggtggtt 
ctacacaagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1089 



<210> 57 
<211> 362 
<212> PRT 
<213> Drosophila 



<400> 57 



Met 


Arg 


He 


Leu 


1 








Asn 


Ala 


Glu 


He 








20 


Lys 


Ala 


Pro 


Arg 






35 




He 


Pro 


Ala 


His 




50 






Asp 


Ser 


Lys 


Glu 


65 








Leu 


Arg 


Pro 


Cys 


Lys 


Ser 


Lys 


Cys 








100 


Asp 


Pro 


Phe 


Val 






115 




His 


Phe 


Lys 


Glu 




130 






Cys 


Pro 


Arg 


Met 


145 








Thr 


Leu 


Phe 


Glu 


Leu 


Ser 


Lys 


Arg 



Leu He Ala Val 
5 

Pro Gly Cys Asp 

Phe Ser Asn Gly 
40 

Leu Thr Ala Glu 
55 

Lys Val Ala Ser 
70 

He Arg Phe Cys 
85 

Tyr Gly Asp Met 

Asn Val Thr Leu 
120 

Asp Leu lie Val 
135 

Tyr Phe Leu Asn 
150 

Asn Gly Ser Leu 
165 

Glu Tyr Cys Val 



Leu Phe Leu Leu 
10 

Phe Phe Asp Thr 
25 

Ser Tyr Leu Tyr 

Tyr Asp Tyr Lys 
60 

His Val Arg Gly 
75 

Cys Pro Gin Tyr 
90 

Ser Glu Asp Glu 
105 

Ser Asp Gly Ser 

Gin Ser Asp Leu 
140 

His Glu Leu Pro 
155 

Leu Arg His Trp 
170 

Gin His Leu Ser 



Met Pro Lys Ser 
15 

Val Asp He Ser 
30 

Glu Gly Leu Leu 
45 

Leu Leu Ala Asp 

Cys Ala Cys His 
80 

Gin Lys Met Gin 
95 

Leu Asn Lys His 
110 

Val Val Arg Arg 
125 

Ala Lys Pro Gly 

Gly Asn Glu Phe 
160 

Asp Lys Val Glu 
175 

Phe Lys Asp Asp 
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T 1 fa 
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215 




















Leu 
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Ser 
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Tyr 


Leu 


T w r" 

i yr 


Val 
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Lys 


Leu 


Arg 


Asn 


Leu 


Ml S 
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Gly 


Lys 


Cys 
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Tyr 


Leu 
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Leu 
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Leu 


Gly 


Tyr 
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L. fi 3 




















£. O 0 




Phe 


Leu 


1 7 — T 
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Leu 


Asn 


17 1 

va J. 
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Lys 


Tyr 


ber 


Ser 


Gly 


rne 


Cys 


vai 


1 rir 








i bU 




















o "7 n 






Ala 


Gly 


Phe 


Leu 


Gly 


Tyr 


rne 


ber 


val 


Met 


Ala 


Ala 


rne 


Phe 


Trp 


Leu 






Z ID 




























Ser 


Val 


lie 


Gly 


lie 


His 


Leu 


Arg 


lie 


Lys 


Phe 


Ser 


Leu 


Ala 


Ser 


Asn 


































Cys 


Leu 


His 


Arg 


Leu 


Leu 


Pro 


Glu 


Asn 


Pro 


Phe 


Arg 


Ala 


Tyr 


Asn 


Leu 


305 










310 










315 










320 


Tyr 


Ala 


Trp 


Gly 


He 


Pro 


Leu 


He 


Met 


Thr 


Ala 


He 


Thr 


Tyr 


Thr 


Ala 










325 










330 










335 




Asp 


Gin 


Val 


Val 


Lys 


Asn 


Glu 


Lys 


Leu 


Arg 


Pro 


Arg 


Val 


Gly 


Val 


Gly 








340 










345 










350 






Lys 


Asn 


Cys 


Trp 


He 


Tyr 


Thr 


Ser 


Leu 


Gly 















355 360 



<210> 58 
<211> 7503 
<212> DNA 
<213> Drosophila 



<400> 58 

gattgeegtg 

aaaaggtttg 

gageggcate 

cattccacgt 

cacgagaaca 

aaactctgea 

ggcatttgat 

gcatcttggc 

tgtgcccaag 

gggtgttgat 

egggcagtaa 

gggaggcagc 

aggtcctgac 

ctggcatcat 

tccggcgcat 

ttgetcatta 

ctgaaagctc 

atttatttcc 

ttcacagggt 

caactggatc 

caagtactcg 

gcggccccga 

cgtctcgccc 

gccgcgctgc 

ggcgtgctgg 

gggcggcgga 

cggttttggc 

ggtgttcggg 

ggcgtagttc 



gtcgtgcgcc 
gacggccctt 
cgctggccaa 
gaaaggacac 
acaaggtctt 
cggcacagcc 
aatatcatat 
gacgattgtc 
ccgattccct 
atacttgaaa 
gatctggcaa 
cgtctcgcta 
caaggcgcca 
accacaggaa 
tgctctgcaa 
aaataaatta 
atgccagtga 
caactatctc 
atgtatgtag 
taacaatagg 
aegatatege 
tagttgggac 
tcgagcagtt 
ctgctccgct 
atgttagagc 
tctcgttccc 
cgatgtttgt 
tcataggtca 
cgcggcgact 



gcgggagaaa 
gctttattag 
gttggtctat 
tegcagtegg 
cgtcctgatc 
ggatagctct 
tattegctet 
aggtgacgee 
cgttcgcctg 
tataaattat 
atggagcgtt 
ttctgtccca 
ttggagtgca 
attgeegget 
tagattcccc 
gagttgccgt 
aagggaaaca 
atgtcattgc 
atgegtaact 
agtttaacta 
atttgatgga 
tettcteggt 
ccgtctcgct 
gettateggg 
tccggctgac 
gttcgctgtc 
tgcggttctt 
tcccgtggcc 
ggegctgett 



tgcccttttc 
gecataattt 
aaaagecegg 
gecgacgaga 
ctgctctact 
gtggccgcca 
aatggattcc 
cgtcatccat 
cgtgggtcgc 
aatttgatat 
cctgcatgtg 
aagtgaagcc 
tgtgtcggcc 
atteggaega 
catcagttta 
ttctgcttgt 
tcagcagata 
tactagttca 
ccaaagaaaa 
eggcataatg 
tatattegga 
gaggagtttc 
gctggcgccg 
cgccacattg 
cttacgcggc 
cgtctcgctc 
cteggagacg 
gecatgatgt 
ggcgtgcgac 



gtgtacgcaa 
aegctgtega 
eggctgeaag 
tgctgcgcca 
gcgtcctggt 
eggataatgg 
atttaccatc 
gtgctccagt 
tgtgccagtt 
aatctgatga 
ctgecaggag 
eggegagegt 
atgcacttcc 
gggtccactc 
gcactcatac 
acacagtgaa 
tatgattatt 
ctaaatggtg 
caaagataat 
catatgactc 
cagctctggc 
eggaagatgg 
tagatgetge 
ggcgaactgt 
tggacgcacg 
tggcaaaegg 
gaccgctgga 
tccgcacagc 
agcgtgtgcg 



ttactaggca 
geggecatta 
tggcgttttc 
cctgctccgc 
cagcattctg 
tatttgataa 
cagatattac 
atcctggatg 
atatccaggt 
ctttaggttt 
tctggtgagc 
aaattcaaga 
attgaggagt 
aacaatcact 
ccatgcccat 
aaatgtgcct 
ttaaattgta 
tgtgtcctgc 
ctttacactg 
gcgccgccta 
ccacggcact 
geaaaattge 
tatcctgett 
ggtggtgggc 
gctgactgtt 
tgcccttttg 
tgggcgaggt 
getgeggact 
tgtcccagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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gtcactctgt 
gtcggaggca 
gacgcgcaga 
tgttgctcaa 
tgtactttgg 
attgacgcct 
tgatgctgcc 
gatgatgatg 
ggtgctgctg 
gctgcatcat 
ttttgctgtt 
acggaggaag 
ccacatgagt 
ggttgcaatg 
gggcatctct 
ctccagttcg 
tgcctcgggt 
ctttcggaat 
cgaggaatcc 
caccacgccc 
taaccaaaga 
ataggatagt 
tgctcttagt 
tttaagatct 
aagatcagct 
ggcctattac 
agctacaaaa 
acaaagtggc 
ttcgatccga 
aaacggagaa 
cccaggccaa 
tcttacaaat 
tgctaacgaa 
catcatcaat 
aatatgttct 
atcggcaatc 
cataaacaat 
atgcacccgc 
gatccaggct 
gagacccagg 
caaatgggga 
ttgtttaata 
aaaaactgtg 
tggattgacc 
cacccatttg 
cccgatcttt 
gttgttgtag 
ggattgttta 
agacactctc 
cggccacata 
agcgggaata 
agtaaaatat 
gccactgtga 
aaggctgtag 
gtggaatagg 
gggagtttat 
agcctcacag 
gtccttctcg 
caccacctcg 
gcgtggctca 



gaaaccacga 
gtgggagcta 
aaacttactc 
ctccgattgc 
ccgggcggtg 
ctcccgcatg 
tatattatca 
ctgctgctgc 
ctgcatctgc 
agttggagca 
ggagaatatc 
cgatggcgcc 
cgaggtcaca 
catgaacata 
cgacgcaccg 
ctaccgacgt 
cccagcttgc 
cggcagtttg 
cggcgtccgt 
tggggattcc 
ttatactacg 
taaagatatt 
gggttctata 
gttttcgagt 
cagctggcgg 
cgccgacaaa 
actacaaaac 
tgtcgtacaa 
tagcaccagg 
accaatatgc 
aaaggcgcca 
taatgtttat 
tatggtcatg 
attctcgtga 
atcaatgcac 
tgccaggtgg 
tgatagtcct 
cacgtcttcg 
cgagctgtgc 
aaaatgatgc 
ctggacatct 
tagcattgtt 
gcaataatca 
tgactgtgct 
actggttttc 
tccatttgtc 
aaacgcacgg 
tctatctacc 
ccagtccttc 
ctgaatggcc 
ttcagtgcct 
gaaatttatt 
tgtcaccact 
cgatctcctc 
tagccatgat 
tgcaataatt 
aatgccttgc 
cgcagtttct 
acctccatgg 
ttctggtaga 



aattggactt 
ttttcgatta 
ttaatcacag 
gatgtggaga 
ggcgtctggc 
ctggagcagt 
tcggtctgag 
tgctgttgct 
tgatgttgct 
ggggcatgcg 
catagattag 
atcactgtag 
ctatccacgg 
tctacgaaat 
gtggagctga 
tggagccacc 
ggattaaggc 
tgttctttac 
ttttggacat 
gcttcagctc 
cggtcaacat 
ccaacagcaa 
tagttcaagt 
acatgttgtt 
taaagtacga 
caacattcat 
ttccttacta 
aaaatgataa 
gatatagccg 
ttggaatcgt 
aaaacctatt 
tatatactca 
tgctcagact 
tgctgtagtt 
tcaccaaggg 
ttataatggc 
tgatcacctt 
agaacatggc 
agatataggg 
tcaaataggt 
tgatgtatct 
aagctcacct 
ctccaatgac 
cgatgtagat 
ccaagcttaa 
ccagctgaaa 
gtcccttgag 
gtcacaccct 
acgcgataat 
agggcagctg 
cgcagtctgt 
acaagaattt 
ggtggacaag 
cacactgcac 
cagccattga 
ttaaattgag 
gtgcaaagtg 
tcaatgattc 
catccaggtc 
cagtgcccac 



cagagcggag 
gaaaatcgat 
ccggagtact 
tgttggtgga 
tgccactgtc 
ctcctccgct 
cggacaccga 
gttgcatctg 
gtaagtgctg 
agtggtgctc 
agtgtattta 
actgcaccga 
acgtcatctc 
ccccctcctt 
gagcctaaca 
tgtgcaggtc 
ctgcagctcg 
atctcgcaat 
gggtctcaac 
caccaactgc 
ctaggaagga 
ggaagtggga 
ttatagttag 
tctgtatcaa 
tcagttaaga 
attacacgat 
ccagtcttac 
aaaacgagag 
ctccggccag 
tcagagccgg 
gaaaaaaatt 
ccaacaacag 
ggcagtactc 
aataataatc 
ttccaactgt 
tatatcaatg 
cttattgagc 
tccaaaactg 
aaaagctgcc 
catcatacag 
ataaaaacat 
ttgattgcga 
cgaagcacta 
caaagtgcga 
atccaagtgt 
ctgattgatc 
aatgtaaaat 
cgaaggatgt 
caaaatgtgt 
cccagatacc 
cgtactccac 
taatagactt 
ggcaaaagat 
tcgctgtcct 
taggctctgc 
acttaagtta 
ctcgttaaag 
agccacatcg 
agccaccatg 
tctggtccaa 



cccatgctgg 
agtattatag 
gggctttgag 
ggagttccgg 
gtagtgcctg 
ccgcttgccc 
gtttctcttc 
ctggtgttgc 
ctgcagttgc 
tacaatcgat 
agaccaaaat 
cacaaaggag 
ggtaacttct 
agccactcac 
atgtcatcgt 
accccatcga 
ttctccttct 
cgttgctcca 
gtggccctaa 
aaaacgattc 
tcaactctac 
tttgcctaca 
tctgtagtgt 
ggtgtatcaa 
agctctaacc 
tattatatgt 
tttcggtacg 
taagacaaaa 
acaagtgatg 
tatagaaaca 
attgctttat 
tcccttgtag 
gttttcgggg 
attttagatt 
ttagtttcgc 
gccaaaagca 
ttcagatcgg 
agactgaatc 
acactactta 
cccacaatga 
caatatttta 
tacttaatgt 
gccgatacaa 
tccttgggag 
gacttctgtg 
aggatgttgg 
atgttttaac 
attacgtaga 
tagcagatcc 
atcgtaggta 
aagatactca 
tactggtact 
ccactagaat 
gcgtgacatt 
catacatatc 
gtatatccaa 
ttgcccagaa 
ttgacgaagc 
tgattgtggg 
ttgaactcct 



tgcggcgatg 
tattataaga 
tgtggacaca 
tgcgagctgc 
gaggcggtcg 
gccgtactgg 
tccagatggc 
tgctgctgca 
tgctggatgg 
tgctttttct 
ttaagaactt 
ttatccatct 
atgacaaacg 
ctgtggtgct 
tcagtatggg 
tccatttgcg 
ccatcagcgc 
gttcgcacac 
cgcgtttgtc 
gcatgaatat 
ggacagtaat 
accttgctgc 
tatatagcat 
cgctagacga 
acaactgatc 
aacatataac 
aacaccaaac 
actaaatcct 
aacacgttat 
tgtcgagttt 
tataatgata 
gcataaataa 
atcaccaaga 
tttcacttca 
ggtaaaaggg 
cgcccacaac 
tgaatatcga 
cagccatcaa 
atgtggtatc 
tcagattgtt 
ttaaacgaaa 
taaaggccag 
tatatatggt 
gagttttccc 
agtggtattc 
ccttgcgctc 
tgcaaaaact 
gcctcaagga 
tctcgctttt 
tagccatgaa 
tcagcagtct 
aacaatgcca 
ggcaccttcc 
ccaccaatcc 
caatctggaa 
ctcacttata 
tgatcctcac 
tctgcgtttc 
gcagcgaata 
tcagcaaggc 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
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cagtcg'cggc 
atccttggtg 
gctggccttg 
attgggaccc 
gaaaggaaac 
attacaatat 
gtccatcatc 
tcgttccacc 
cccaaggcga 
tccccgtacg 
gcggtgctcc 
atgttgaccg 
aacttggagc 
gccgaggcaa 
tggcactcgc 
caaaattgct 
atgatctaaa 
ggtgttgcat 
gtcccagaac 
atctcagcca 
cttgacattt 
agctgtcact 
tgcgcttgta 
gcccgttgcc 
caaattgaaa 
ttgagaaaat 
acaaggaggt 
atactttgag 
atatctattt 
atatcctgac 
ccgcctcaag 
aatgttcaag 
cgccaatctg 
tctgtacttc 
tcgagcgcca 
ggcaggttag 
tat 



gcattaaagg 
aacatgggat 
gctatgggat 
gaatgcatca 
caaaattaca 
gaaccaatga 
agagattcac 
acatgtgcag 
gctttacact 
ggaagaatcc 
aaaggagcag 
cgtgggctgt 
caagttgtgg 
ggctgcgccc 
atgcaatttt 
tcaaattgct 
ggattcgtgc 
cctcttaaat 
tggcttttat 
acgaagaact 
gtctgacagg 
gctccgcctt 
aacatcctgc 
atggactact 
attgtgagta 
cggcgaaggc 
ggccatcaag 
ggaaatcaag 
tccatttgtt 
ctaactggtc 
agcgaggtta 
gggattgcat 
cttatcagcg 
cctgaagacg 
gagattctat 
ccatgactct 



cattc.tccga 
gggtgtccgc 
cggtcacatg 
tatcgaagaa 
ttccagaata 
acattaaaaa 
ttaatgaaat 
cctgtagttg 
gggcataacg 
cgctatgtag 
gaacagcagg 
aaaacgattc 
gtggtgaatg 
agtgggtgtg 
acaggacagg 
gcagcaatgc 
attatttagt 
cttatgtgta 
ttcactaccg 
tttgcagtgc 
aggaattcga 
ggcctaggct 
aacaggtggt 
ggggtcgaaa 
accataatgg 
gtccatggct 
aaggtggccc 
accctgcagc 
aattatgtaa 
tctccttggt 
atcccctgag 
atctgcatga 
atacagatat 
agtcgcgcct 
ttggcagcca 
ttgatgggtt 



gggtaccacg 
gtagctgagc 
ggtgcacgca 
ggactttacg 
tatggaaata 
aaatacctaa 
cttcaagcta 
gccagtatct 
ccccgaccgg 
acgtccgatc 
ctggcgaatg 
taaatgtgaa 
gcagtggttg 
tattattttc 
tggagcaagg 
tggcacgttt 
atgctcgtta 
catgttttta 
atgccatttg 
gagggtccga 
ggcaggaagt 
ttccaacccc 
cccatcagct 
aacgaagtag 
aggactatgc 
gcgtgtttaa 
ttaagaacaa 
tttgcaagtc 
ctaattgtag 
gctggaatac 
ccgccagcag 
agcgggtctc 
gcttaagatt 
ttattcgcct 
gaagtatgga 
tacatatatc 



cgaaagaaat 
tgggtgaggt 
gcgccgaaga 
cccacagcag 
tgcataggta 
ttgaacttta 
aaccctaact 
ttccatgctc 
tgtaggagtt 
tcttggctgt 
gaaaccaact 
tagcgcatat 
gtttcgggtg 
tacgttctac 
ctaaggccag 
agtaattata 
acgcgaagga 
ggaacctacc 
aaacacttat 
aaaattatgt 
gcgcatgtcc 
cgtaattacc 
ggtactccgg 
ataactgctc 
gcccagccgc 
ggccatcgat 
gttcggaaac 
tgaatatgta 
attctggaca 
cagccggaca 
gtgcgtaagt 
atgcaccggg 
gccgatttcg 
caggtctcaa 
accggagtgg 
cttatggctg 



tcggaaacgc' 
gccagtgctt 
gcatcacttt 
cattgcactg 
aaggtgtgat 
aaaaccacaa 
cacctgagtg 
attaacgtga 
ttccacgcca 
cctgccacag 
tggccggaac 
cgaggacgtg 
agccaaggct 
ctgtgtttgc 
tccaggccag 
aagcaattta 
gtatcttcca 
agcttgctta 
tttacatcaa 
atccccctga 
ttaaaccgaa 
ataaacccac 
caattgttgt 
ggaaaattac 
tataaaatgc 
ctgcagcgca 
atagccctaa 
ggtataccat 
tcattgatat 
ctctgtacaa 
ttgcacacca 
atattaaacc 
gactggcccg 
cccggtggta 
atatgtgggc 
acattattgc 



5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7503 



<210> 59 
<211> 3971 
<212> DNA 
<213> Drosophila 



<400> 59 

aaaagttctt 

gaaataaaag 

agttggtttc 

acagccaaga 

aactcctaca 

gagcatggaa 

aatgctgctg 

atgctcttcg 

tggcacctca 

ccgaatttct 

ctgaaggagt 

ctgccccaca 

cagagcttcg 

atcattctgg 



cgttggctga 
ccagttctgg 
cattcgccag 
gatcggacgt 
ccggtcgggg 
agatactggc 
tgggcgtaaa 
gcgctgcgtg 
cccagctcag 
ttcgcgtggt 
tcaattggac 
atcacatggt 
tcaacgatgt 
gcaactttaa 



gatttgatgt 
gactaagcaa 
cctgctgttc 
ctacatagcg 
cgttatgccc 
caactacagg 
gtccttcttc 
cacccatgtg 
ctacgcggac 
accctcggag 
cagagtgggc 
ggctgacctg 
ggctgaatca 
cgagcacttt 



aaaataagtg 
gctgcccacg 
ctgctccttt 
ggattcttcc 
agtgtaaagc 
ctgcacatgt 
gatatgatgc 
accgatccca 
acccatccca 
aatgccttta 
actgtctacc 
gatgccatgg 
ttgaagaaac 
gcacgcaagg 



tttcaaatgg 
cggtcaacat 
ggagcaccgc 
cgtacgggga 
tcgccttggg 
ggtggaacga 
attcgggtcc 
tagccaaggc 
tgttcaccaa 
atgcgccgcg 
agaatgagcc 
aggtcgaggt 
tgcgcgagaa 
cattctgtga 



catcggtagt 
gttccggcca 
ctgtggcagg 
tggcgtggaa 
tcacgttaat 
cactcagtgc 
caataaagtg 
cagcaagcac 
ggatgcgttt 
actggccttg 
acgctattcg 
ggtggaaacg 
ggacgtgagg 
ggcttataaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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ttggatatgt atggcagagc ctatcaatgg ctgatcatgg ctacctattc cacggattgg 900 

tggaatgtca cgcaggacag cgagtgcagt gtggaggaga tcgctacagc cttggaaggt 960 

gccattctag tggatctttt gcccttgtcc accagtggtg acatcacagt ggctggcatt 1020 

actgctgatg agtatcttgt ggagtacgac agactgcgag gcactgaata ttcccgcttt 1080 

catggctata cctacgatgg tatctgggca gctgccctgg ccattcagta tgtggccgaa 1140 

aagcgagagg atctgctaac acattttgat tatcgcgtga aggactggga gagtgtcttc 1200 

cttgaggctc tacgtaatac atccttcgag ggtgtgacgg gacccgtgcg tttctacaac 1260 

aacgagcgca aggccaacat cctgatcaat cagtttcagc tgggacaaat ggaaaagatc 1320 

ggggaatacc actcacagaa gtcacacttg gatttaagct tgggaaaacc agtcaaatgg 1380 

gtggggaaaa ctcctcccaa ggatcgcact ttgatctaca tcgagcacag tcaggtcaat 1440 

ccaaccatat atattgtatc ggctagtgct tcggtcattg gagtgattat tgccacagtt 1500 

tttctggcct ttaacattaa gtatcgcaat caaagataca tcaagatgtc cagtccccat 1560 

ttgaacaatc tgatcattgt gggctgtatg atgacctatt tgagcatcat tttcctgggt 1620 

ctcgatacca cattaagtag tgtggcagct tttccctata tctgcacagc tcgagcctgg 1680 

atcttgatgg ctggattcag tctcagtttt ggagccatgt tctcgaagac gtggcgggtg 1740 

cattcgatat tcaccgatct gaagctcaat aagaaggtga tcaaggacta tcaattgttt 1800 

atggttgtgg gcgtgctttt ggccattgat atagccatta taaccacctg gcagattgcc 1860 

gatccctttt accgcgaaac taaacagttg gaacccttgc atcacgagaa tattgatgat 1920 

gtcttggtga tccccgaaaa cgagtactgc cagtctgagc acatgaccat attcgttagc 1980 

attatttatg cctacaaggg actgttgttg gtttttggcg cctttttggc ctgggaaact 2040 

cgacatgttt ctataccggc tctgaacgat tccaagcata ttggtttctc cgtttataac 2100 

gtgttcatca cttgtctggc cggagcggct atatccctgg tgctatcgga tcgaaaggat 2160 

ttagtttttg tcttactctc gttttttatc attttttgta cgacagccac tttgtgtttg 2220 

gtgttcgtac cgaaattggt ggagctgaag cggaatcccc agggcgtggt ggacaaacgc 2280 

gttagggcca cgttgagacc catgtccaaa aacggacgcc gggattcctc ggtgtgcgaa 2340 

ctggagcaac gattgcgaga tgtaaagaac acaaactgcc gattccgaaa ggcgctgatg 2400 

gagaaggaga acgagctgca ggccttaatc cgcaagctgg gacccgaggc acgcaaatgg 24 60 

atcgatgggg tgacctgcac aggtggctcc aacgtcggta gcgaactgga gcccatactg 2520 

aacgatgaca ttgttaggct ctcagctcca ccggtgcgtc gagagatgcc cagcaccaca 2580 

gaagttaccg agatgacgtc cgtggatagt gtgacctcga ctcatgtgga gatggataac 2640 

tcctttgtgt cggtgcagtc tacagtgatg gcgccatcgc ttcctcccaa aaagaaaaag 2700 

caatcgattg tagagcacca ctcgcatgcc cctgctccaa ctatgatgca gcccatccag 2760 

cagcaactgc agcagcactt acagcaacat cagcagatgc agcagcagca cctgcagcag 2820 

cagcaacacc agcagatgca acagcaacag cagcagcagc agcatcatca tcgccatctg 2880 

gagaagagaa actcggtgtc cgctcagacc gatgataata taggcagcat caccagtacg 2940 

gcgggcaagc ggagcggagg agactgctcc agcatgcggg agaggcgtca atcgaccgcc 3000 

tccaggcact acgacagtgg cagccagacg cccaccgccc ggccaaagta cagcagctcg 3060 

caccggaact cctccaccaa catctccaca tcgcaatcgg agttgagcaa catgtgtcca 3120 

cactcaaagc ccagtactcc ggctgtgatt aagactccca ctgcctccga ccatcgccgc 3180 

accagcatgg gctccgctct gaagtccaat ttcgtggttt cacagagtga cctctgggac 3240 

acgcacacgc tgtcgcacgc caagcagcgc cagtcgccgc ggaactacgc cagtccgcag 3300 

cgctgtgcgg aacatcatgg cggccacggg atgacctatg acccgaacac cacctcgccc 3360 

atccagcggt ccgtctccga gaagaaccgc aacaaacatc ggccaaaacc gcaaaagggc 3420 

accgtttgcc agagcgagac ggacagcgaa cgggaacgag atccgccgcc caacagtcag 3480 

ccgtgcgtcc agccgcgtaa ggtcagccgg agctctaaca tccagcacgc cgcccaccac 3540 

cacagttcgc ccaatgtggc gcccgataag cagcggagca ggcagcgcgg caagcaggat 3600 

agcagcatct acggcgccag cagcgagacg gaactgctcg agggcgagac ggcaattttg 3660 

cccatcttcc ggaaactcct caccgagaag agtcccaact atcggggccg cagtgccgtg 3720 

ggccagagct gtccgaatat atccatcaaa tgcgatatcg tcgagtactt gtaggcggcg 3780 

cgagtcatat gcattatgcc gtagttaaac tcctattgtt agatccagtt gcagtgtaaa 3840 

gattatcttt gttttctttg gagttacgca tctacataca taccctgtga agcaggacac 3900 

acaccattta gtgaactagt agcaatgaca tgagatagtt gggaaataaa ttacaattta 3960 
aaataatcat a 3971 

<210> 60 
<211> 1221 
<212> PRT 
<213> Drosophila 

<400> 60 
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Gly 
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He 


Leu 


Ala 
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Tyr 
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Met 


Trp 


Trp 


Asn 


65 
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Gin 


Cys 
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Ala 


Ala 


Val 


Gly 


Val 


Lys 


Ser 


Phe 


Phe 
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Met 
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Met 
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Ser 


Gly 
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Met 
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Ala 
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130 
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Asn 


Phe 


Phe 


Arg 


Val 
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Pro 
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Glu 
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Ala 


Phe 
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Ala 


Pro 
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Leu 
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Thr 


Val 
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1 T A 
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He 


He 
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225 










230 










235 
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He 
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Ala 
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He 
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465 470 475 480 
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800 


Glu 


Pro 


He 


Leu 


Asn 


Asp 


Asp 


He 


Val 


Arg 


Leu 


Ser 


Ala 


Pro 


Pro 


Val 










805 










810 










815 




Arg 


Arg 


Glu 


Met 


Pro 


Ser 


Thr 


Thr 


Glu 


Val 


Thr 


Glu 


Met 


Thr 


Ser 


Val 




820 










825 










830 






Asp 


Ser 


Val 


Thr 


Ser 


Thr 


His 


Val 


Glu 


Met 


Asp 


Asn 


Ser 


Phe 


Val 


Ser 




835 










840 










845 








Val 


Gin 


Ser 


Thr 


Val 


Met 


Ala 


Pro 


Ser 


Leu 


Pro 


Pro 


Lys 


Lys 


Lys 


Lys 




850 










855 










860 










Gin 


Ser 


He 


Val 


Glu 


His 


His 


Ser 


His 


Ala 


Pro 


Ala 


Pro 


Thr 


Met 


Met 


865 










870 










875 










880 


Gin 


Pro 


He 


Gin 


Gin 


Gin 


Leu 


Gin 


Gin 


His 


Leu 


Gin 


Gin 


His 


Gin 


Gin 










885 










890 










895 




Met 


Gin 


Gin 


Gin 


His 


Leu 


Gin 


Gin 


Gin 


Gin 


His 


Gin 


Gin 


Met 


Gin 


Gin 








900 










905 










910 






Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


His 


His 


His 


Arg 


His 


Leu 


Glu 


Lys 


Arg 


Asn 






915 










920 










925 








Ser 


Val 


Ser 


Ala 


Gin 


Thr 


Asp 


Asp 


Asn 


He 


Gly 


Ser 


He 


Thr 


Ser 


Thr 




930 










935 










940 










Ala 


Gly 


Lys 


Arg 


Ser 


Gly 


Gly 


Asp 


Cys 


Ser 


Ser 


Met 


Arg 


Glu 


Arg 


Arg 


945 










950 










955 










960 
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Gin 


Ser 


Thr 


Ala 


Ser 


Arg 


His 


Tyr 


Asp 


ber 


bly 


Ser 


Gin 


Thr 


Pro 


TV-it- 

i nr 










rt r c 

965 










y / u 










y 1 D 




Ala 


Arg 


Pro 


Lys 


Tyr 


Ser 


Ser 


Ser 


His 


Arg 


Asn 


Ser 


Ser 


Thr 


Asn 


lie 


















985 










990 






Ser 


Thr 


Ser 


Gin 


Ser 


Glu 


Leu 


Ser 


Asn 


Met 


Cys 


Pro 


His 


Ser 


Lys 


Pro 






995 










1000 








1005 






Ser 


Thr 


Pro 


Ala 


Val 


He 


Lys 


Thr 


Pro 


Thr 


Ala 


Ser 


Asp 


His 


Arg 


Arg 




1010 








1015 








1020 








Thr 


Ser 


Met 


Gly 


Ser 


Ala 


Leu 


Lys 


Ser 


Asn 


Phe 


Val 


Val 


Ser 


Gin 


Ser 


1025 








1030 








1035 








1040 


Asp 


Leu 


Trp 


Asp 


Thr 


His 


Thr 


Leu 


Ser 


His 


Ala 


Lys 


Gin 


Arg 


Gin 


Ser 










1045 








1050 








1055 


Pro 


Arg 


Asn 


Tyr 


Ala 


Ser 


Pro 


Gin 


Arg 


Cys 


Ala 


Glu 


His 


His 


Gly 


Gly 








1060 








1065 








1070 




His 


Gly 


Met 


Thr 


Tyr 


Asp 


Pro 


Asn 


Thr 


Thr 


Ser 


Pro 


He 


Gin 


Arg 


Ser 






1075 








1080 








1085 






Val 


Ser 


Glu 


Lys 


Asn 


Arg 


Asn 


Lys 


His 


Arg 


Pro 


Lys 


Pro 


Gin 


Lys 


Gly 




1090 








1095 








1100 








Thr 


Val 


Cys 


Gin 


Ser 


Glu 


Thr 


Asp 


Ser 


Glu 


Arg 


Glu 


Arg 


Asp 


Pro 


Pro 


1105 








1110 








1115 








1120 


Pro 


Asn 


Ser 


Gin 


Pro 


Cys 


Val 


Gin 


Pro 


Arg 


Lys 


Val 


Ser 


Arg 


Ser 


Ser 










1125 








1130 








1135 


Asn 


He 


Gin 


His 


Ala 


Ala 


His 


His 


His 


Ser 


Ser 


Pro 


Asn 


Val 


Ala 


Pro 








1140 








1145 








1150 




Asp 


Lys 


Gin 


Arg 


Ser 


Arg 


Gin 


Arg 


Gly 


Lys 


Gin 


Asp 


Ser 


Ser 


He 


Tyr 






1155 








1160 








1165 






Gly Ala 


Ser 


Ser 


Glu 


Thr 


Glu 


Leu 


Leu 


Glu 


Gly 


Glu 


Thr 


Ala 


He 


Leu 




1170 








1175 








1180 








Pro 


He 


Phe 


Arg 


Lys 


Leu 


Leu 


Thr 


Glu 


Lys 


Ser 


Pro 


Asn 


Tyr 


Arg 


Gly 


1185 








1190 








1195 








1200 


Arg 


Ser 


Ala 


Val 


Gly 


Gin 


Ser 


Cys 


Pro 


Asn 


He 


Ser 


He 


Lys 


Cys 


Asp 



1205 1210 1215 

lie Val Glu Tyr Leu 
1220 



<210> 61 
<211> 18996 
<212> DNA 
<213> Drosophila 



<400> 61 

gttccttaaa 

cgatatatca 

atgtcctttg 

acgcttgccg 

ctccatacgg 

ttgacgaagt 

atcacaaacg 

aggcgctcjcc 

ctgtcggtga 

tggaattgga 

ccagcagcag 

tgagggtcag 

gcgccgtggt 

ccatcacgaa 

gcatgccggt 

gcaagctcat 

tcttgtccgt 

ttagtcccta 



cgtgatcaaa 
acaagtccag 
gcagcatgtt 
tcagtgcttc 
ctgtgccgac 
gtgttgtact 
agcatccttc 
gctggccaag 
cggggtcacc 
tctcggactg 
caccttggcg 
tgccgatgtg 
cgggaacctg 
cgtttttctg 
gaccctggtc 
ccagtttgct 
gtccaaatta 
tccccgtgac 



aatataacta 
ctggcaacat 
gcggtcgctt 
gcgaagattg 
actcaatccc 
gctgcaagtg 
ttcggcctca 
gaggctgcga 
atcacaggga 
gatttggatc 
ccagctgtaa 
cccatctggg 
ctggtggtgc 
ctcaacctgg 
ggcactctgc 
cagggtgagt 
aaaggttatc 
cgagccccat 



gagctactac 
gttgccgcgc 
ggcgccggct 
cgaggcgcga 
gacctaagcc 
caagtccccg 
cgattgacgc 
tccacctgaa 
cacccggtga 
ttaatctgga 
ccgtcaggac 
tggtgccgtg 
taacgctcgt 
ccatctcgga 
taaggcattt 
ggtggaagcc 
ccaaacttcc 
aatcgcttgt 



taagcagcga 
ctgtggtgag 
ggtggtcaca 
cacacatcgc 
caattttttg 
attgctagtg 
cttctaccac 
tgccagcaac 
tcttctcaac 
catggaccta 
accggggaac 
ctattcggcc 
ccagaatcgt 
catcctgctc 
catattcggc 
agtaggccag 
aactgtgaat 
caaatggcca 



gcacagcgtg 
ttgtgcctct 
aaaagcgccg 
ggaacccgaa 
ttgcgtgaag 
ctattccggg 
tacctgcgcc 
gagatctccg 
tacagcgggc 
gccactactc 
aggagcgtgg 
atccttctgt 
cgcatgcgca 
ggcgtattct 
gagctgctct 
ttcatttatg 
tccgcattta 
cgaaccagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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cgacctcgag 
gatgacacgt 
ggagcgcgac 
gagccagtac 
ctcctccgca 
tgaatctgaa 
agacagatag 
ccccagatcc 
gctttcgagg 
ctggaaatga 
ttaacaaaat 
tcagtgtaca 
gctaagctta 
tgaccatact 
ctgagtctgc 
tcgttggcaa 
attttatgat 
accgggtttt 
acacgcacac 
tcaactcaat 
taaacattct 
gtggcgatat 
tgattccatg 
ctgtcacaaa 
tttaatcaaa 
ctcaacctgt 
agtccaggcc 
tgcaaattgc 
tgtacgcagt 
tccttgctcc 
taaccgtggc 
tagtgttccg 
acatccttta 
tttcctgttt 
gtttgttttc 
cctatccact 
ccgcccccgc 
tcaggcacac 
ggcccagaaa 
atccgcatcc 
gggtcgtaaa 
tcacgtcaag 
ccttccggag 
cagtggcatt 
tttcatcatc 
ccgaaggata 
agaatttcaa 
tggggagcta 
cacaaaccga 
acgaacatct 
gaaaggaatg 
gttgtctaat 
agaaaaggtt 
acatccttag 
acctggttgc 
ccacccattg 
attgaatttt 
cttgttttct 
cgcgcaggga 
aagggccaaa 



cggattcgcc 
cgctgcttcg 
tgcctggcca 
ttccgtcgcc 
tcaccaccag 
tccgcatcta 
ccacgtgtgg 
ccagtgtact 
atgatttcat 
aagtggattt 
cctagtagag 
ttcactatgc 
atattgtata 
gtatatttta 
catcgatttt 
gaacgcacta 
ggccgcccgc 
gggttcctca 
acacgcactc 
gactgtgccc 
caccaggcat 
tcatcagctg 
tcccgtcggc 
tgctaatcgg 
gtgcatgcca 
ttcacctcct 
agtcatcctc 
tcgcctgccg 
gtcgatctgg 
tttggcgttg 
aatattgcga 
ttccgaagcc 
cgtcccctcg 
cctgcgtttc 
ggtactgccg 
gcccccggca 
actaacttat 
acatgccaat 
aggagctggc 
ttgttcgcat 
tctgaaaatg 
gtgcagcgat 
tgggcgtgaa 
tgcattggac 
ttgcttgcaa 
agctcctgta 
cgaaattcac 
ctaaaaatac 
tggtcttttt 
tagcaactaa 
ttttctgttt 
taacttaatt 
tctttgtggc 
accaaagttt 
actttttgta 
ggcgcgtcca 
ccttggctct 
tggcaacaca 
gctaagtgtc 
ggcactcgta 



agcagatcgt 
acgtatgtat 
actgtatgga 
ggcgtcgcca 
catccgcatc 
gggcctccaa 
agtttattcc 
tgggtgatgg 
acttcgagac 
tctgcaggcg 
tcttaagtaa 
tttgagattc 
atatacaaag 
gattgtgcgc 
acgcacaatt 
tatctcgtct 
atttgcgtaa 
agggttcaaa 
gcacactgta 
aaaactcatg 
taatttgtgt 
agcccggttc 
ggcagtggaa 
catttcggtt 
tcggacattc 
aggactcctt 
cattcccgca 
ggagctctgc 
gagcctactc 
tagcactcaa 
aaatcactct 
cctttccccc 
ccgatccttc 
cgtttcagac 
gaaccgctgt 
tcctgacgta 
taacaacata 
ttgttcagca 
attcagtttc 
tcccattcct 
tcatccctgg 
gtcctgacac 
cgcgcccctg 
tcctgccagt 
gagtaatgtt 
agtccatatt 
aagcgaggca 
atagttgttg 
ttatggtcct 
tattagcaac 
gatgaacact 
tgatatccaa 
gcttgaaaag 
atggctttgg 
ccccaggcgg 
actcgtccaa 
gttttgggtt 
aatacaaatc 
tcaagaaatg 
atgagcaaat 



gt tggccaac 
cgctatgccg 
tcgattcatg 
tcagcaaatt 
cgcatccatg 
tgacgaaaag 
aagtcgaaaa 
ccacttgtcc 
ctttgggaag 
tggccatcga 
atgacgattg 
tctttaaagt 
tagtgtagtg 
ttgtgcacat 
tccgcgctta 
cccaatctgc 
cgacagtcag 
agattaaata 
aatatgcggg 
tttggtcctt 
ctccagctca 
agccatctgt 
cagaatgaag 
tcaaggagcc 
taattatttt 
ggaggcgcat 
agattaatcc 
ttctgctcag 
acccttctca 
gtcgcagcct 
gctatgcagc 
atcagctcca 
tgacaccctt 
gtgctggtga 
tctctcctgg 
tcctggcgta 
ataacaatgg 
agcactcggc 
caaggattcc 
ttccatctgt 
ctgggctgag 
gtccttcctg 
gatttatgat 
ggcaaccaat 
ggcattgtcc 
ggttatagtt 
gcaataacaa 
acttttggct 
tttcttttcg 
tttgtatgtt 
ttaaggctag 
ttaaagtaac 
gacatttgtg 
gctttagaat 
gacggccttg 
gtcgctgaca 
tttttttttt 
agcgccacac 
tgggcgcagt 
ttatgcacaa 



attcaggagc 
gaaatggagt 
gactcggttc 
cgtttgtccc 
cccaaatccg 
gtcaaataat 
cgaaggctgt 
acattggtca 
ggcaagaagg 
tgatttcttg 
gcagaaaata 
cacccttttg 
gggagaagat 
cacttttcca 
tttatggcct 
atccgcctga 
ggtttccccg 
ggcccacgca 
ggaagctctt 
gaaaagtcca 
gataagggtt 
gatatggttc 
cagaaatgtg 
gagcggcatt 
cataacggta 
ccaaacggat 
aggctgatcc 
cattctggtc 
cccagctcct 
aagcagtctg 
gtcacgtcgc 
tcagctccgc 
ccttgcattt 
tgttgtttgt 
cctccaacac 
ccctggcata 
ctggcaaaca 
gctgtctcga 
gcgcccccat 
tgcacggggc 
gcaccaattt 
gctccctgtc 
cgtaattatg 
cgccacccac 
attacttatg 
tctgcatatt 
gctcaaagtt 
cagaattaga 
caagaaattc 
gcccatacca 
gacataatct 
tatgtgtgct 
tgcggaaaag 
tttctctatt 
aaccctgacc 
gctcggcaaa 
tgttgtccat 
agacgtaatc 
gcaattccag 
acacacagtt 



tgataaagaa 
tgcgctccac 
aggtggtgtc 
gttcgaccgc 
cagcagcaaa 
ggttaagtct 
tccgtcgtag 
tcatcccaaa 
ggtcgtctag 
ctattggttc 
aagatgtcgt 
aaatcactta 
aagttttaaa 
cttaggaaaa 
aaatcatatt 
gaatggccgt 
accagcgagc 
cacgacacac 
taaaaccatt 
gtcagatggg 
ggtgatgtgg 
gccttcggcg 
gccctgacac 
aatttgctta 
aggagaggtc 
gatgaagctc 
ccatttccaa 
aatttccata 
gttaccctgt 
gaattgcttt 
gattttcttg 
tagctgctcc 
tcatgacttg 
cttgctgcaa 
cacccccccc 
tccatcttgg 
agttgagcgc 
gccccagaaa 
cctcatccgc 
gtcgactcgt 
atggccatca 
ccacctggct 
catttgtggc 
tgccattcga 
taaatcgaaa 
cgatctacca 
agtcggatta 
ggtaataaaa 
ctcctcgaaa 
aaaagagtgt 
cagataaact 
ggtaagacaa 
ggtcaaattg 
ccactgctcg 
tgtggatggg 
ttaatgatta 
tctattattt 
attaattagg 
ctagtttttc 
tctacagttt 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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ctgatggagc 
taagaactaa 
atgttattga 
tctttttctt 
actgagaagg 
gcacatatat 
tgttcatgtt 
cggccaacga 
atccgcccag 
caagccccaa 
gtcgaactcg 
atgacaccca 
gcaactaagg 
tttaaatacc 
acctgcccat 
tttcatcggt 
aggctaacgc 
gccccgtgtc 
tgtcggccat 
ttaaagcgat 
tatactcctg 
gacaggcatc 
agcatggagc 
ccttttgagg 
taacatggct 
acacacacat 
caagttgcca 
gtattggatc 
ttggacccaa 
aaagcggagg 
gtagcccgtc 
cataaaagat 
gattgaattg 
acgccagctc 
ctttaaagta 
gaccttatat 
aaatgaattc 
gaacgattag 
agagcactct 
tccaataaat 
cccatcaaga 
ggaactttgt 
ttttatgtct 
tcttacctcg 
tgatgcagcg 
acattggcca 
tgcatgcgaa 
cgaagcgaag 
cactcgataa 
ctctagtttt 
ccccatcctc 
agtgtgtagg 
gattgaggct 
atatgtatat 
ttttttcgat 
ttttttcaaa 
cgcggcacat 
ccacaagtgg 
tccttgtgcc 
cgagcgtcga 



ttggcaaaca 
ccacgatgct 
cgcccaaagt 
ttgctggctc 
aggtaggagg 
acatatatat 
gttttcgcct 
tgaatatgaa 
ttctttttga 
cattatttgc 
acactttatg 
gtgttacaca 
aaaaaagcct 
gtgttcatcg 
gaagaggcga 
ttcccgtata 
gcgagtgggg 
ccattgattt 
taaataaaat 
ataaagcacc 
agctgaatta 
ttaagggcgc 
atgggcacaa 
atttccatgg 
tatatgcaac 
ccatgcaccc 
tcaagaacac 
ccgaatacca 
tttattttgc 
cgcaggacat 
tgggtgaaag 
agattgttta 
agtgctctgg 
taagacttgt 
tcttggtata 
tccaccattt 
agccctgaag 
catgcattat 
caaattgtgt 
taacgctcga 
ctttccatca 
ttccgtgaac 
ttttcctcaa 
actcgaactc 
cagcggcagt 
ttcgatggga 
atggagttct 
gaagctgcaa 
atttacccaa 
ccaattaaaa 
gccggcgtcg 
acattttaat 
ttttcgcctg 
atgtatgtac 
ggccttaact 
taatttcaca 
cgaccatctg 
agggtagtga 
ttttcaatct 
aaaggccaaa 



ttaaggggcc 
ctcgtttttc 
gccgccccca 
caaagtttga 
tcggcgaagg 
atatacatac 
tttaatgaga 
tgcgccactc 
ccatgggtct 
accgaacgta 
gtgctccgaa 
caattgacat 
ctgctgccac 
gcaatctgtg 
aagtgatgtg 
attgttttct 
agcgtgtggc 
cgaggcaaaa 
gctcaacgag 
attttttgcg 
atgagacccg 
aaggacatcg 
agggaataca 
ggaaaccaca 
gcatactgcc 
aaacaccgac 
ggttgtgcca 
gccaccaccc 
aatttatcgc 
tgaaaaaaca 
tcacgccgtg 
tagttggtat 
agggaatcgg 
attattatgg 
aagtaaagta 
ttaagtaaca 
gctgaaaaat 
taacactcac 
cacattcagc 
caagaaggac 
atttgcaaaa 
tttgcaccgg 
ctactgtccg 
caattgcgat 
tgggctctcc 
gtgtgacaaa 
cgagaagcag 
aactttcggt 
tcaccagaaa 
tgcaaagttc 
tcatccttgg 
ttgttgtgtt 
agtccgctca 
agacaggcga 
tggcgtatca 
ggcgaaatca 
atacctcttg 
tctgttgata 
ggccatcttg 
cttttgcaca 



aaatggcata 
ttatttccta 
tagacccatg 
acggcatctg 
acgcatttat 
atactgcgct 
gttaactgag 
aacttaatcc 
gcaattgtcc 
tctctgtacc 
attgttttgt 
ggttattgat 
tgctgccact 
tgcgcttctg 
ttaaatgaaa 
ggctttcaac 
atgaaatcaa 
gcttatattt 
agaaaagaaa 
tgtgagtatt 
gtaggcgggt 
ggaacatgga 
aatgaaaggt 
aagtgcggtt 
ctgttgcatt 
agctcaaccg 
ctgcctctgt 
tccgctccct 
tggataaaaa 
gtgccatgac 
aactgccact 
caaaaggaaa 
ggtcaccaaa 
tattaaatgg 
tgcagtttta 
gtccagtaga 
aaacccaatt 
aaatatatgc 
acattccttt 
aaacttaaat 
gtgttggcca 
cacgcggtga 
ggacaaaaaa 
gtccttgttg 
gtaaatcgag 
gctcccagct 
ctagtgaatc 
ttttaagcca 
aatatgcaac 
tagagtttcc 
gaatggagat 
ttacaatttg 
gtttatggca 
ggacccccgg 
atatgtgggg 
aaaggaagca 
tctatgctgg 
ccatcgacat 
ccatccggcc 
accgtttttt 



agtagaaaaa 
tccccccatg 
ccaccgtgcc 
cagagaaacg 
ttgaacttat 
gtgtatgcct 
cgggggaact 
cactcaatca 
tctcggctcg 
gatctgaatt 
aagcgaaata 
tttcaattat 
gatgtcctgt 
tgtattttca 
atatgctcct 
actacactgc 
tttcattcgt 
gtcaagatca 
cccattaaaa 
atttgcgtgg 
tgaagtgggt 
acacggaaca 
gatttttcag 
gacagggcct 
catgtcagcc 
tccatctttg 
tcatcctgtt 
acaacttgcc 
gtgactgcga 
aacaattacg 
ttctcccctt 
aaggcctggc 
tgctggcctc 
tcagggacac 
atcctaaact 
tttcaatgca 
aataatttaa 
cataatatgt 
cagttcgttt 
gcacttcaaa 
aaagtgaggc 
acgcgaaatg 
atcagtttgc 
cccagctctc 
cgacataaag 
aaacttcccc 
tttaatggat 
cgctcggttt 
agcttttggg 
aaaactcaat 
ctcactcgat 
cccccataca 
cttttataac 
ggcggtaaaa 
gggaaaacaa 
gcccgggaag 
tgcggtgtcc 
tcatccgaga 
atccaagcat 
attgttgact 



cgaattgaaa 
tgggcggtac 
acgcgcccac 
ggcttatttg 
gtacacaagt 
gccatattca 
ggaaactccg 
ttatgtggca 
tactcgtaca 
cgtattgcat 
atggcaagtt 
ataaatattt 
ggaaaaagca 
atttgcatac 
aataaacgtt 
acgaaat tta 
tggttttaaa 
taatcaaagc 
actgagcgaa 
cgaaatgaaa 
gggctttgcg 
cggaacacgg 
ctccctttgg 
aatgtgaaat 
acacacacac 
tgtagcgggt 
taacagcccg 
gcaaatttcc 
aaagcatttc 
gcgtgtttaa 
cgttggatgg 
caatgtcgca 
agagtccaga 
cagagtacaa 
tggctcaaat 
tcacaattag 
agcttcggct 
acttttatgt 
aattaaagtg 
tggcgcagaa 
tccccacctg 
cattattcac 
aactgaagaa 
agctccttga 
cggtcaccca 
ggccatgtca 
tcatcgattg 
ggtagttgga 
cttcattcta 
gctcctggac 
ttggccatag 
aattcatgtc 
tcacatgtat 
agttgctctt 
tgggaagcaa 
aacaagaacg 
attgtgggcg 
aaaaagcgta 
cggctgccat 
caaatttgga 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 



88 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 



PCT/US01/09341 



attattttta 
cagttatgtg 
atcatcgcat 
gattggtttc 
caatcgattt 
ctgttttgta 
aatacataaa 
catatcctgt 
aaaagcattt 
cgcaaccggc 
tttcattgtg 
attgcagcga 
tgatgaggca 
cctttgaccc 
ccagttccgt 
aagctaagcc 
gataaacaga 
tttataaatg 
gacattggtg 
ctgcattagc 
cagataatta 
aaattacttc 
taggaaaatt 
gttcttttgg 
tcgacatggt 
ataaccagct 
gatggatggt 
agttgaaagt 
cttttggaat 
gtgggcgaca 
aagaagctca 
aaattatggg 
ccatttgaaa 
atccgtactg 
gcacccaagt 
cggaggtggc 
cgggaacccg 
tcgacccagt 
tgtggacaca 
tccgaacaaa 
cactcgactt 
ataatttatg 
taaaatgccc 
agtcataaac 
aacccaaatc 
gaatgctaat 
atttaaaagc 
ctttggcaat 
ataagaaatg 
cttgttttat 
gttccaaagg 
tgagaagctc 
caaagttctc 
aattcgacac 
cggctgacac 
ctgaattcat 
ataaaattaa 
tccagtttgc 
gagggtgcgt 
ggcccctttc 



tttgagcacc 
gcagtgggtg 
cgaatggggc 
gaaatctgga 
gcatgcgccg 
ttgcaatttt 
tagtgcgtct 
tcaaacaata 
aaatccggtc 
aaacttttcg 
ctacccgcta 
agtgtccggg 
tgatgggttt 
tcacaactcg 
tccaacttcc 
gcactcaaag 
aataaatgaa 
aaattataca 
ttgcattaaa 
atgttgattc 
tttgatttac 
tatcccaaga 
gcccgtggca 
gcaaaggcca 
cagatggagt 
ggaaaatgtt 
tgatgggtgg 
tcattttgtg 
ttttcattgt 
ataaacagat 
tgaaataccc 
attcattcgc 
gcccacactt 
ccatgatatt 
gcagaatgcc 
cggtggcttt 
gctcattgtc 
cgacccagtc 
ggctccattc 
cgcgttggtt 
ttatctttaa 
cccaatttat 
aaaaatacca 
aaatcctttg 
ataatattaa 
agtacataaa 
agacaaacag 
ctgtcatgtt 
catatttaat 
gactgcgaca 
ggcagattat 
gtgtggaaaa 
gatgtcgatg 
agttttcccc 
gccccatccg 
tgttggccca 
catttttgtt 
atttcccagc 
atatggcggc 
cgtttcatca 



gtggagggac 
taagtggaaa 
gctaaatcaa 
attcgcggtg 
cttaaagtca 
cgcatataaa 
ctggtgtgtc 
ctgcgctaga 
gcagaggtta 
atccactctc 
tttgcactaa 
attaaagtgt 
ctgccaggtg 
aaacatgctc 
ttggtcacct 
gttggctctg 
atgttacatt 
tttatactat 
aataaagtgc 
atctgaatat 
atgcttaaat 
cattctctca 
tttggcttgc 
gttatacttt 
cgaaaggtga 
tggccacaac 
aaaccggagc 
cagcaacaaa 
gaacgaagga 
gaagtctgag 
tggtagccac 
agagggaatg 
tcgttcgatt 
tctgcatggc 
ggcatgtgag 
tcgatagcgg 
attttccgac 
gacccagtcg 
gagatggctg 
ttcactcgaa 
aatatgattt 
tatgaattaa 
tatatattac 
ataaagccaa 
tcttatgcac 
tcacaactta 
cgtaggatgg 
tttgaagtag 
aattttattc 
aagtaatcct 
cagggataca 
attggcccca 
ttgtttccct 
ttagccattt 
gaatgccgcg 
tggcccacgt 
gcggactctc 
gaaaagggaa 
gccacgcgac 
tcccgccaat 



acacgcactg 
tgccgaaaga 
aaacaacgac 
gggggaaatt 
tttccaattc 
ttacattaaa 
cgccgattcc 
gtcgaaatcc 
accgtagtag 
gcgactagcg 
actttacact 
tgcatcccca 
gtcgcacttt 
gtaatttacc 
caaaagctga 
tgtttgcctt 
tgtgtacctt 
attttaatgc 
ttaaaagtat 
tgagttacat 
catatcagtg 
gtgtacatgc 
cagaaacgga 
tgcggttgtt 
gcggtgtatc 
cctctgagaa 
acacgcacag 
acgaaaatgt 
gctggccgaa 
gcatgtttgc 
aaactggttg 
tgcaacaaag 
tgccacaatt 
agaaccattt 
gatgcgagga 
aggatggtca 
ccgagttggc 
accagtctac 
ctcgtcaaca 
attggccgaa 
aaattcagcg 
aaaaaaaaaa 
caattttatt 
taaaataata 
accaaactaa 
tcactttggg 
ctggtacatt 
gtcctcgtcc 
tttcactcag 
gtgcaaaagg 
acagagatgt 
accaacacac 
tcaatcagcg 
tgccacttgc 
aaaatgggcg 
tgcggtccgg 
aaaatcactg 
aggggcaaga 
ggagctaaat 
gggggcagcc 



agtccataaa 
tggacttatt 
cagagggcaa 
gaaaatgggc 
gataatcata 
tatttacgtc 
gcgcgagtcc 
aaatatttgc 
tttttaagcc 
agcaattagc 
cat taaacag 
tcccgctgcc 
taattggtat 
aaccaaaata 
aggcgagaag 
acctttgcag 
gaaaatattc 
agtgcctggg 
gcaacacaga 
accccacatg 
ctagcttacc 
ggtgggtatt 
tgcctgtcga 
agcttaattt 
tggagaagat 
tccacttggc 
cacaattagc 
gatgaaaagt 
aatgattttg 
ggtccactta 
ctccatctgc 
caactaaaat 
tgcagcatgt 
caagtttgtc 
tgcgaaaatg 
agccacgagg 
tttgagcacc 
ctgtctacct 
gagccatcaa 
aaataaatgt 
aataaaatac 
aagatatgaa 
gattaacatt 
ttttaaaaaa 
atgttgaatt 
aaacaaccat 
tttactggga 
tcgacagcat 
caggggccct 
taggaagtgg 
gctcgactta 
aacacaaaaa 
attcaattta 
cacttgccac 
tgcgaatatt 
tttctatttt 
agctatggtc 
ggtaaagtgg 
ttaattttgt 
agccagccag 



aatccacttt 
gtcattgagt 
tccggaattg 
catattttgg 
atttatgaag 
agccgtgaaa 
tttgtatccg 
gcagcaaatg 
gcctggctgc 
tgcggtcgca 
gcgagaaatt 
agtcggatga 
tcaactttct 
agcgagtttc 
cctgttcgct 
ttgagatgaa 
agaagccatg 
aaaactctaa 
attgatagta 
ttgtacttat 
cagaagccct 
cccaaaagtt 
agtgcagcta 
aatgtaactt 
cgaaaagcaa 
cttacgagtg 
agttatgaaa 
tcgcactgac 
ccaaaaagcc 
attgcaggcc 
agaccaaacg 
gtgagatttt 
ataaagttat 
actcgaacac 
ccaggacacg 
ggcaccgaaa 
tgagaaccag 
gtcgaccaaa 
attgtcattt 
ccttgtaaat 
ccaatagctg 
gatttgatat 
tgctcatttc 
ttttaaacca 
ggcaagacga 
taaaaatgct 
ttttgctctc 
cgtctgccag 
aggagcttag 
gaaatcactg 
gctggcatta 
tgggcgaagc 
cgagtgccaa 
ttttcactgc 
ttgcctttcg 
cagtatatcc 
ctcgtggtcc 
caacggggta 
taatgagctc 
ccagcatggc 



8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 
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gcagccacac 

tgtgaggtgg 

gttttggcct 

attgggattg 

gggatggcga 

tggctgggga 

gaaaacaaaa 

attaacattt 

aatgagtgag 

cattaatatt 

attagcaata 

gattatgtta 

gccaaaaata 

ggatttttgt 

gccgtttcct 

ccgctgaggt 

tggctgggca 

agccgaccag 

tcaggacttc 

tacaacctgt 

tatcttttta 

ggcagcagtg 

cagcgacgcg 

aatcctcacc 

attttttgag 

ttcagagggt 

gcgtcattcg 

cgtcacacac 

ctcctgccta 

aaagcctgga 

agcgaaagct 

accacccatg 

cggatgctgt 

gccggcgcaa 

atatacatat 

atgcttgact 

cggtgctcca 

aagcgaagtc 

tactttaatc 

tcctgccacg 

gtccctgggc 

cgaaatggaa 

gagaattcat 

acggacatgc 

cgagacgagc 

gagcccgtcg 

tgtgacaata 

gacataaccc 

aaatgtccca 

ttcacatggc 

gcttgatggc 

aaattgcgag 

cccaaatgat 

ctacaatgta 

tctacgtgtc 

gagcggagac 

cagtagtgtg 

tcgtggtcca 

ttctggacga 

tgtagttcgt 



tctgccgtcc 

tccgtggttc 

ttattgcgct 

ggttggggat 

ttagaacgac 

ctttgattaa 

acaaaacaaa 

taattgttcc 

ctcgaataaa 

ggagccataa 

tatcccatgc 

ggcaataaac 

tacatatata 

attggttggt 

cctggacact 

cgcgcacctg 

gtctggtgtg 

gtgagtgtgc 

gcaagtgccg 

tcctggacct 

tcacccgcac 

ggccggaggt 

ccaacggaag 

agcagcacca 

atattcgtag 

tcagtggcgt 

ggattagctc 

aatttgcacg 

aatcgcggaa 

ggagcactct 

ctcccggcgt 

gctagatggc 

gaataatccc 

atcaattctg 

ctctatgtaa 

gtaaatatag 

ccatggatgc 

aaccgtagag 

ttcgagaata 

gaaagatgga 

gactcatttg 

agtgtgcatt 

cgagcgattc 

cagtccaccc 

atttaagcgc 

gacgcgtggc 

aatttgatta 

cgtcataaat 

tcgccgaaca 

ccccactatt 

aatttaaatt 

gtatatggtc 

tcccatcaca 

gcacaaattt 

cttctcttga 

acctccaccc 

gtcagtgcag 

gctccgaagc 

aactggttgg 

agacaatctg 



tattattttg 
cagaatccac 
cttggtgccc 
ggggaatggg 
aatggaccat 
aagcggcacc 
aaaaatgagg 
ggcaacgcaa 
agtttatggg 
aagtcaaaac 
cgccaggaaa 
aaacttaagt 
atctaatgta 
aacttctctg 
ggtggccatc 
gcagacgatc 
catgacgccc 
gccaggataa 
cgagcagtgg 
ggccctgctg 
cctgtacgtg 
caccacctcc 
ccattgtcag 
ccaccattcc 
attttttttg 
agcattaata 
aactactgag 
gcaaagtttc 
gtgaattgaa 
gtccaagttt 
ttttactttc 
cccttctccc 
cagattatgg 
ctaattcgcc 
atttagaata 
aacagtccca 
gagttgataa 
tccccgtgga 
ttactctgcc 
agcactgagg 
catgaaaggg 
agcaccttca 
atttcctagg 
gctgctgaaa 
tcgttaaagg 
actcattttc 
aatttttaac 
ttccaagcca 
agctgaacat 
cgtgccaaga 
atgctcagct 
tcattccgca 
tgctggtagc 
tgtgatatat 
aactcagcca 
tgcactgatc 
ccttcgtctg 
ccagaaaccg 
gtgccggata 
aagtctcact 



cattcgcaat 
ttgccaagtg 
gccaggatta 
gaatgggaat 
tgccggaaac 
attgctatca 
gaaaataaac 
tggagtcgtg 
gtggaggcga 
ttttcgacgg 
ttgaccctag 
tttgaatctg 
aatattttag 
tttttccaat 
tcttgcgagc 
aaccacgcca 
atcgccgcct 
cactaattca 
ccggcggata 
gtccttcctc 
agcatgcgca 
tcctctgccg 
tccctcgaca 
cagtactact 
tcagccagct 
gttgcgactg 
cttatccgcg 
ccacgcacga 
tggctggctg 
ttcaagcgtg 
atttgcattc 
atctacggcg 
gcacaaaaat 
tttaatgccg 
tattgtagta 
gtggagaggc 
cgtgacgtgt 
aatcacagtg 
tctggtccaa 
caatggagca 
ttggcgcctc 
cttgcgccgc 
caactgaccc 
aaggtcaatt 
ccgttatccc 
acacatactc 
tttttcgttc 
aagagcattt 
tttcatggaa 
atccgcatgc 
gaactactcg 
cttaagtatc 
tacaaatata 
ggatataatc 
aatgccattc 
ctcatccgcc 
gagcaggcca 
acagcccatt 
caatgcggga 
gaagtgtcac 



tctaaaagcc 
gagcgagtgt 
gactgtggat 
gggaataggg 
acttgaaaac 
tagcggcaag 
aagttggatc 
gaccaagcgt 
agggcagtcg 
gtggcaaatg 
tcattatttg 
ctcgatttgt 
ttatggattt 
tcattgcagc 
gctactacgc 
acaagatcat 
ttagccagct 
ggaactaacc 
gcctcaacta 
tgctggctct 
acgagcgagc 
ctgtcgcaga 
cgattgtgcc 
atggtaaggg 
gggtcaaagg 
ttgccggggg 
gacccaaagc 
aaagattcgt 
ggggatgggg 
tgtacgaaga 
gcatcgccgg 
tcctccatcc 
tctaggcagc 
ccttaaagcg 
tatgctcgtc 
ctgctgaaga 
atgataaatg 
agcctaccga 
ttaatgttga 
gtggagcagt 
caggtggtaa 
actgaattat 
cgcttgaccc 
taaatgccgt 
attgccagga 
gcactccggt 
cgtttaaggc 
atgtgtgttg 
attgccatat 
tatccatccc 
cccaaccgca 
catttccccc 
gtatcctttc 
ccttcattgc 
ctgctggcgg 
agctggacgc 
agtcgcattc 
ggtggctgat 
tctctgaaat 
tcacaaatga 



tcaacccgtt 
gtgtgcgtgt 
aggcatcggg 
gattgccatt 
tgtttcgtgt 
ccgaaaagca 
tgacaaagac 
aacaattgcc 
ggaaaatggc 
ggaggcgtgg 
ccacagagat 
gaatgaaaac 
tgaatggctt 
tgcctcggtg 
catttgccac 
cgccatcatc 
gatgccgacc 
ccctgtctgg 
cgagcgggca 
gagcttcacc 
catgaacttt 
agccggtagc 
acaccagcac 
gtttatgctt 
gaattctggg 
ttggcagggg 
ccccgaactg 
gtgtctacga 
aaagggggct 
gacaatagca 
ctttaagagc 
tctccagctg 
ccaaggcaaa 
tgctcccaat 
tgcagcagac 
ggaacctcct 
cacacccaca 
gcagcgaact 
gaagaggagc 
ggagcagtga 
gtcctttatg 
cgcagcactc 
atggaccaac 
aaattatgca 
ttctccactg 
cgcctttctg 
catctcgttt 
gctggatcgc 
tattagcagc 
caaactgcgc 
aacggcaata 
ttacatgtat 
catgtaacat 
atcattaata 
tgaggaaggt 
agtggtcctg 
ccggacacat 
ttagaaagcg 
cgaaaaagac 
atggcctgaa 



11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
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ccggcaacac 
gatcaggaag 
gacgcagtgc 
caatcagatt 
gtgcatcctc 
cgtagccctc 
ttttgctttc 
ccctgaacca 
ggttacacat 
gcgtgtgcaa 
cgagccattc 
gtgcatccag 
ccttggcctc 
catgcggcaa 
agactcgcca 
ggtggtccac 
ggcgcagcaa 
ccacatattt 
ggactcttaa 
ggcggcccag 
gtcccaggcg 
ggataaatag 
tcccactcaa 
cgttgcgtga 
gacgggtatg 
tctataacca 
gcattgtaca 
gttttttttt 
agatgtgttt 
caagcggcga 
gcggagcgca 
aagtggaact 
gatgcagctg 
gcagcccagt 
catccacata 
ccaacgaaac 
tgctggagtt 
ttgggccgac 
cctactcatc 
gcgccttcgt 
gcttctggcg 
cgctggccaa 
gccgccgccg 
cgggagcgct 
cattccgccg 
gaggtgccat 
atatgcgtat 
cgactttgac 
atctaaattg 
tttaaacaaa 
tcgtgagcca 
ggaaaatggt 
tggtaaacaa 
cattgtgctg 
agtaaaaggt 
cagcaagctg 
tcaaaaatgg 
attaagaaac 



cagcacggtg 
ggtgaatgtc 
ctccaagagc 
gccagcgtgg 
attgagggcc 
cagtaggaga 
cgccagccat 
gtgctgcagg 
tggcgagttc 
cagtgacatg 
cagccagcga 
accgaagatc 
gaagcgtcca 
ggtctgacgc 
gcggtctccg 
ataaacggcg 
ctcctggtca 
caggcccgga 
gattagtgta 
ggcaaagtcc 
acgccagtga 
caccagtggc 
cttgcattcc 
ttccatgatt 
tttcaatcac 
ctatgactca 
catccattca 
tctgtgctta 
tgtggatacc 
ttgatcagcg 
tccgtgaagt 
ggagctggaa 
cagcagcaag 
ctgcgtatca 
tgaccgaccg 
caagagtctg 
cttcatctgc 
ggtgtacgag 
cagctgctgc 
ggacaccttt 
gcgacgctcc 
ctcggtcatg 
acggcagatg 
aatatgtgat 
cgttctgaat 
ggcaattgcc 
gtgtgtatgc 
ctcgcgttcc 
agatcgccac 
caataggcta 
acaaaatgac 
atatgtgttg 
ctcatttgtc 
cactttctaa 
attctgatca 
attttacagg 
ttgtcagacg 
ccaatcgggt 



cccaaatgca 
ttggtgcagc 
gcagcttcgc 
cgctgg'gcca 
aattcgtaga 
gtggtggcat 
tgcaagaggt 
ggcaggggta 
cacattacca 
gtctccagca 
cttggttcat 
gcggaggcaa 
aggggcacct 
actcggctgg 
ttggcgatga 
ttgtccgcac 
cgcaccagaa 
ttccgactgt 
tcttacgtat 
aacatattcc 
ttgccctgcc 
agcattatcg 
cagttgagtt 
gcgcctcatt 
gcaagattcg 
atatgttttc 
gttttaattt 
acattcaatt 
ctgtaacttt 
gaggaggacc 
ccctgcgcca 
atggcgagtg 
gctacgtgag 
cggaagcggg 
attcgtgtgt 
gagagcaaga 
tggacgccgc 
tacgtgggct 
aatccgatca 
aaggggatgc 
aagaacgaga 
tccagccaca 
gccgagtcgg 
gatgcaatgt 
gtgtgtggaa 
cctccaaatt 
ttagagttgc 
atgtgtttac 
acgctatggg 
agaaatcagc 
ttccaaatgg 
gaaggatttg 
acacatgtcg 
ttttatttcc 
cggtgcagtg 
gtacacaggg 
gagcttatgg 
gccaaaatcc 



ccttaagttc 
gcttctgcag 
tatagcgaag 
cttcatccaa 
gggcgaaggc 
gtccagctag 
tgtcgccacc 
cgtatctgta 
accggtggcc 
aactccaaac 
gagccaccct 
ccacttgtag 
gatcccggag 
gtgtgaagag 
agggattgtg 
agtcggcaaa 
tggcgggttc 
caagaaaagt 
agatttcctg 
caaccaatgg 
atttccaaaa 
atgagcatct 
gaatatctca 
agccacgtaa 
agacagctag 
ggataataat 
atttagtgca 
tgacaatcgt 
ccatcttgca 
ctgcgaagga 
tcagcagatc 
ctgcagccgg 
cgacaatgaa 
actgcggagg 
cgattttcgc 
agcgtgtggt 
tgtatgtgat 
atacctccat 
cctactgctt 
gggtgtgcga 
cgaatctttc 
cgatcctcga 
agtcacttta 
aactgcaata 
gtggccagat 
gctcttcatc 
gctaaagcgc 
accttaaggt 
gaatggaaag 
acaatggttt 
ggggcgctcc 
ttggttcaaa 
tatgagtgat 
acactttacg 
aaaggtttca 
aagtcccact 
agtggcagac 
aattttaagt 



gctgctcgct 
agcctgcatg 
ttcacccaga 
ttcgatgtgt 
gagtagcatc 
ctcctccttt 
acttcgggct 
taagtggata 
aatgagtcta 
gcttggctga 
catgtgaatc 
cgtataacgc 
caactgacat 
cggcaccagg 
gctagccagg 
gttacggccc 
gttaaacagg 
tcattgcacg 
gtaaatctct 
ccagccaaaa 
caggagcgtg 
gaaaagcttg 
aagcagctga 
ttgctgtata 
cattgggctt 
cttagtattt 
tttcttctgt 
atccatttgc 
gattacggcc 
aggaggcatc 
aacggaggag 
gtgcacagga 
tcccggcgca 
tgagtgcttc 
tttcacggta 
caagatgctg 
caacacgatg 
cagtttcctc 
catgaacgcc 
gcgcctgtgc 
ggtggccggt 
gagtccgcga 
ggttagcaat 
gaaactgtgg 
ggataggtgg 
ttaccgataa 
tttattccgt 
gggtgcaacc 
tgctcagctg 
cacgacatgg 
ataaaattgt 
tctggagttg 
atttacgcaa 
ttccaaagat 
atgaatcaag 
ctcaaaagga 
actccgaggt 
agacgaaagt 



cctgatttct 
gccggatgaa 
gccactggat 
aaccgtcgct 
gccgaggttt 
cccaagcctc 
tccaccaatt 
gtggatcaaa 
cccaatcgag 
aagagcggag 
cccaggcaat 
gtggccaagt 
gctctggcca 
tcggcccgca 
acatcgcgct 
accaggatct 
cgatagaaac 
tggccttgaa 
ccatatgagc 
ggtgcctcca 
gcggccaata 
aaaggcacaa 
gagtgaacca 
tcaatcaact 
tgaattaatt 
caacttatat 
tcaaaatttt 
gagtcaaatc 
actgtggcag 
tatactgcat 
gtggaaccct 
tgcgccaaca 
agtcattgtc 
agggtattct 
ccttgcagat 
ttcgtcetag 
accatgctcc 
cagctgctgg 
agcttccggc 
gctccctgct 
aactccattg 
ctctgagcca 
gggtacggat 
ttgtcaacga 
gtggatggag 
gtattttaat 
gtgtgttatt 
agtcatgcca 
gattaagatt 
tcacagtatg 
tttaatttat 
ggtcaaaagt 
gctatccgac 
caatcccaaa 
aagcatttga 
ttcccatcat 
ccaaaactaa 
agctggaaat 



15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
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<210> 62 
<211> 2250 
<212> DNA 
<213> Drosophila 



<400> 62 

atgttgccgc 

gacacacatc 

cccaattttt 

cgattgctag 

gccttctacc 

aatgccagca 

gatcttctca 

gacatggacc 

acaccgggga 

tgctattcgg 

gtccagaatc 

gacatcctgc 

ttcatattcg 

accgagcccc 

ccagcagatc 

cgacgtatgt 

caactgtatg 

acactggtgg 

acctggcaga 

gtgtgcatga 

cgcaagtgcc 

ttcctggacc 

atcacccgca 

gggccggagg 

gccaacggaa 

cagcagcacc 

cgattgatca 

gcatccgtga 

actggagctg 

ctgcagcagc 

agtctgcgta 

aagaagcgtg 

ccgctgtatg 

ggctatacct 

atcacctact 

atgcgggtgt 

gagacgaatc 

cacacgatcc 



gcctgtgcgc 
gcggaacccg 
tgttgcgtga 
tgctattccg 
actacctgcg 
acgagatctc 
actacagcgg 
tagccactac 
acaggagcgt 
ccatccttct 
gtcgcatgcg 
tcggcgtatt 
gcgagctgct 
ataatcgctt 
gtgttggcca 
atcgctatgc 
gatcgattca 
ccatctcttg 
cgatcaacca 
cgcccatcgc 
gcgagcagtg 
tggccctgct 
ccctgtacgt 
tcaccacctc 
gccattgtca 
accaccattc 
gcggaggagg 
agtccctgcg 
gaaatggcga 
aaggctacgt 
tcacggaagc 
tggtcaagat 
tgatcaacac 
ccatcagttt 
gcttcatgaa 
gcgagcgcct 
tttcggtggc 
tcgagagtcc 



cgacgcttgc 
aactccatac 
agttgacgaa 
ggatcacaaa 
ccaggcgctg 
cgctgtcggt 
gctggaattg 
tcccagcagc 
ggtgagggtc 
gtgcgccgtg 
caccatcacg 
ctgcatgccg 
ctgcaagctc 
gtcaaatggc 
acattcagga 
cggaaatgga 
tggactcggt 
cgagcgctac 
cgccaacaag 
cgcctttagc 
gccggcggat 
ggtccttcct 
gagcatgcgc 
ctcctctgcc 
gtccctcgac 
ccagtactac 
accctgcgaa 
ccatcagcag 
gtgctgcagc 
gagcgacaat 
gggactgcgg 
gctgttcgtc 
gatgaccatg 
cctccagctg 
cgccagcttc 
gtgcgctccc 
cggtaactcc 
gcgactctga 



cgtcagtgct 
ggctgtgccg 
gtgtgttgta 
cgagcatcct 
ccgctggcca 
gacggggtca 
gatctcggac 
agcaccttgg 
agtgccgatg 
gtcgggaacc 
aacgtttttc 
gtgaccctgg 
atccagtttg 
cacgaaccag 
gctgataaag 
gttgcgctcc 
tcagctgcct 
tacgccattt 
atcatcgcca 
cagctgatgc 
agcctcaact 
ctgctggctc 
aacgagcgag 
gctgtcgcag 
acgattgtgc 
tatgattacg 
ggaaggaggc 
atcaacggag 
cgggtgcaca 
gaatcccggc 
agatccaacg 
ctagtgctgg 
ctccttgggc 
ctggcctact 
cggcgcgcct 
tgctgcttct 
attgcgctgg 



tcgcgaagat 
acactcaatc 
ctgctgcaag 
tcttcggcct 
aggaggctgc 
ccatcacagg 
tggatttgga 
cgccagctgt 
tgcccatctg 
tgctggtggt 
tgctcaacct 
tcggcactct 
ctcaggtccc 
cacgacctcg 
aagatgacac 
acggagcgcg 
cggtggccgt 
gccacccgct 
tcatctggct 
cgaccagccg 
acgagcgggc 
tgagcttcac 
ccatgaactt 
aagccggtag 
cacaccagca 
gccactgtgg 
atctatactg 
gaggtggaac 
ggatgcgcca 
gcaagtcatt 
aaaccaagag 
agttcttcat 
cgacggtgta 
catccagctg 
tcgtggacac 
ggcggcgacg 
ccaactcggt 



tgcgaggcgc 
ccgacctaag 
tgcaagtccc 
cacgattgac 
gatccacctg 
gacacccggt 
tcttaatctg 
aaccgtcagg 
ggtggtgccg 
gctaacgctc 
ggccatctcg 
gctaaggcat 
tatccccgtg 
agcggattcg 
gtcgctgctt 
actgcctggc 
ttcctcctgg 
gaggtcgcgc 
gggcagtctg 
accaggactt 
atacaacctg 
ctatcttttt 
tggcagcagt 
ccagcgacgc 
caatcctcac 
cagcaagcgg 
catgcggagc 
cctaagtgga 
acagatgcag 
gtcgcagccc 
tctggagagc 
ctgctggacg 
cgagtacgtg 
ctgcaatccg 
ctttaagggg 
ctccaagaac 
catgtccagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2250 



<210> 63 
<211> 749 
<212> PRT 
<213> Drosophila 

<400> 63 

Met Leu Pro Arg Leu Cys Ala Asp Ala Cys Arg Gin Cys Phe Ala Lys 

15 10 15 

lie Ala Arg Arg Asp Thr His Arg Gly Thr Arg Thr Pro Tyr Gly Cys 

20 25 30 

Ala Asp Thr Gin Ser Arg Pro Lys Pro Asn Phe Leu Leu Arg Glu Val 
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35 40 45 



Asp 


Glu 


Val 


Cys 


Cys 


Thr 


Ala 


Ala 


Ser 


Ala 


Ser 


Pro 


Arg 


Leu 


Leu 


Val 




50 










55 










60 










Leu 


Phe 


Arg 


Asp 


His 


Lys 


Arg 


Ala 


Ser 


Phe 


Phe 


Gly 


Leu 


Thr 


He 


Asp 


65 










70 










75 










80 


Ala 


Phe 


Tyr 


His 


Tyr 


Leu 


Arg 


Gin 


Ala 


Leu 


Pro 


Leu 


Ala 


Lys 


Glu 


Ala 










85 










90 










95 




Ala 


He 


His 


Leu 


Asn 


Ala 


Ser 


Asn 


Glu 


He 


Ser 


Ala 


Val 


Gly Asp 


Gly 








100 










105 










110 






Val 


Thr 


He 


Thr 


Gly 


Thr 


Pro 


Gly Asp 


Leu 


Leu 


Asn 


Tyr 


Ser 


Gly 


Leu 






115 










120 










125 








Glu 


Leu 


Asp 


Leu 


Gly 


Leu 


Asp 


Leu 


Asp 


Leu 


Asn 


Leu 


Asp 


Met 


Asp 


Leu 




130 










135 










140 










Ala 


Thr 


Thr 


Pro 


Ser 


Ser 


Ser 


Thr 


Leu 


Ala 


Pro 


Ala 


Val 


Thr 


Val 


Arg 


145 










150 










155 










160 


Thr 


Pro 


Gly 


Asn 


Arg 


Ser 


Val 


Val 


Arg 


Val 


Ser 


Ala 


Asp 


Val 


Pro 


He 










165 










170 










175 




Trp 


Val 


Val 


Pro 


Cys 


Tyr 


Ser 


Ala 


He 


Leu 


Leu 


Cys 


Ala 


Val 


Val 


Gly 








180 










185 










190 






Asn 


Leu 


Leu 


Val 


Val 


Leu 


Thr 


Leu 


Val 


Gin 


Asn 


Arg 


Arg 


Met 


Arg 


Thr 






195 










200 










205 








He 


Thr 


Asn 


Val 


Phe 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser 


Asp 


He 


Leu 


Leu 




210 










215 










220 










Gly 


Val 


Phe 


Cys 


Met 


Pro 


Val 


Thr 


Leu 


Val 


Gly Thr 


Leu 


Leu 


Arg 


His 


225 










230 










235 










240 


Phe 


He 


Phe 


Gly 


Glu 


Leu 


Leu 


Cys 


Lys 


Leu 


He 


Gin 


Phe 


Ala 


Gin 


Val 










245 










250 










255 




Pro 


He 


Pro 


Val 


Thr 


Glu 


Pro 


His 


Asn 


Arg 


Leu 


Ser 


Asn 


Gly 


His 


Glu 








260 










265 










270 






Pro 


Ala 


Arg 


Pro 


Arg 


Ala 


Asp 


Ser 


Pro 


Ala 


Asp Arg 


Val 


Gly Gin 


His 






275 










280 










285 








Ser 


Gly 


Ala 


Asp 


Lys 


Glu 


Asp 


Asp 


Thr 


Ser 


Leu 


Leu 


Arg Arg 


Met 


Tyr 




290 










295 










300 










Arg 


Tyr 


Ala 


Gly 


Asn 


Gly 


Val 


Ala 


Leu 


His 


Gly Ala 


Arg 


Leu 


Pro 


Gly 


305 










310 










315 










320 


Gin 


Leu 


Tyr 


Gly 


Ser 


He 


His 


Gly 


Leu 


Gly 


Ser 


Ala 


Ala 


Ser 


Val 


Ala 










325 










330 










335 




Val 


Ser 


Ser 


Trp 


Thr 


Leu 


Val 


Ala 


He 


Ser 


Cys 


Glu 


Arg 


Tyr 


Tyr 


Ala 








340 










345 










350 






He 


Cys 


His 


Pro 


Leu 


Arg 


Ser 


Arg 


Thr 


Trp 


Gin 


Thr 


He 


Asn 


His 


Ala 






355 










360 










365 








Asn 


Lys 


He 


He 


Ala 


He 


He 


Trp 


Leu 


Gly 


Ser 


Leu 


Val 


Cys 


Met 


Thr 




370 










375 










380 










Pro 


He 


Ala 


Ala 


Phe 


Ser 


Gin 


Leu 


Met 


Pro 


Thr 


Ser 


Arg 


Pro 


Gly 


Leu 


385 










390 










395 










400 


Arg 


Lys 


Cys 


Arg 


Glu 


Gin 


Trp 


Pro 


Ala 


Asp 


Ser 


Leu 


Asn 


Tyr 


Glu 


Arg 










405 










410 










415 




Ala 


Tyr 


Asn 


Leu 


Phe 


Leu 


Asp 


Leu 


Ala 


Leu 


Leu 


Val 


Leu 


Pro 


Leu 


Leu 






420 










425 










430 






Ala 


Leu 


Ser 


Phe 


Thr 


Tyr 


Leu 


Phe 


He 


Thr 


Arg 


Thr 


Leu 


Tyr 


Val 


Ser 






435 










440 










445 








Met 


Arg 


Asn 


Glu 


Arg 


Ala 


Met 


Asn 


Phe 


Gly 


Ser 


Ser 


Gly 


Pro 


Glu 


Val 




450 










455 










460 










Thr 


Thr 


Ser 


Ser 


Ser 


Ala 


Ala 


Val 


Ala 


Glu 


Ala 


Gly 


Ser 


Gin 


Arg 


Arg 


465 










470 










475 










480 


Ala 


Asn 


Gly 


Ser 


His 


Cys 


Gin 


Ser 


Leu 


Asp 


Thr 


He 


Val 


Pro 


His 


Gin 










485 










4 90 










495 




His 


Asn 


Pro 


His 


Gin 


Gin 


His 


His 


His 


His 


Ser 


Gin 


Tyr 


Tyr 


Tyr 


Asp 








500 










505 










510 






Tyr 


Gly 


His 


Cys 


Gly 


Ser 


Lys 


Arg 


Arg 


Leu 


lie 


Ser 


Gly 


Gly 


Gly 


Pro 
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515 










520 










525 








Cys 


Glu 


Gly 


Arg 


Arg 


His 


Leu 


Tyr 


Cys 


Met 


Arg 


Ser 


Ala 


Ser 


Val 


Lys 




530 










535 










540 










Ser 


Leu 


Arg 


His 


Gin 


Gin 


lie 


Asn 


Gly 


Gly 


Gly 


Gly Thr 


Leu 


Ser 


Gly 


545 










550 










555 










560 


Thr 


Gly 


Ala 


Gly Asn 


Gly 


Glu 


Cys 


Cys 


Ser 


Arg 


Val 


His 


Arg 


Met 


Arg 










565 










570 










575 




Gin 


Gin 


Met 


Gin 


Leu 


Gin 


Gin 


Gin 


Gly 


Tyr 


Val 


Ser 


Asp 


Asn 


Glu 


Ser 








580 










585 










590 






Arg 


Arg 


Lys 


Ser 


Leu 


Ser 


Gin 


Pro 


Ser 


Leu 


Arg 


He 


Thr 


Glu 


Ala 


Gly 






595 










600 










605 








Leu 


Arg 


Arg 


Ser 


Asn 


Glu 


Thr 


Lys 


Ser 


Leu 


Glu 


Ser 


Lys 


Lys 


Arg 


Val 




610 










615 










620 










Val 


Lys 


Met 


Leu 


Phe 


Val 


Leu 


Val 


Leu 


Glu 


Phe 


Phe 


He 


Cys 


.Trp 


Thr 


625 










630 










635 










640 


Pro 


Leu 


Tyr 


Val 


lie 


Asn 


Thr 


Met 


Thr 


Met 


Leu 


Leu 


Gly 


Pro 


Thr 


Val 










645 










650 










655 




Tyr 


Glu 


Tyr 


Val 


Gly 


Tyr 


Thr 


Ser 


He 


Ser 


Phe 


Leu 


Gin 


Leu 


Leu 


Ala 








660 










665 










670 






Tyr 


Ser 


Ser 


Ser 


Cys 


Cys 


Asn 


Pro 


He 


Thr 


Tyr 


Cys 


Phe 


Met 


Asn 


Ala 






675 










680 










685 








Ser 


Phe 


Arg 


Arg 


Ala 


Phe 


Val 


Asp 


Thr 


Phe 


Lys 


Gly 


Met 


Arg 


Val 


Cys 




690 










695 










700 










Glu 


Arg 


Leu 


Cys 


Ala 


Pro 


Cys 


Cys 


Phe 


Trp 


Arg 


Arg 


Arg 


Ser 


Lys 


Asn 


705 










710 










715 










720 


Glu 


Thr 


Asn 


Leu 


Ser 


Val 


Ala 


Gly 


Asn 


Ser 


He 


Ala 


Leu 


Ala 


Asn 


Ser 










725 










730 










735 




Val 


Met 


Ser 


Ser 


His 


Thr 


He 


Leu 


Glu 


Ser 


Pro 


Arg 


Leu 









740 745 



<210> 64 
<211> 4289 
<212> DNA 
<213> Drosophila 



<400> 64 

agctgtaaag 

tccattaaag 

aaactgatct 

taattggtat 

acacaaatta 

tgctgaaggg 

tcatcatggc 

gttgctgtgc 

atgacaggcg 

ccacagccag 

tagcttgcaa 

gaaaaaaaaa 

gtactcaccc 

ctgtttattt 

ttattttacg 

taaaacaaaa 

ttgatcctga 

cggcggcggc 

tccgccaggt 

tcgcataggc 

caccgacgcc 

caaaggcgcg 

atgagtaggc 



tactactact 
aagccaccaa 
actttgatga 
gctgataaac 
agaggacggt 
attgccccca 
cctctttttt 
tgtgctgtgt 
gatggactcg 
gacataagcg 
ttaaaatacc 
aaaatattcc 
aaaaggattc 
gcagcaacct 
agtacgatca 
aaaaaaacca 
atgggtcctt 
ggcactattg 
gcccataccc 
gcccgggcct 
tccgctggca 
ccggaagctg 
cagcagctgg 



acatatgtgt 
ctttaagata 
tcaactttca 
aggaaaacca 
aagtccttca 
cacccagacc 
ggccaggctg 
actcgacaag 
aacactcgga 
gcttgtgctc 
cacttgggcc 
agcactttgg 
caactataaa 
taggaagttc 
taatcacaag 
cacgcagatc 
ggaggcgaat 
gtggtcacca 
atggctaggc 
gcgcccgcct 
cctgctccac 
gcgttcatga 
aggaaactga 



acattcaagt 
cacgcatata 
ttgctgctga 
aaatgtcaac 
ctcggacctt 
gcgcccactt 
cggtggcttt 
gcgacgaacc 
cccggacccg 
cactcaactg 
ctataaattg 
tgtatggagt 
gacatacagc 
aattgaaaca 
taggtaaaaa 
gcttatcata 
cctctacagc 
ccgcattgag 
cgggattgag 
ggctggcgga 
gacgccaggg 
agcagtaggt 
tggccgtgta 



ttcgcttacg 
acaactcctg 
ttgaataatt 
agttgataag 
ttcagctgag 
ccacttccac 
gggccttaat 
gagagccttg 
gacccgaacc 
tcgcctttcg 
tgaacgtagg 
ggggtccact 
gagagtgctt 
ccaatagact 
aaatccatat 
catatttaca 
ttagagctga 
aaactcgtgg 
actaatgttg 
gagtcctcca 
cagacccttg 
gatcggattg 
gtcgacatac 



aatgcaacgt 
agaattgtga 
gagtaatgag 
gaacattaaa 
caccgtaaag 
tttaaaataa 
tgtcgactca 
cagctccata 
cggacccagt 
gagctgctcc 
aaggcaggcg 
taattcccag' 
aagttttact 
agcgtctgag 
tcgcct tact 
ccgcatctcg 
ggactgttga 
cgtgagcgac 
gtgttggcac 
ccagcagcac 
aaggtgtcga 
cagcagctgg 
tcgtacacca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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ccggtccgat 
actccagcac 
cctcgttgga 
tggcagaccg 
gctgggcgag 
tggagccgtt 
ggatccctct 
gtgaacacaa 
gttgccataa 
tttaagagtc 
agcatttcac 
ttaaataact 
tgctcttaaa 
gcagttacca 
tgctggtgcc 
gctgcaggat 
tgtaggccac 
tgtagaagag 
cgtaagttgg 
tcgactggtg 
cagccagatg 
cagtggatgg 
aacggccacg 
acacaatgga 
ggtccttcgc 
accgtaccaa 
tgacaaacac 
tccttttctt 
aagcaatttg 
ggggctacac 
aagaaaaaga 
tctagttttt 
caaataaata 
ttggaattta 
cacaatttag 
gaaattgttg 
gatatatatg 
agggacgagt 
caggcaatta 
ttatattgga 
ttgcagtgat 
ctaccaaaga 
cgtttctcta 
gctgcactgt 
catgcaaact 
cccgtccagg 
tcacattggt 
ttttgattga 
acttaaaaca 



cagcatgacc 
caggacgaac 
cctggatagt 
gttatcggac 
gaggtcttgg 
ggtctcgttg 
gaatttccat 
gccaaatata 
ctaatctaaa 
tcaaaaataa 
caacaatcta 
tatgagttat 
gttgtttgca 
accattatag 
cggattcggt 
gcgtccgctg 
gcagaggacg 
ctcgtatccc 
tataacaacc 
ggtatcaatt 
aagccgatga 
catatcgcgt 
gaggcggctg 
cacacccagc 
ccgcgttcgt 
ccagacggac 
gatgcacgat 
gttgcccggt 
tacccacacc 
aagtgcaagt 
cccgattcga 
caatcgattg 
tgataaactg 
tagcttacca 
acttcaatgc 
tgctagcaga 
tatgtacgta 
atttaagaca 
aggtccaccc 
atggtatttt 
tactacttta 
tagttacacc 
gttcaacaca 
attatattac 
aatatgagca 
caaccaaggt 
ctcgccctct 
cggcggtgtg 
aacagcagca 



atcgtgttga 
agcatcttga 
ggattgtgga 
acgaatccta 
ccagcgtcac 
tcaaggtggt 
tattattgtt 
atcaaacaaa 
ctaatacata 
tatagcatat 
tatctttaaa 
gatatttaga 
tattcatggc 
actgcggtgg 
gcgcttccgc 
tccttggcca 
agaagcggca 
tggtcgggcc 
agacggtgtc 
gactaaagac 
tcttgtaggc 
agtagcgctc 
aaataaataa 
agctcctgtc 
tgcattttaa 
ggagacatcg 
gcaatttgca 
ctccttttca 
ctgcccatcc 
caatagggtg 
aaaattggtc 
ttttatagca 
atcaaatgaa 
aaacacttgt 
aataagctgt 
tacatttgtg 
ttgcattcaa 
gcctgtatat 
taatgtgctt 
ctcgcaaaat 
gttacgaatg 
ttagaatcct 
tccagccaac 
attttagctg 
gtaaagatgg 
ccaccctaat 
gtcgcgtttg 
tgatgctcta 
tggcatttc 



tcacgtacag 

ccacacgctt 

ttgtggagaa 

cctgcgaagt 

cgtggtcgtc 

cgttgccata 

actattttct 

ttaacgttat 

gacgtaaaat 

tatgttaagt 

gtattatgcc 

tgcaaaataa 

ttttaaggat 

cggtcaggac 

cggccgttgt 

tgcctacgta 

ggacgagcag 

aaaactcacg 

tgctggccag 

cgctatgggc 

gtgactgatt 

acaggatatg 

atgaaagcgg 

gagcgccgtg 

ttacacaatt 

ctcttcgcac 

gaaagtgcaa 

tgtaagcctg 

ctcatattct 

gactgctgta 

aattttagcg 

ccaaatcata 

agcataattt 

gaaatttgca 

aaggtctatg 

gtttgttaga 

aagcaataag 

gttcccttta 

atcccttcag 

gatatgcaag 

atgataggtg 

aaaagaaata 

caaggtccac 

cacaattgtt . 

attacctcca 

ggggtagccc 

ttgagttaaa 

agccagaaat 



cggagtccag 
cttgctctcc 
atgggaatag 
gccgcatcgt 
gtggtggtgg 
ttggttgatc 
atgtgaaaaa 
tagcagcaac 
aattaaattt 
ggatttgtac 
atattacatg 
tacttgaatg 
tgaaaaagga 
caggatgcag 
agcggaaaca 
cagggtacgc 
caggaagtcc 
gcacttgcag 
gttgctatgc 
gtcatgcaga 
gtctgccagg 
gccaccaagg 
attaacttac 
tcaaatcgct 
tgcatattgg 
ttctgcctgt 
ttaagcaatt 
ttagatgatt 
tttgggctac 
ggtgaaaaac 
caattaaatg 
tatcttaacc 
gcttgtatgt 
aatatacata 
gtttcaatta 
tatacatatt 
cagtaaggta 
ctttaatgcg 
ccatctgcag 
ttttagattg 
atgtagtaga 
agtgttttgt 
cctaatgtga 
tactttctat 
cctgttccgt 
ttgcccttcc 
cccctgaaat 
gctagcatgt 



cagatgaaaa 
agggtcttgg 
ggaataggat 
ggacgcgaat 
tgatcacagt 
ctccgcctgc 
aaaaaaacat 
aaataaaaca 
tcaatttggt 
cttcttatcg 
tcatgtaaca 
cattcataaa 
ttagtttcgt 
ttactactgc 
ggcagcgatt 
gtgatgagga 
agcaggatgt 
tagcctgtaa 
actcacccgg 
ggatgccgcc 
atcgcgagcg 
tccaggacga 
aacaaaccga 
ttcattagaa 
gaccagacga 
tcttgtagta 
ccccagccac 
tcccattgta 
cactatcgat 
tcgacaccga 
agaaataatt 
gtaaaggcat 
atgcgtgttt 
tttgttcata 
ttaatactcg 
tgcttataaa 
aattggatca 
acacctcatt 
ctgcattgta 
atacgtacaa 
tatatatttg 
agtcgcagtt 
ctgcacttca 
tttaattatg 
tggagtgcat 
aactgcattg 
tgaagtttac 
gtgcgcttaa 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4289 



<210> 65 
<211> 1248 
<212> DNA 
<213> Drosophila 



<400> 65 

atgcactcca acggaacagg tggagccgcc tccgtggccg tttcgtcctg gaccttggtg 60 

gccatatcct gtgagcgcta ctacgcgata tgccatccac tgcgctcgcg atcctggcag 120 

acaatcagtc acgcctacaa gatcatcggc ttcatctggc tgggcggcat cctctgcatg 180 

acgcccatag cggtctttag tcaattgata cccaccagtc gaccgggcta ctgcaagtgc 240 
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cgtgagtttt ggcccgacca gggatacgag ctcttctaca acatcctgct ggacttcctg 300 

ctgctcgtcc tgccgcttct cgtcctctgc gtggcctaca tcctcatcac gcgtaccctg 360 

tacgtaggca tggccaagga cagcggacgc atcctgcagc aatcgctgcc tgtttccgct 420 

acaacggccg gcggaagcgc accgaatccg ggcaccagca gcagtagtaa ctgcatcctg 480 

gtcctgaccg ccaccgcagt ctataatggt tggcggagga tcaaccaata tggcaacgac 540 

caccttgaca acgagaccaa cggctccaac tgtgatcacc accaccacga cgaccacggt 600 

gacgctggcc aagacctcct cgcccagcat tcgcgtccac gatgcggcac ttcgcaggta 660 

ggattcgtgt ccgataaccg gtctgccaat cctattccct attcccattt ctccacaatc 720 

cacaatccac tatccaggtc caacgaggcc aagaccctgg agagcaagaa gcgtgtggtc 780 

aagatgctgt tcgtcctggt gctggagttt ttcatctgct ggactccgct gtacgtgatc 840 

aacacgatgg tcatgctgat cggaccggtg gtgtacgagt atgtcgacta cacggccatc 900 

agtttcctcc agctgctggc ctactcatcc agctgctgca atccgatcac ctactgcttc 960 

atgaacgcca gcttccggcg cgcctttgtc gacaccttca agggtctgcc ctggcgtcgt 1020 

ggagcaggtg ccagcggagg cgtcggtggt gctgctggtg gaggactctc cgccagccag 1080 

gcgggcgcag gcccgggcgc ctatgcgagt gccaacacca acattagtct caatcccggc 1140 

ctagccatgg gtatgggcac ctggcggagt cgctcacgcc acgagtttct caatgcggtg 1200 

gtgaccacca atagtgccgc cgccgccgtc aacagtcctc agctctaa 1248 

<210> 66 
<211> 415 
<212> PRT 
<213> Drosophila 



<400> 66 



Met 


His 


Ser 


Asn 


Gly 


Thr 


Gly 


Gly 


Ala 


Ala 


Ser 


Val 


Ala 


Val 


Ser 


Ser 


1 








5 










10 










15 




Trp 


Thr 


Leu 


Val 


Ala 


He 


Ser 


Cys 


Glu 


Arg 


Tyr 


Tyr 


Ala 


lie 


Cys 


His 








20 










25 










30 






Pro 


Leu 


Arg 


Ser 


Arg 


Ser 


Trp 


Gin 


Thr 


He 


Ser 


His 


Ala 


Tyr 


Lys 


He 






35 










40 










45 








lie 


Gly 


Phe 


lie 


Trp 


Leu 


Gly 


Gly 


He 


Leu 


Cys 


Met 


Thr 


Pro 


He 


Ala 




50 










55 










60 










Val 


Phe 


Ser 


Gin 


Leu 


He 


Pro 


Thr 


Ser 


Arg 


Pro 


Gly 


Tyr 


Cys 


Lys 


Cys 


65 










70 










75 










80 


Arg 


Glu 


Phe 


Trp 


Pro 


Asp 


Gin 


Gly 


Tyr 


Glu 


Leu 


Phe 


Tyr 


Asn 


lie 


Leu 










85 










90 










95 




Leu 


Asp 


Phe 


Leu 


Leu 


Leu 


Val 


Leu 


Pro 


Leu 


Leu 


Val 


Leu 


Cys 


Val 


Ala 








100 










105 










110 






Tyr 


lie 


Leu 


He 


Thr 


Arg 


Thr 


Leu 


Tyr 


Val 


Gly 


Met 


Ala 


Lys 


Asp 


Ser 






115 










120 










125 








Gly 


Arg 


lie 


Leu 


Gin 


Gin 


Ser 


Leu 


Pro 


Val 


Ser 


Ala 


Thr 


Thr 


Ala 


Gly 




130 










135 










140 










Gly 


Ser 


Ala 


Pro 


Asn 


Pro 


Gly 


Thr 


Ser 


Ser 


Ser 


Ser 


Asn 


Cys 


lie 


Leu 


145 










150 










155 










160 


Val 


Leu 


Thr 


Ala 


Thr 


Ala 


Val 


Tyr 


Asn 


Gly 


Trp 


Arg 


Arg 


He 


Asn 


Gin 










165 










170 










175 




Tyr 


Gly 


Asn 


Asp 


His 


Leu 


Asp 


Asn 


Glu 


Thr 


Asn 


Gly 


Ser 


Asn 


Cys 


Asp 








180 










185 










190 






His 


His 


His 


His 


Asp 


Asp 


His 


Gly 


Asp 


Ala 


Gly 


Gin 


Asp 


Leu 


Leu 


Ala 






195 










200 










205 








Gin 


His 


Ser 


Arg 


Pro 


Arg 


Cys 


Gly 


Thr 


Ser 


Gin 


Val 


Gly 


Phe 


Val 


Ser 




210 










215 










220 










Asp 


Asn 


Arg 


Ser 


Ala 


Asn 


Pro 


He 


Pro 


Tyr 


Ser 


His 


Phe 


Ser 


Thr 


He 


225 










230 










235 










240 


His 


Asn 


Pro 


Leu 


Ser 


Arg 


Ser 


Asn 


Glu 


Ala 


Lys 


Thr 


Leu 


Glu 


Ser 


Lys 










245 










250 










255 




Lys 


Arg 


Val 


Val 


Lys 


Met 


Leu 


Phe 


Val 


Leu 


Val 


Leu 


Glu 


Phe 


Phe 


He 








260 










265 










270 






Cys 


Trp 


Thr 


Pro 


Leu 


Tyr 


Val 


He 


Asn 


Thr 


Met 


Val 


Met 


Leu 


He 


Gly 






275 










280 










285 
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Pro 


Val 
290 


Val 


Tyr 


Glu 


Tyr 


Val 
295 


Asp 


Tyr 


Thr 


Ala 


He 
300 


Ser 


Phe 


Leu 


Gin 


Leu 


Leu 


Ala 


Tyr 


Ser 


Ser 


Ser 


Cys 


Cys 


Asn 


Pro 


He 


Thr 


Tyr 


Cys 


Phe 


305 










310 










315 










320 


Met 


Asn 


Ala 


Ser 


Phe 
325 


Arg 


Arg 


Ala 


Phe 


Val 
330 


Asp 


Thr 


Phe 


Lys 


Gly 
335 


Leu 


Pro 


Trp 


Arg 


Arg 
340 


Gly 


Ala 


Gly 


Ala 


Ser 
345 


Gly 


Gly 


Val 


Gly 


Gly 
350 


Ala 


Ala 


Gly Gly 


Gly 


Leu 


Ser 


Ala 


Ser 


Gin 


Ala 


Gly 


Ala 


Gly 


Pro 


Gly 


Ala 


Tyr 






355 










360 










365 








Ala 


Ser 
370 


Ala 


Asn 


Thr 


Asn 


lie 
375 


Ser 


Leu 


Asn 


Pro 


Gly 
380 


Leu 


Ala 


Met 


Gly 


Met 


Gly 


Thr 


Trp 


Arg 


Ser 


Arg 


Ser 


Arg 


His 


Glu 


Phe 


Leu 


Asn 


Ala 


Val 


385 










390 










395 










400 


Val 


Thr 


Thr 


Asn 


Ser 
405 


Ala 


Ala 


Ala 


Ala 


Val 
410 


Asn 


Ser 


Pro 


Gin 


Leu 
415 





<210> 67 
<211> 9249 
<212> DNA 
<213> Drosophila 



<400> 67 

caaaataaat 

cccatgctta 

gctgccatcc 

aaggaagcta 

ttgaagctgt 

aacgccacca 

ggtttcaatt 

gccatcaaag 

tgcctcgcct 

gcggctgagc 

caacaaatgc 

aatcccaagc 

gtagcttacc 

tacctcttta 

atcaaaaaca 

cataccactt 

atatggcaga 

ataataggag 

cgttagtgcg 

ccattaccat 

tactggaggg 

tacccgttct 

tcatcgacga 

aagacgtggt 

ccgacggcag 

tactggccgc 

tgcggtgagt 

ctcaaatgtc 

gcgccatggt 

tagataataa 

aagagccctt 

gtccacgaat 

gccccttttt 

gaccgcgttc 

tttatcatat 

ccgaacactt 



gtatataatt 
tgctgaaata 
ggaatggaat 
aatttcactg 
ttatggtaaa 
acctccgcat 
tcactgtctg 
ccacaaggtg 
cacagatttg 
aggcggatag 
cgaattgatg 
agcagtcaag 
ttcaaagcaa 
atattggata 
aatataagca 
tcttttccat 
caataaatct 
tcatcaaaac 
acatttggag 
gacgctactg 
cagcatcagc 
ggaggagtca 
gggactgggc 
cgccggacag 
tagccatttg 
catcctgggc 
ccttgccatt 
tgggtttcat 
ttcaatttca 
tttaacaaag 
actttcgctt 
cgtgggtgtt 
ccttttattt 
ggcttaacga 
tttccatcat 
cagagggggg 



tcaaaattga 
tggacaaaat 
gagttggaat 
ggccccaacc 
ttaaaccaat 
caactccgca 
gacgagcctg 
tttatgggct 
gatttgcatg 
aagacaaata 
caaatgtaaa 
tgatgaatac 
aaattgtaga 
ctatttatta 
agcttacccc 
ccacaaactg 
agattctttt 
gagagcaaat 
ttggaaacta 
cagaggcttc 
agttttggtg 
gcctcacatg 
agtgcggtgg 
gcgcaggaca 
gcattagtct 
aacatgctgg 
gtgcgttcta 
taatttgtcg 
ttgtgtgcat 
catgaacttc 
ttgtttaagg 
ttagctgtca 
ctttgtttgc 
gtagctacct 
ttttgcgtgt 
cgtggttggt 



catttgctaa 
tcgggcacag 
ttgggcagca 
aaacggaaat 
tgagtgggat 
atctccgaca 
cggctaacga 
taactccggc 
tgccacgatt 
gccagcacgt 
tggattcgat 
acggcttaaa 
tttcattctc 
cccagtttat 
atttaagtac 
gctttcaaac 
tcaattgcag 
gtcatacagc 
ggagcccact 
aggccatgtc 
gcgggacaaa 
ccagatattt 
gcagtgggag 
tccaggcgag 
tcgtcaagtg 
tgattgtgtc 
gccatatccg 
ccatgaatca 
ttggcaggaa 
aacatcaaaa 
taaattgccc 
aggacatgac 
tcctttttgc 
cctttttttg 
caagcgcaca 
tgactaagtt 



tgataagcct 
agcaagagat 
gatggcgctg 
ttccacccac 
tacgttttgc 
ccccgaagtg 
aatttatata 
cctgattccg 
tggggcatgc 
tgcggttcat 
atccaggaac 
agatactgta 
tagtaacgag 
tacttagttt 
ttggcaattg 
tgctgttggc 
ggtttggcaa 
atgtcgtcct 
tagactaaca 
ggccaccacg 
tgagcctctg 
gaaattcatt 
cagcatcgcc 
cgaaggatcc 
tttcattatt 
ggtcatgcgg 
ttggtcagga 
ggtggctttt 
gcttccaact 
cattagtgag 
gagctgtaac 
ttgtacaccc 
cgaacgattt 
cggccattaa 
gaactcatct 
aaatgctaga 



ctgcctcatg 
gcgcagagca 
tcgcaaaagg 
ggctcagaaa 
gcactacaac 
tttccaactg 
cccatacaat 
cctacttggc 
ggctgagaag 
ttggctgtca 
gcgtcgtcgg 
ttaacttaca 
ttcatatcca 
ttatttcaga 
caaatgtgca 
caactcataa 
acgattggca 
tattccgtgg 
acgcccgtga 
accaggacaa 
gcgagcaaaa 
gccgacgggc 
gtatccgttg 
accgacgacg 
ggtttcatca 
caccggaaat 
catgtgtgtc 
ttcccttttt 
ataagttact 
caaatattaa 
ctgccacatt 
cacccccccc 
tgtgcgattt 
tttgtcatca 
cgttcgccca 
tacgtgcgta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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attacattag 
cgccattggc 
caccttcaca 
aaagtacgta 
cccaacataa 
tgtcaaggcg 
ttcctttcgc 
aggatgggca 
atttgggacc 
cttgcgccag 
ttttcacctt 
cgaaacaaca 
agtagcaggt 
tttcacgact 
caaattaaag 
tttcctatta 
taatcaaact 
cggcttttgg 
tttgtacttt 
ttgtggccgt 
ggcgagcgat 
atgattgatg 
cacattaatc 
tcaagttccc 
ttatatccgt 
ataaacgcat 
tttgcggtgc 
tattaattaa 
cggcggtcac 
agcaatccga 
aatttaattt 
tgccattttt 
tttactcctc 
gacatgctgg 
tggatgtttg 
gccagcatca 
cattgtggaa 
tgcccatcaa 
ccgtagatac 
cgtgttcatc 
ctgctcggga 
gagtagctta 
tctattatgg 
ttattcaatt 
gactttgcat 
cctttgactt 
cagttttacg 
ggtttcctcc 
tacatttata 
attccccccg 
ggtaattcca 
taattttcgg 
tagcatccgc 
agccggcaga 
ggcagcgctg 
tcgcattgcc 
gccagccgca 
gactgcactg 
aaatcgcaac 
gagttcaaag 



aaatgttaat 
atttacttta 
tgagatatgg 
cgtgtttcag 
aacacttgga 
cagactatta 
cttggaacca 
tggtgatgag 
tgaccctgac 
catctctttc 
ttccgcctcg 
tgcattccca 
atctatt tea 
ggccactcaa 
teagactgea 
attagagctt 
tgacacccaa 
gtaacgcctc 
ttgggcgctt 
aattaggttt 
tgttaacgag 
cagtttaaga 
cgtttagacg 
gcagatgtta 
ggaatggcgt 
ttgctgctcc 
cacgcaacgt 
cccattaaac 
aegeggegta 
agtaacgatt 
gagaacttgg 
ggttgactgt 
ctccttgcag 
tggccctctg 
gttccgtgat 
tgcacctctg 
taattgaagt 
attgaatagt 
tacgecattg 
atgctattga 
tggtacacaa 
cttgaagccc 
agcccacaga 
ttcttgeaca 
geattttgea 
ttcccccgac 
cctttcaggc 
gcaaatactt 
tatatatata 
ccactgcggt 
ttccggtttt 
ctctacacct 
ggtgggcttt 
atgtcctggc 
taatgcacag 
attgeaattg 
gttggcaatt 
ctccaacgtt 
gatttcgatt 
tgaacaaggc 



aaggggegag 
tgtgcggcca 
ccacgtgggc 
tegcaaatte 
gagaggegae 
aagcagcccc 
accaacgcac 
catgggggat 
tactacgtgt 
attattgttg 
ctaggcagtt 
cttaggtccc 
atgttctgga 
cttacattat 
gatcaagtac 
tccacgttgg 
caagtttcct 
aataacattt 
taagcttget 
gcgggagatg 
tttaatagat 
agttcccatc 
gaaagcaaaa 
tggttttaaa 
tatgtacgee 
acatagegta 
tgatgatege 
aatgaaatcc 
tgcgcaatgt 
agegctgega 
cactctcaat 
gtggcttccg 
catcattacc 
tgegatgaca 
gtgcgacatg 
ttgeatateg 
tecgagtget 
tgcttgacag 
tgcagccact 
tggtgtggct 
cgaccgagaa 
ttaattggga 
tttgatgagt 
tcaaactaat 
tgecttcgat 
ccgcctcaac 
tgccagccga 
ggcaaactcc 
tatatatatg 
tatcattgeg 
ctgcgctccc 
ggaaggaege 
aattagtttt 
cgtaaatctg 
cgtaattgag 
ccgtcgcaat 
gecgagcatt 
cattttgatt 
ccgatttcga 
atacgecata 



cataaaaata 
tataatttgt 
gtttgaggga 
aaaagcccaa 
aatggtgtaa 
gggaaaacgt 
acctgacatt 
gaggctcatt 
atgccatgtt 
ggctcattgt 
taaatcatca 
atgetaccta 
taagecatet 
tcatgtcaca 
gaggaagagc 
ctgcctgccg 
cctttcattc 
tcgtgttgcc 
ccaaaaatgt 
gtttaatgtc 
aaaggtcgga 
caaagttcct 
ctttcatttg 
catgaattcc 
tgattgtttg 
tacgccgcgt 
tggtggaagt 
actcagagaa 
gcgcaccaaa 
gctctgagtg 
accactggca 
ccttggttcc 
aactactttg 
tttaatgett 
tggaacagct 
gtcgacaggt 
actgggatgt 
tttctagtga 
ggactatcca 
atcgccggcg 
ctacaagtat 
caattaaata 
aggtcgatcc 
agaaaattct 
gccacgtggg 
ggaaattgat 
caggtagcat 
ttggcagcac 
ttctctgccc 
caataaaatg 
attcaggctg 
ctgaaacgcc 
acttgtttat 
attagatgtc 
tttttattgt 
ageegttget 
acgctccatt 
tattgatagt 
tgctcccctc 
gtcagctcgt 



ttgttgccta 

caaatttatg 

tttccaggta 

caccgctggc 

cctaaccccg 

cttccactgt 

tgtggctgcc 

tagcctgcca 

geegttttta 

catgtgcgtt 

eggcaaatta 

aaaaccttgt 

gggcgacagt 

cgcatcgata 

egggaaagtg 

aaagtgtcga 

ccgagcaccc 

tetgatgegg 

agacctttcg 

ccagtgaaat 

tacacataaa 

tcattcaaaa 

cgatgttgtg 

aaccgcccat 

gaaatcagtg 

gcacctaatt 

tgcggtggcg 

gcgccttttc 

gtttccaact 

acttgaaagc 

ctcaggcttt 

tgctaagttt 

tggtctctct 

cegtcatgat 

tegaegtcta 

gagtcceggg 

cttttatacc 

caatcttttt 

ctaatcatga 

ctcctctcgt 

ctcaaatcga 

tttgtacaat 

ggtgtatgca 

aatttagttg 

cgtccatgtg 

ttatgtttgt 

atttattact 

ctcctccatc 

gtcagcagct 

ctgatgtaga 

acttcagcat • 

tgacaggect 

gctaataact 

ttctggtggc 

cgcacttggc 

catacgcaac 

acggttcgcc 

cgctaattga 

actccatatt 

cgatgagctt 



ettaeggggg 
aattgggcgc 
tccaggtagc 
atttaaagaa 
ggacattcaa 
cattgeegta 
accgagaatg 
gggacaggca 
tctcaactgc 
ttccgcacca 
gacttaatta 
gaecaagegg 
tgtgcgactt 
aagtgggtga 
tttcctcact 
ctcataattg 
actatataat 
gtgttttgcc 
gggcgtcagt 
atggcaaaat 
aagaatatcc 
gtttatctgg 
gtacgaagag 
teggttgegg 
cggacggccg 
gaettaaegt 
ttaagctaat 
gtacatgaaa 
caagttggac 
ttaatgeaca 
aggttcattc 
taacctagca 
ggccgtcgcc 
ctegggcaag 
cttctccacc 
tttactttgt 
ttcctttaaa 
tcccctttac 
cacagcgacg 
tcctgcccat 
ateegcatgt 
agtgtataac 
cctacattat 
aaaeggtate 
gcagtggcaa 
gtggctgaag 
ccattttatt 
tacaaatata 
gctggtactg 
aataaatgta 
aggctgtgca 
ttgeatacat 
gtcagegega 
cttttgtcat 
attgeagtaa 
gttggccgcc 
actggcatcc 
gttgcccatc 
teagatatge 
ctggattccc 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
54 60 
5520 
5580 
5640 
5700 
5760 
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ggcatcgtaa 
ctggtgtaca 
taattactta 
gcaaattagc 
gacgatgcgg 
catctgctgg 
atcgctagca 
aataatttgc 
caagtatgtt 
ttatgcacta 
tttctccaat 
cgcctgctcg 
atttatgcat 
tgcgggctgg 
ttattctatt 
gacttagtac 
ttgcacagaa 
gccaccacca 
atagcatcac 
tggagtaact 
acattgtctc 
tagcgttcat 
acaaaatagc 
ctctacagct 
tttcgggcat 
ttgaacactc 
taccatatat 
atggcccaac 
gtaatattta 
cacctacacc 
ctctaaaaat 
acacacacac 
tgcgcggcct 
ctttcccttt 
cacccgaaaa 
tgcttaacct 
ctaacggtag 
agagtctgtt 
gggatctgga 
ccggatccgc 
gcgtgatgcg 
agcagctgca 
caactccagt 
ctccagaaaa 
aatggctgct 
acagcaacaa 
tgcaaaataa 
gtcatggagt 
gaatttgaat 
taataaaata 
actgtgcaac 
aaaaaatttt 
ctggtccttt 
gacttaaaaa 
aaattcactt 
gtattgtttg 
gattcgttaa 
ataatactta 



tgctgtcgat 
ggtaatagct 
ctgtgtgcgc 
caaattccca 
cgtgagcgga 
ctgccgttct 
ctctaaggat 
actttccatg 
tccttgggat 
ttcaagtatc 
taaaccgcag 
ttggcatcct 
acttcaaccg 
ccatcgagta 
caatttcgct 
agggcttcgg 
gcggcggcgg 
gcgccccacc 
gggtgcactg 
tagttccagt 
ctcatttttt 
attagttgat 
ttttcctctg 
ctgcctgcgc 
ggtgaatggg 
tgcttgccac 
catttctctt 
aaaagctaag 
aattaaaacg 
ttctccacat 
tctccctttt 
acactgaggc 
ttcatttcgt 
agcctttgtt 
aaaaaaacac 
ttgagtgctg 
cat'gcgagtt 
tccctacgct 
gttcggcggt 
cgagatggcc 
tgcccgccag 
tcccctgtac 
gcttcacaat 
ccagattgcc 
aagagggcta 
tacgacaatc 
attgaaatct 
caggaatttc 
gtttgatata 
tgaaatatgc 
agtggtatgt 
taatctgttc 
cacattttat 
tgatttaatt 
taactctcca 
taaaactcat 
atacattaaa 
catacatagg 



gtactaccgc 
attcgagtgg 
ccagatccaa 
agccccggcc 
aggccgcccg 
tcctctggtg 
caagtgccga 
ctacaggtgg 
tgaaatataa 
tattccacct 
gtacatcgta 
gttttggatc 
cgacttcagg 
atttgcatat 
ttgttcgggc 
caactgctgc 
ccaccatttc 
gacgaggctc 
agaaaaaaca 
tacacgacta 
ctcagtgcag 
tgaaatgctt 
cctgaagaac 
ctgcctctgt 
acagtgaatg 
ttaccatcgt 
tgcacgcata 
cccgccaaag 
gaacctcttc 
tcatacaagc 
atgctgcaca 
acatggacgg 
ttagcatttt 
ttctgtgtga 
catacccctg 
tacagtagcg 
tatctcatct 
ttctacttct 
cccagccgac 
aactgcgtca 
tatgccgtca 
accaactaaa 
gtctgttcat 
agcttacaaa 
aacaagaatt 
ccacagatta 
tatagaagta 
ataacgagat 
cgtatataaa 
attgattggt 
agtgagaaaa 
aattatttga 
tcgattagcg 
aaaacattat 
ataaacaaag 
actctcagtt 
ataatctagt 
tattcatatt 



atttaccagg 
cccctccatt 
ggtggccgct 
gagcattcag 
caccctgggc 
cgtatacttg 
gaagttttta 
gcggaaagtg 
gaaaggcaat 
acttaaaagt 
tcctcgctgt 
ggctacttca 
gccgccttca 
accttccact 
atccaaaact 
aatcgcaaag 
gacaacagta 
acgtttagca 
tgaacaattt 
acaaataaga 
ctcagtgttc 
cttcatccac 
tgaaaactct 
ctctctatca 
ggggttgcaa 
atactaccat 
gctggcgctt 
tatgcaacag 
tttgtatgta 
gtaaatgctg 
cttcgacaca 
gtccttgttc 
ttgtgtgagt 
aaccaaattg 
tcgattgtgt 
agctaactct 
actcatagtt 
gtcgacgtgg 
ggggaaccaa 
actccacggc 
atgtcacacc 
cgaaaaacac 
gagatgagtg 
tacaaatcac 
taaccacaaa 
aataaagcct 
tgtatttcga 
gtttataaat 
aagatagatg 
aatctaaaac 
aaagtgtagg 
aatattaact 
aataaagact 
ccacatattc 
ataactaatg 
ctgttagatg 
tcattgaaaa 
cataaagaat 



aggccgaccg 
tgcccgcttc 
ctgctgctgg 
gtggagcagt 
atcatcatga 
atggctcaaa 
attaactcgg 
tgtataaaca 
gtagacttaa 
aaagaatcta 
gcgatagttg 
actcggccct 
agaagaccct 
tcgacatctt 
gaattctctg 
gacagggcca 
tcgactggca 
attattttgg 
atcaaatata 
aaaaagatat 
cgcggagagc 
atctactggt 
gcacgttctt 
gtttcattct 
tgcttgggct 
ttgccatata 
actgtgctga 
caaaacagaa 
tgcccagcta 
atgatgaaat 
ccagcaacac 
gtcccttttt 
ctgttgtttg 
tcacagacat 
cgcccgtccc 
attaaattac 
gggtgtcctc 
cagggggcga 
tggagcccaa 
ctcgtcggag 
caccacggac 
atatggcgta 
cgcagtgcac 
tcgataaaaa 
taatagctaa 
tcaagagcag 
catacaatac 
gcgcatgcaa 
ttaacccaca 
gaatcgcaaa 
ttcaaggtct 
ttaaatacga 
gacaatttta 
aaagtactta 
ttagttgtag 
taatgacaaa 
acaatcaaat 
acatataggg 



acaggagcgt 
caattaactc 
agaagcatct 
cgaccatctc 
gcgccttcct 
ttgggcaatt 
gctcatatca 
attaacaata 
gcagtagtta 
acactttccc 
catcactccg 
gaaccccatt 
caaggtgagt 
gatgggactt 
caccacctta 
acgccaccca 
acgcggaggc 
tagctattgc 
ttaagtaact 
ggaaaaaaga 
gtagatagcg 
tgctaagcaa 
tgctctacag 
cgagggtatc 
ttctaatcgc 
tagtgccata 
ttcgttgcct 
agcataaaac 
caaccagcta 
gttattcttt 
caacaccaac 
tttatatgaa 
ttttttgttt 
gtgccgccag 
acgcggcgta 
atagcgtact 
ttcattccac 
gacgatgacc 
cggaccggat 
atacacatga 
gcccagatgc 
tacttga'ttt 
agcaagtgtt 
cattcctcaa 
agcaacagcc 
caaattgacg 
atatttaggt 
atagatatat 
aaaaaaaaaa 
ttaaagctac 
ggttgaaaga 
tcaaacttat 
ttttatcagt 
accaggcctt 
cagagtgttc 
gctttgattt 
tatatatgtc 
cctaaagcgc 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
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<210> 68 
<211> 1675 
<212> DNA 
<213> Drosophila 



<400> 68 

ggtttggcaa 

atgtcgtcct 

tagactaaca 

ggccaccacg 

tgagcctctg 

gaaattcatt 

cagcatcgcc 

cgaaggatcc 

tttcattatt 

ggtcatgcgg 

cgccgacatg 

caagtggatg 

caccgccagc 

gccactggac 

gtggctatcg 

cgagaactac 

atacgccata 

gtactaccgc 

ggccgctctg 

cattcaggtg 

cctgggcatc 

cgtatcctcg 

gatcggctac 

cagggccgcc 

acgtggcagg 

aaccaatgga 

cacggcctcg 

cacacccacc 



acgattggca 
tattccgtgg 
acgcccgtga 
accaggacaa 
gcgagcaaaa 
gccgacgggc 
gtatccgttg 
accgacgacg 
ggtttcatca 
caccggaaat 
ctggtggccc 
tttggttccg 
atcatgcacc 
tatccactaa 
ccggcgctcc 
aagtatctca 
gtcagctcgt 
atttaccagg 
ctgctggaga 
gagcagtcga 
atcatgagcg 
ctgtgcgata 
ttcaactcgg 
ttcaagaaga 
gggcgagacg 
gcccaacgga 
tcggagatac 
acggacgccc 



ataataggag 
cgttagtgcg 
ccattaccat 
tactggaggg 
tacccgttct 
tcatcgacga 
aagacgtggt 
ccgacggcag 
tactggccgc 
tgcgcatcat 
tctgtgcgat 
tgatgtgcga 
tctgttgcat 
tcatgacaca 
tctcgttcct 
aatcgaatcc 
cgatgagctt 
aggccgaccg 
agcatctgca 
ccatctcgac 
ccttcctcat 
gttgcatcac 
ccctgaaccc 
ccctcaagag 
atgaccggga 
ccggatccgg 
acatgagcgt 
agatgcagca 



tcatcaaaac 
acatttggag 
gacgctactg 
cagcatcagc 
ggaggagtca 
gggactgggc 
cgccggacag 
tagccatttg 
catcctgggc 
taccaactac 
gacatttaat 
catgtggaac 
atcggtcgac 
gcgacgcgtg 
gcccatctgc 
gcatatatgc 
ctggattccc 
acaggagcgt 
aattagccaa 
gatgcggcgt 
ctgctggctg 
tccgcgcctg 
cattatttat 
tctgtttccc 
tctggagttc 
atccgccgag 
gatgcgtgcc 
gctgcatccc 



gagagcaaat 
ttggaaacta 
cagaggcttc 
agttttggtg 
gcctcacatg 
agtgcggtgg 
gcgcaggaca 
gcattagtct 
aacatgctgg 
tttgtggtct 
gcttccgtca 
agcttcgacg 
agatactacg 
ttcatcatgc 
tcgggatggt 
gagttcaaag 
ggcatcgtaa 
ctggtgtaca 
attcccaagc 
gagcggaagg 
ccgttcttcc 
ctcgttggca 
gcatacttca 
tacgctttct 
ggcggtccca 
atggccaact 
cgccagtatg 
ctgtacacca 



gtcatacagc 
ggagcccact 
aggccatgtc 
gcgggacaaa 
ccagatattt 
gcagtgggag 
tccaggcgag 
tcgtcaagtg 
tgattgtgtc 
ctctggccgt 
tgatctcggg 
tctacttctc 
ccattgtgca 
tattgatggt 
acacaacgac 
tgaacaaggc 
tgctgtcgat 
gatccaaggt 
cccggccgag 
ccgcccgcac 
tctggtacat 
tcctgttttg 
accgcgactt 
acttctgtcg 
gccgacgggg 
gcgtcaactc 
ccgtcaatgt 
actaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1675 



<210> 69 
<211> 508 
<212> PRT 
<213> Drosophila 

<400> 69 
Met Thr Leu Leu 
1 

Thr lie Leu Glu 
20 

Pro Leu Ala Ser 
35 

Arg Tyr Leu Lys 
50 

Ser Ala Val Gly 
65 

Val Ala Gly Gin 



Gin Arg 
5 

Gly Ser 

Lys lie 

Phe He 

Ser Gly 
70 

Ala Gin 
85 

Ser Ser 



Asp Ala Asp Gly 
100 

He He Gly Phe He He 



Leu Gin Ala 

He Ser Ser 
25 

Pro Val Leu 
40 

Ala Asp Gly 
55 

Ser Ser He 

Asp He Gin 

His Leu Ala 
105 

Leu Ala Ala 



Met Ser 
10 

Phe Gly 

Glu Glu 

Leu lie 

Ala Val 

75 
Ala Ser 
90 

Leu Val 
lie Leu 



Ala Thr Thr 

Gly Gly Thr 
30 

Ser Ala Ser 
45 

Asp Glu Gly 
60 

Ser Val Glu 

Glu Gly Ser 

Phe Val Lys 
110 

Gly Asn Met 



Thr Arg 
15 

Asn Glu 

His Ala 

Leu Gly 

Asp Val 

80 
Thr Asp 
95 

Cys Phe 
Leu Val 
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115 










120 


He 


Val 


Ser 


Val 


Met 


Arg 


His 


Arg 




130 










135 




Phe 


Val 


Val 


Ser 


Leu 


Ala 


Val 


Ala 


145 










150 






Met 


Thr 


Phe 


Asn 


Ala 


Ser 


Val 


Met 










165 








Ser 


Val 


Met 


Cys 


Asp 


Met 


Trp 


Asn 








180 










Ala 


Ser 


He 


Met 


His 


Leu 


Cys 


Cys 






195 










200 


He 


Val 


Gin 


Pro 


Leu 


Asp 


Tyr 


Pro 




210 










215 




Phe 


He 


Met 


Leu 


Leu 


Met 


Val 


Trp 


225 










230 






Leu 


Pro 


He 


Cys 


Ser 


Gly 


Trp 


Tyr 










245 








Leu 


Lys 


Ser 


Asn 


Pro 


His 


He 


Cys 








260 










Ala 


He 


Val 


Ser 


Ser 


Ser 


Met 


Ser 






275 










280 


Leu 


Ser 


Met 


Tyr 


Tyr 


Arg 


He 


Tyr 




2 90 










295 




Leu 


Val 


Tyr 


Arg 


Ser 


Lys 


Val 


Ala 


305 










310 






Gin 


He 


Ser 


Gin 


He 


Pro 


Lys 


Pro 










325 








Ser 


Thr 


He 


Ser 


Thr 


Met 


Arg 


Arg 








340 










Gly 


He 


He 


Met 


Ser 


Ala 


Phe 


Leu 






355 










360 


Trp 


Tyr 


He 


Val 


Ser 


Ser 


Leu 


Cys 




370 










375 




Leu 


Val 


Gly 


He 


Leu 


Phe 


Trp 


He 


385 










390 






Pro 


He 


He 


Tyr 


Ala 


Tyr 


Phe 


Asn 










405 








Lys 


Thr 


Leu 


Lys 


Ser 


Leu 


Phe 


Pro 








420 










Gly Arg 


Gly 


Arg 


Asp 


Asp 


Asp 


Arg 






435 










440 


Arg 


Arg 


Gly 


Thr 


Asn 


Gly 


Ala 


Gin 




450 










455 




Met 


Ala 


Asn 


Cys 


Val 


Asn 


Ser 


Thr 


465 










470 






Val 


Met 


Arg 


Ala 


Arg 


Gin 


Tyr 


Ala 










485 








Ala 


Gin 


Met 


Gin 


Gin 


Leu 


His 


Pro 



500 



125 



Lys 


Leu 


Arg 


He 


He 


Thr 


Asn 


Tyr 








140 










Asp 


Met 


Leu 


Val 


Ala 


Leu 


Cys 


Ala 






155 










160 


He 


Ser 


Gly 


Lys 


Trp 


Met 


Phe 


Gly 




170 










175 




Ser 


Phe 


Asp 


Val 


Tyr 


Phe 


Ser 


Thr 


185 










190 






He 


Ser 


Val 


Asp 


Arg 


Tyr 


Tyr 


Ala 










205 








Leu 


He 


Met 


Thr 


Gin 


Arg 


Arg 


Val 








220 










Leu 


Ser 


Pro 


Ala 


Leu 


Leu 


Ser 


Phe 






235 










240 


Thr 


Thr 


Thr 


Glu 


Asn 


Tyr 


Lys 


Tyr 




250 










255 




Glu 


Phe 


Lys 


Val 


Asn 


Lys 


Ala 


Tyr 


265 










270 






Phe 


Trp 


He 


Pro 


Gly 


He 


Val 


Met 










285 








Gin 


Glu 


Ala 


Asp 


Arg 


Gin 


Glu 


Arg 








300 










Ala 


Leu 


Leu 


Leu 


Glu 


Lys 


His 


Leu 






315 










320 


Arg 


Pro 


Ser 


He 


Gin 


Val 


Glu 


Gin 




330 










335 




Glu 


Arg 


Lys 


Ala 


Ala 


Arg 


Thr 


Leu 


345 










350 






He 


Cys 


Trp 


Leu 


Pro 


Phe 


Phe 


Leu 










365 








Asp 


Ser 


Cys 


He 


Thr 


Pro 


Arg 


Leu 








380 










Gly 


Tyr 


Phe 


Asn 


Ser 


Ala 


Leu 


Asn 






395 










400 


Arg 


Asp 


Phe 


Arg 


Ala 


Ala 


Phe 


Lys 




410 










415 




Tyr 


Ala 


Phe 


Tyr 


Phe 


Cys 


Arg 


Arg 


425 










430 






Asp 


Leu 


Glu 


Phe 


Gly 


Gly 


Pro 


Ser 










445 








Arg 


Thr 


Gly 


Ser 


Gly 


Ser 


Ala 


Glu 








460 










Ala 


Ser 


Ser 


Glu 


He 


His 


Met 


Ser 






475 










480 


Val 


Asn 


Val 


Thr 


Pro 


Thr 


Thr 


Asp 




490 










495 




Leu 


Tyr 


Thr 


Asn 










505 

















<210> 70 
<211> 3651 
<212> DNA 
<213> Drosophila 

<400> 70 

ctggacgccg gacacgaaag cagatcacct gccgtacagt gccaagatca tcgaacgcat 60 

ttacaatgag gagatcggtg atggcggtgg ccacagtgca cggaggataa acatgctgga 120 

gttcagtcag tatctggagc agtacctgtg gccacactac caacgtgaaa ccgccaccca 180 
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tgcccacctt 
ttggaccgtg 
gagctgtttg 
gctgttgatg 
ggccaaacga 
cgcaactctt 
caagagcttc 
aaagacagca 
ttcctgcaca 
tactgtgagc 
ttcttcaaca 
gttcgtcgcg 
ctatctcgta 
aaactacgaa 
tataatgcag 
ttttcagcag 
tctgcagggc 
ttatgaggaa 
tgcaaggaag 
ttggctttca 
gtggtcacat 
catatgccta 
tcggcgtcga 
caagcatacg 
ctcgaacgaa 
aagaggctga 
gtgaaaattc 
acaatgttac 
aattcaaact 
caggataaca 
tgagtcactc 
ctctccagta 
gactttagct 
cggttccaga 
tgcttcgttt 
atcagagctc 
cttcaaactc 
aatcccgatg 
catcggctgg 
aaaatcataa 
agccagattt 
agctgttcca 
attatcggag 
gcacacggaa 
ttactcaaat 
cagtaatagt 
atccagtttt 
ttcccacaat 
caccattagc 
ttatattacc 
atttgggaat 
ataaatcata 
ggggagtgcg 
tgatagccaa 
aaatccataa 
gcgactagga 
cactcatggt 
ggatcggaag 

<210> 71 



atgtccatcg 
tttgaaaagc 
ccgagcaaaa 
ttcattaacc 
ctagtctcct 
cacatctatc 
gagaagttcc 
aaccggaggt 
tactcgaaag 
gcttcctaga 
cggttttaga 
aggagggaaa 
cagcataaat 
aacacttata 
ctcatcgccc 
aaatgtcaca 
gttaactacg 
tagccaggct 
ataatgtttt 
atttgtgcgc 
agaagaatat 
gccagccgat 
ggaagaaatg 
cagaataata 
aggtcttcca 
agcacaggat 
tgacaaggtc 
caaccagatc 
attgctcacc 
agcgagatgc 
cacttgatct 
ctggaagtgg 
actcttacct 
ctgccggcaa 
tcggcattgg 
tagaaaatat 
acgccagcat 
acgaatttga 
aagtagtctg 
aggcagttca 
tcttttctct 
tgcgccttca 
ttcggataca 
attagattgc 
gactacaatt 
tacattttta 
agttgtccga 
tgttgcgatt 
tccacaaaca 
ttactaatgg 
ccctcatcaa 
tttctcatca 
aactgtggag 
caaatgggcc 
agttgaatgg 
gttttccgcc 
ccacgtgtat 
atagggctgt 



ttataatggc 
tcccggatca 
aggcaaagga 
actgctttaa 
tgtccatgtg 
aaagtgatca 
tgagtggcga 
cctctgggag 
cattcccgcc 
gtttattatc 
cgactgtcat 
actgtttggt 
tagttaaaat 
ttagtgtgga 
cagtcgtttg 
tgtcgctggc 
atgtgggcgt 
atagacatta 
catatactta 
aatgcggccg 
gtttaccaaa 
cgtctcttgc 
tgcgaagaca 
gcagtacttc 
aatgtaaaaa 
aatgtccgcc 
tgctgcctgg 
tctgctgaat 
tgagagtagg 
catggaaaat 
ccttgcgggc 
cctgcgaatg 
tgtcaatgca 
tatcctcctc 
tgtaatgcct 
aaaaatatct 
agtggtatat 
acttaacggg 
caaatggaaa 
agttaacgct 
ctgcgaatat 
tcagctggtt 
tttttttcca 
atgttgcgat 
aaaatcggat 
tgtgtcacct 
gctagaatat 
gtttgtcaaa 
ttaattacca 
attttttgcc 
gtggctactt 
ggattggggc 
ttggggatat 
atttaatgtt 
ccaaccaact 
tggccagctg 
ctcgaacttc 
tgatgatccg 



caacgagaag 
gtatccagcc 
ggctagtagc 
cagcatggag 
gcattgcctg 
aaaatgtatt 
aagtactgga 
agacacttta 
gaaggcgagg 
gacttggagg 
ctgattgtgc 
caggtaggtg 
taagtacaaa 
cggtggctcc 
ccgacggtgg 
gcagcacgca 
aaagcaggcc 
ctaacagcat 
tatgtaatgc 
tataccgtcc 
atggtacaag 
tcgcccacca 
gccagtgcag 
ctgccgtccg 
ccgaaactgt 
acaatgaagc 
cttaccataa 
ggattgaaaa 
aagtataaaa 
tgggtttaag 
caagcaaatg 
gattttatga 
atacaatccc 
ataatccgat 
tagcctccca 
ataagtatct 
tggccacatc 
cctgttttgc 
cggacgataa 
gcttttgttg 
aaatatttac 
ttgcaatttg 
ctcagccatt 
tatgtgccct 
gagatcagta 
atatggccga 
cgatatagat 
cattcaccca 
gggctcgcat 
tcattttact 
tatagtttaa 
caccctctgt 
ttagctgttg 
gggcatacat 
tcgttgtggt 
ttgaacttac 
tcgcaacatt 
gcgctagtga 



ttccgggagc 
ttctttcgcc 
acgctaaggg 
atagagctgt 
cagcctcgta 
tgcttgcagg 
aacgcttgct 
tgcagaacct 
tgccgcgcaa 
cacttttgcc 
gggctcttct 
tcagacttct 
tctcaatgtt 
tcagtaatcg 
ctcattgtag 
tatgtatagc 
ttccatctgc 
cagcgagaac 
ctagatgccg 
tgcggttaat 
caattggagc 
tgtgctcctg 
ccatcgttgc 
tgactcgcag 
tcagccagaa 
tcacatggct 
ggcacatggc 
ttaacgcatg 
tagatgatgg 
attttgcgga 
ttggcgaata 
tcatccatgc 
ttatcgtagt 
tgtattccat 
tcatccagta 
aaaagtttct 
tgcatggatc 
tctcctccat 
atagagaaca 
cggctggtgg 
gcttctgggc 
tgcagattgg 
catatgttaa 
gttgtattgt 
caatgcaatg 
tccatggtca 
gctagagtac 
aggcaatata 
ttatggggaa 
tccaagagat 
acagaaacat 
ttgttttaag 
gcttcgttcc 
atgttgcatg 
tgttatggca 
ccgtttcatt 
tgttgaccag 
catgactggt 



gcgtcgaagt 
acgtgctaga 
agcggacagc 
gcagagaaca 
agttcttctt 
ccgtcgcgaa 
caagaaggag 
tatcattgac 
cgtggttcac 
gactcgacgc 
gtcacccttg 
catttaataa 
ggacattgct 
ttgccgttca 
cagcaggtct 
aacagcggcg 
aaccaggaca 
atgataaaac 
atctacttac 
caacttcctg 
aaagaatatg 
tttgtaggat 
tgtgcatccc 
gaagacgttc 
gaaagctgcc 
ggtcagctcg 
tatcgttgtc 
aagcaagtca 
cgatgaccag 
gcagaaagtt 
gcacattttc 
aatttatgtg 
catggtatag 
gacgctcctc 
acactagctt 
atgcaaagaa 
cacaatttgg 
agctggtgaa 
cagaccgtgg 
atcagagggg 
aaagaacttg 
ttcttcttca 
tcatagctaa 
agtattggga 
attcaactat 
taaatgtttt 
gcttttctat 
tcactatatt 
cagaagcatc 
aatattcccg 
gccccacagc 
aactttcttg 
actgatccac 
ccagaaggga 
actttttcga 
gacctgctga 
cactaggttg 
t 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3651 



102 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 



PCT/US01/09341 



<211> 1272 

<212> DNA 

<213> Drosophila 



<400> 71 

atggaacagc 

ggaggagagc 

tccatgcaga 

gatgatggga 

atacaatcgg 

gataagggat 

ttcgccaaca 

cgcaaaatct 

atcatctatt 

atagccatgt 

accagccatg 

ttcttctggc 

ttcctgcgag 

acagcaacga 

aaacaggagc 

tttgctccaa 

ttgattaacc 

atgttctcgc 

ttgcagatgg 

ttgctataca 

tgctacaatg 

aacgattact 



tcaagttctt 
aaaacaggcc 
tgtggccaat 
ggctaaggca 
attatgagga 
tgtattgcat 
tttgcttggc 
taaacccaat 
ttatacttcc 
gccttatggt 
tgagcttcat 
tgaacagttt 
tcacggacgg 
tggctgcact 
acatggtggg 
ttgcttgtac 
gcaggacggt 
tgatgctgtt 
aaggcctgct 
tatgcgtgct 
agccaccgtc 
ga 



tgcccagaag 
cgttaagttc 
ataccactat 
ttacaccaat 
ggatattgcc 
tgacaaggcc 
ccgcaaggag 
tttccatggc 
tactctcagc 
aagccaggca 
tgtggcggac 
cggtttttac 
caggaagtac 
ggctgtcttc 
cgagcaagag 
cattttggta 
atacggccgc 
agtaatgtct 
ttacgcccac 
gcgccagcga 
ggcaaacgac 



cactacttcc 
aaattcgtca 
gctggcagct 
gccgaaaacg 
ggcagtctgg 
acttccagta 
atcaagtgga 
atctcgcttg 
agagatctgg 
gcagaccttg 
attatcctgt 
atttggaaga 
tgctattatt 
gcacatttct 
acgatcggct 
aacatattct 
attgcgcaca 
atagcctggc 
atcgtagtta 
catgtgacat 
tggggcgatg 



<210> 72 
<211> 423 
<212> PRT 
<213> Drosophila 

<400> 72 

Met Glu Gin Leu Lys 

1 5 
Phe Thr Ser Tyr Gly 
20 

Val He Gly He Pro 
35 

His Tyr Ala Gly Ser 
50 

Leu Arg His Tyr Thr 
65 

He Gin Ser Asp Tyr 
85 

Tyr His Asp Tyr Asp 
100 

Ser Thr Gly Glu Glu 
115 

Lys Glu He Lys Trp 
130 

Asn Pro He Phe His 
145 

He He Tyr Phe He 
165 

He Val Thr Thr He 
180 

Leu Val Arg He Phe 
195 



agccgatgtt 
tcgggattcc 
ctgataagct 
aagcagagga 
aaccgctata 
ctggagagga 
gtgactcaaa 
ttatcctgct 
ttggtaacat 
tcagaatttt 
gcttcagcct 
cctttcgttc 
ctgcgtatgc 
tcctcgacgc 
ggctaggcat 
tctatgtgac 
aattgaaagc 
tattcctcat 
acgccctgca 
ttctgctgaa 
agctgcatta 



caccagctat 
aaattgtgga 
agtgttactg 
gcgtcatgga 
ccatgactac 
aaatgtgcta 
ctttctgctc 
ggtcatcgcc 
tgtgacaacg 
caccgagctg 
cttggcagct 
gaggaacgtc 
ttggggatgc 
cgaatcctac 
atgcatattc 
caccaggaag 
caattttatc 
aatgtcctgg 
gacgccgctg 
aaagacctgc 
tatgaacggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1272 



Phe Phe Ala 

Gly Glu Gin 

Asn Cys Gly 
40 

Ser Asp Lys 
55 

Asn Ala Glu 
70 

Glu Glu Asp 

Lys Gly Leu 

Asn Val Leu 
120 

Ser Asp Ser 

135 
Gly He Ser 
150 

Leu Pro Thr 

Ala Met Cys 

Thr Glu Leu 
200 



Gin Lys 

10 
Asn Arg 
25 

Ser Met 

Leu Val 

Asn Glu 

He Ala 

90 
Tyr Cys 
105 

Phe Ala 

Asn Phe 

Leu Val 

Leu Ser 
170 
Leu Met 
185 

Thr Ser 



Tyr 


Phe 


Gin 


Val 


Lys 


Phe 






30 


Met 


Trp 


Pro 




45 




Leu 


Asp 


Asp 


60 






Glu 


Glu 


Arg 



75 

Gly Ser Leu Glu 
He Asp 
Asn He 



Leu Leu 
140 
He Leu 
155 

Arg Asp 
Val Ser 
His Val 



Lys Ala 
110 
Cys Leu 
125 

Arg Lys 

Leu Val 

Leu Val 

Gin Ala 
190 
Ser Phe 
205 



Pro Met 
15 

Lys Phe 

He Tyr 

Gly Arg 

His Gly 

80 
Pro Leu 
95 

Thr Ser 

Ala Arg 

He Leu 

He Ala 
160 
Gly Asn 
175 

Ala Asp 
He Val 
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Ala 


Asp 


He 


He 


Leu 


Cys 


Phe 


Ser 


Leu 


Leu 


Ala 


Ala 


Phe 


Phe 


Trp 


Leu 




210 










215 










220 










Asn 


Ser 


Phe 


Gly 


Phe 


Tyr 


He 


Trp 


Lys 


Thr 


Phe 


Arg 


Ser 


Arg 


Asn 


Val 


225 










230 










235 










240 


Phe 


Leu 


Arg 


Val 


Thr 


Asp 


Gly 


Arg 


Lys 


Tyr 


Cys 


Tyr 


Tyr 


Ser 


Ala 


Tyr 










245 










250 










255 




Ala 


Trp 


Gly 


Cys 


Thr 


Ala 


Thr 


Met 


Ala 


Ala 


Leu 


Ala 


Val 


Phe 


Ala 


His 








260 










265 










270 






Phe 


Phe 


Leu 


Asp 


Ala 


Glu 


Ser 


Tyr 


Lys 


Gin 


Glu 


His 


Met 


Val 


Gly 


Glu 






275 










280 










285 








Gin 


Glu 


Thr 


He 


Gly 


Trp 


Leu 


Gly 


He 


Cys 


He 


Phe 


Phe 


Ala 


Pro 


He 




290 










295 










300 










Ala 


Cys 


Thr 


He 


Leu 


Val 


Asn 


He 


Phe 


Phe 


Tyr 


Val 


Thr 


Thr 


Arg 


Lys 


305 










310 










315 










320 


Leu 


He 


Asn 


Arg 


Arg 


Thr 


Val 


Tyr 


Gly 


Arg 


He 


Ala 


His 


Lys 


Leu 


Lys 










325 










330 










335 




Ala 


Asn 


Phe 


He 


Met 


Phe 


Ser 


Leu 


Met 


Leu 


Leu 


Val 


Met 


Ser 


He 


Ala 








340 










345 










350 






Trp 


Leu 


Phe 


Leu 


He 


Met 


Ser 


Trp 


Leu 


Gin 


Met 


Glu 


Gly 


Leu 


Leu 


Tyr 






355 










360 










365 








Ala 


His 


He 


Val 


Val 


Asn 


Ala 


Leu 


Gin 


Thr 


Pro 


Leu 


Leu 


Leu 


Tyr 


He 




370 










375 










380 










Cys 


Val 


Leu 


Arg 


Gin 


Arg 


His 


Val 


Thr 


Phe 


Leu 


Leu 


Lys 


Lys 


Thr 


Cys 


385 










390 










395 










400 


Cys 


Tyr 


Asn 


Glu 


Pro 


Pro 


Ser 


Ala 


Asn 


Asp 


Trp 


Gly 


Asp 


Glu 


Leu 


His 










405 










410 










415 




Tyr 


Met 


Asn 


Gly 


Asn 


Asp 


Tyr 





















420 



<210> 73 
<211> 5299 
<212> DNA 
<213> Drosophila 



<400> 73 

gctgttcgca 

tggaaatggg 

tcaactgctg 

tcaattaacg 

ttgctatcct 

gaaatatata 

tgaaaccgtt 

ttttctctct 

tctcgcagga 

agttaatatg 

tcaatgggcc 

atgcgtttag 

tcacattatg 

atatacatag 

tgaatcactt 

aagccctctt 

agtttaaaac 

tattacgatt 

tttgggtcag 

gacgtctata 

gtacacagta 

ttcccgccac 

tcgcccaagg 

cgcgattttc 



cggcattttt 
caagaatctg 
ctaactggca 
caatagcagg 
tgcaaacaat 
tatatattgt 
gttatctgaa 
gttcctgcat 
tatgctaact 
cattgatgcg 
tttgttaatt 
tacttaggat 
gcacaatgta 
atatcgattg 
ttgattcaat 
ctcgtgtttc 
cataaatatt 
tcgacgactt 
tgtagtcgtt 
gcgatatccc 
tagaaaaaaa 
cacctgagtg 
caattggcaa 
ctgcgatacg 



gggccagcaa 
gtttttggct 
actggagatt 
taaaaagggg 
tgatggtatg 
ttattagttt 
gggaatgcaa 
agttcctctt 
aaggactatg 
ttattcatac 
tgtatgtgtg 
aacaattacg 
tgttatgtag 
atataacaaa 
catctcactt 
atgagaagtt 
ttattataaa 
ccggctccgc 
gtatgcggca 
gttatcagtt 
ttggatgccg 
ccaatttgaa 
actatgaaca 
ctgcctttta 



acaggatgcg 
gccattcaat 
tggcgattgc 
acatcttttc 
tcgtaagttt 
atggccccaa 
cgattcgagc 
tcatttcgca 
ccaccatatt 
gccccgtggg 
tgtatgtgta 
tttaggttaa 
tatatatata 
ttaaattttc 
gtgattccgc 
tcatgatatc 
tattcactat 
cataattact 
catagagact 
ctcgaaccgt 
catgtaggga 
tttccatata 
ttaacaccac 
gtccagaatt 



gaaaaccgga 
taggtatcgg 
agctgcccat 
atcgagatct 
gtgcgaataa 
aaaatgctac 
ttcatattga 
tctttttcca 
cgacaatttg 
acgggcgaat 
ttttgcgtgt 
ctagcataaa 
tcgtatatat 
agctaaccgc 
gtgatttgct 
atctatcact 
tcacgctatc 
accataatga 
cgtattgtta 
ctttcggtac 
ggccaatagg 
ctgattgact 
cgcaataagt 
tgatttgcgc 



agctgggaac 
tcagcccaat 
tgtcaacact 
cgaccttgtt 
aatggttttg 
aacacaatat 
gaaggtcctg 
ctggccgcag 
ttgggcacca 
agcaaaactc 
atttgatttg 
tctataatca 
agagagactg 
acgcttcaac 
ggcaaattta 
cctctcatgg 
aggcgacccg 
tcgccattta 
cgcgcatgtc 
ttatcggaaa 
ttacacacat 
atcaaataga 
gtgatcgtca 
taaggattca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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gaatttatac 
ttatgctgct 
tgagaatgga 
tgaacaggtt 
tgtaataggt 
aagaacaacc 
aacccaaagc 
tacaccgctc 
cgacggtatt 
gaatgcatcc 
cagcaaactc 
ctccagcgtt 
atgtcgtgga 
cttataaagc 
attaatacaa 
gctctccatc 
tggactccat 
tggccagtgg 
tttggttgca 
cgaatgccgt 
ataaagttgc 
tgctatcacc 
tatcgcaatg 
ccggtaaaaa 
caaagatgca 
tatttattca 
ctgtgccttt 
ccaaaccatc 
tcaaaacgca 
gagggtggga 
ggttcggaaa 
aagcgctgct 
aacagcagca 
aattgatggg 
tccggggatc 
ccacttagtg 
attgttttca 
aatataaaaa 
actcgtgttt 
ttgcatactt 
atttggtcag 
gtagagcttg 
cgtcaggtag 
catgcgcttt 
tgttgggcac 
cgctgccaag 
ggttgtgcgc 
ggcattttcc 
aaaactaact 
ttctcttgta 
ccattcatgc 
ttaaaataat 
ttgctaagat 
ggacatatgg 
ttccggcaga 
ttgcatttgg 
gtacccaacg 
ttaaagtgca 
cgcttcatcc 
acatgcagtt 



aacatatcat 
ggcattggtc 
gttgaaggtg 
ccacagatag 
cacctaaaac 
ttttaagcaa 
actgatatca 
gtgattttca 
tccaacatta 
gtattgttga 
tctgtgaccc 
tttctcagtg 
atgttctcta 
tgtctaaaag 
cttcgatttt 
accactgcca 
ggcggagtcc 
gggcgtgact 
cggctggtag 
cgatagtgtg 
atatggaaat 
tttttggcca 
aatctctcga 
cagagaaagc 
ctgcaaatat 
ggttaagtgt 
tttccgtgta 
catttttgac 
gcgcactcta 
tattcaaggg 
aacgctatcg 
tactatacct 
gcagcagcag 
cccatcgctg 
cagacgaggc 
ctctccaaac 
ccgcatgata 
cacccttttg 
cgatttaaag 
tgcgactgct 
tactcaccga 
aaatgatact 
gcaatatccg 
ctaaaaatac 
tcgacgcatt 
tgcaacacca 
agagggggtg 
gaaatgcgca 
gttagcattt 
ttcccgtaaa 
cgtcaacgct 
aaagttcatg 
cttagcaagg 
tgtcatagca 
cataaaaaac 
cacataaaat 
tagttgttgc 
gcggtggcta 
atgttccact 
ggatgcacag 



catcatcatc 
aggtcaccac 
gccaatagaa 
ctgtatagct 
atatagatag 
tggaatatcc 
ttttaagtta 
ctggttaaca 
ttttactcac 
tgttgtgatt 
ctgacctctg 
tgggcagcgt 
gttgagggca 
tatgcttcgt 
tccaagtgta 
ctgccccagt 
ggcgattgcc 
gccacgggtg 
gcgtcatcaa 
gacagcaaac 
ggattctatt 
gcgattccgt 
tggtgaagca 
aattgtgaat 
tttgggaaaa 
ctaatataga 
ctctggaatt 
cgcaccacca 
agcgattgac 
gggtgagggg 
catggcagtg 
cgtaaattgg 
cagcagcagc 
gaatccggtt 
taacccgggg 
gttgcattcg 
tgtacgcttt 
gtaaattttt 
aacactaatc 
ttgaattatt 
aactgtcgca 
tgggatagtc 
taatggccag 
cgtcaagaat 
tggaaatcag 
attgaaatag 
gaaaattcca 
caaatgtcgg 
agttttctcc 
acaaaataaa 
caatgttaag 
ctataaatat 
aacacagttt 
gtcagcctaa 
acacccatcc 
ggctcttatt 
tgttgttatt 
caattaaatg 
tatttcacac 
agagaaaatc 



atcgtttcga 

gcagattctt 

gtagggggaa 

catcatagcc 

atacatagaa 

caagcgacta 

tcaccgaatt 

attttctcat 

attttgacag 

ctcagcgctg 

ggccgcgtct 

cacaaatcca 

aagaaagtaa 

acttttcaat 

cactactgct 

cgacgagctg 

aaatggtcgc 

gcggcggcat 

tctgcctagt 

agaagatggc 

gttacatacg 

gcagaatgtt 

cacggctatg 

gggggcgtgg' 

tccaatccaa 

ttggtaaatt 

ataaataaac 

atttatttgc 

gtccgtgctg 

ggtgaaaagt 

agcctcagct 

cagagctcaa 

agcagcagca 

gatacggggg 

aactggggaa 

aactgttttt 

tcttattgta 

tcaaattgaa 

gagttacttt 

gaattattac 

cagccagaca 

atagtgctgg 

tgccgttaga 

atcgaaaaga 

gccaaaccat 

caaattgcct 

ggcggttgat 

cacagtcata 

ctggcatcgc 

ataacgtaat 

ctagttgctt 

caataaattt 

atcttttttt 

cgcacagagc 

acacagcata 

gtttcagtta 

gttatcgttg 

tcgtgtttaa 

acacagacac 

gtgcaagtta 



ataactcact 
ggaccgatga 
taccacgaag 
cagt tccgaa 
tacatacata 
agtaaaagca 
tatcatagtc 
gtttttttct 
tattccgtta 
cgcttgcgac 
gccaccttct 
gatgactgcc 
tgcaactatc 
gcatacttct 
ataccctcgt 
attagagctg 
cggcgtgggc 
gggcgagatg 
accaatcgta 
caccgctggc 
cctatttcag 
tgcctcttga 
taaattgaga 
cagtacaata 
aaaggatata 
aaattcaaac 
gcagctccgc 
cgcgcgggct 
tgctgatgag 
ggggtgggaa 
gaccgctggc 
tgaaattgat 
acagccgcag 
gaaatggggg 
tttatggaat 
gttttttgcc 
gttttgcggg 
agcttattta 
cggtctcaca 
tccattgcta 
aaatagccat 
agtgacagga 
tatatgttcg 
attgcaaatg 
ttcggtggac 
gaaagccaaa 
gtgatgggcg 
aagccaaaag 
ccgcagctct 
gaaaatattt 
tgaaaagttg 
tgtttagcaa 
ttcacgtttt 
acttgcaact 
tcgcctatct 
ctgttattgt 
ttggctgtgt 
tgcgctgcgt 
acacacacac 
gctggcattt 



tttgtgcgcc 
accaatagaa 
accagcaggg 
agaccatgat 
gatttccatt 
tgtcttacaa 
atggcgcttg 
ccagggaaat 
cgttaaatgc 
gccacaattg 
gatcctgctc 
agtgacggcg 
aactacttca 
aagcatctct 
tactcacctg 
ccggacgtcg 
agcgggggcg 
ttggccaccg 
attgtatgct 
aatgtagaat 
ccaactcacc 
gcggatagcg 
tgtaacctgt 
cgaacgcaat 
tatacatata 
ttcggttaga 
tgttaatacc 
gtcaatttat 
gttgctgcgg 
agcggaaaac 
atttcgcaga 
gcacaacaga 
gtgcaattcg 
atacattgga 
ccgcaccatt 
cccggcagca 
aataatgtta 
agcataaaat 
accttattta 
cttaattcta 
agagctgcca 
ttaactggca 
tcgtgcatct 
gtgttgattc 
aatgcaaaaa 
cgatcggtgg 
cggggccttg 
atgaacaata 
gatagttaaa 
aattaaaaag 
taaatatggt 
ataaatgaat 
cctgggaaat 
gtcgcatatt 
gtgtgcgttt 
tgttgcactt 
ggaaatgatt 
acctgc'tcgc 
gcacaacagg 
taaatggaaa 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
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attgtaactt aaaactagtt taaaacaagc tcaaacgaat aagttgaagt ttggaataaa 5100 

attaatttga aaaggctcat atattttata tgcattgcat actttttagt acttaaccaa 5160 

atgcatgaaa atccaattag tcgcttttgc atacctaata acttaacatt cttaacgtgg 5220 

cagctaaaac ttctttcaaa tgccatcatc agctaaatga ttcatattta attcgaaaat 5280 

cttgtgaccc tgctgcatg 5299 



<210> 74 
<211> 1467 
<212> DNA 
<213> Drosophila 



<400> 74 

atgactgtgc 

tcaaccgcct 

ggcaatttgc 

tggtttggcc 

cttttcgata 

ctaacggcac 

cagcactatg 

gtctggctgt 

gagagattca 

gccaaaaagg 

ttcgagcata 

caaacggtgg 

gccacgcccc 

tccacgacgt 

gagccagaga 

cccacactga 

tcaggggtca 

aacatcaaca 

tattacaatc 

ctgttcaccc 

attctcattc 

agcataaggc 

agacgtcgac. 

acccaaataa 

cgtaatacgg 



cgacatttgt 
ggaattttcc 
tatttcaatt 
tgatttccaa 
ttcttgacgg 
tggccattac 
actatcccaa 
gcgacagttt 
ttgcgatacg 
tgatagcagc 
caattacgat 
ccaacatctc 
cgctgcccac 
ccggcagctc 
acattccacg 
gaaaaacgct 
cagagagttt 
atacggatgc 
atggtctttc 
tgctggtctt 
tattggttca 
gcacaaaata 
atgcgcgtaa 
atggcgatca 
gtcgcctgat 



gcgcatttcg 
accccctctg 
ggtgttgcac 
atgcgtcgag 
tatttttaga 
ggatattgcc 
gtatcatttc 
cggttacatc 
ctatccgctc 
ggtggccatc 
tggtactagg 
gcccatgccg 
gccggcgacc 
taatcagctc 
acatcgccgt 
ggaggagcag 
gctgcaattg 
attcgcattt 
ggaactgggc 
cgtggtattc 
tcggtccaag 
ccgaaagacg 
caatacgagt 
ttatggtagt 
agcgtga 



gaaaatgccc 
cgcacaaccc 
ttggcagcgt 
tgcccaacag 
aagcgcatga 
tacctgacgt 
aagctctact 
tcaatttaca 
aagaggcaaa 
ttctgtttgc 
cagattgatg 
ccgccacccg 
atttggcaat 
gtcgactggg 
cactggcagt 
gatcagaagg 
tggcgtcgca 
aacgtaacgg 
tatgatgagc 
cccctacttc 
aatctgcgtg 
gttcgagaac 
ctctatgtgc 
aattatggcg 



aaggccccgc 
caccgatcgt 
ttttgcattg 
aatcaacacc 
gatgcacgac 
gccagttaat 
ggcagctcta 
tagccgtgtg 
cattctgcac 
tgtccacact 
acgcctacca 
tggcagtcac 
cgccggactc 
gcagtggcag 
catctggatt 
tggcagacgc 
agcgcagcgc 
aatactgtca 
tatacagcta 
tattggccac 
gtgacctgac 
tgataacggg 
cgcatacaac 
gagccggaag 



gcccatcaca 
ttggctttca 
tccaccgaaa 
atttgcaatt 
gaacatatat 
cctgtcactc 
tggctatttt 
cttcaccatc 
ggaatcgctg 
atcgacggca 
gccgtgcaac 
gcccccactg 
cgccatggag 
tggtgatgga 
tgtgacgctg 
ggcccagagg 
tgagaatcac 
aaatgtgacc 
tctgtggaac 
cttcaactcc 
caatgccagc 
atatcgctat 
gactacactg 
tcgtcgaaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1467 



<210> 75 
<211> 488 
<212> PRT 
<213> Drosophila 



<400> 75 



Met 


Thr 


Val 


Pro 


Thr 


Phe 


Val 


Arg 


He 


Ser 


Glu 


Asn 


Ala 


Gin 


Gly 


Pro 


1 








5 










10 










15 




Ala 


Pro 


He 


Thr 


Ser 


Thr 


Ala 


Trp 


Asn 


Phe 


Pro 


Pro 


Pro 


Leu 


Arg 


Thr 








20 










25 










30 






Thr 


Pro 


Pro 


He 


Val 


Trp 


Leu 


Ser 


Gly 


Asn 


Leu 


Leu 


Phe 


Gin 


Leu 


Val 






35 










40 










45 








Leu 


His 


Leu 


Ala 


Ala 


Phe 


Leu 


His 


Cys 


Pro 


Pro 


Lys 


Trp 


Phe 


Gly 


Leu 




50 










55 










60 










He 


Ser 


Lys 


Cys 


Val 


Glu 


Cys 


Pro 


Thr 


Glu 


Ser 


Thr 


Pro 


Phe 


Ala 


He 


65 










70 










75 










80 


Leu 


Phe 


Asp 


He 


Leu 


Asp 


Gly 


He 


Phe 


Arg 


Lys 


Arg 


Met 


Arg 


Cys 


Thr 










85 










90 










95 




Thr 


Asn 


He 


Tyr 


Leu 


Thr 


Ala 


Leu 


Ala 


He 


Thr 


Asp 


He 


Ala 


Tyr 


Leu 



100 105 110 
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Thr 


Cys 


Gin 


Leu 


He 


Leu 


Ser 


Leu 


Gin 


His 


Tyr 


Asp 


Tyr 


Pro 


Lys 


Tyr 






115 










120 










125 








His 


Phe 


Lys 


Leu 


Tyr 


Trp 


Gin 


Leu 


Tyr 


Gly 


Tyr 


Phe 


Val 


Trp 


Leu 


Cys 




130 










135 










140 










Asp 


Ser 


Phe 


Gly 


Tyr 


He 


Ser 


He 


Tyr 


He 


Ala 


Val 


Cys 


Phe 


Thr 


He 


145 










150 










155 










160 


Glu 


Arg 


Phe 


He 


Ala 


He 


Arg 


Tyr 


Pro 


Leu 


Lys 


Arg 


Gin 


Thr 


Phe 


Cys 










165 










170 










175 




Thr 


Glu 


Ser 


Leu 


Ala 


Lys 


Lys 


Val 


He 


Ala 


Ala 


Val 


Ala 


He 


Phe 


Cys 








180 










185 










190 






Leu 


Leu 


Ser 


Thr 


Leu 


Ser 


Thr 


Ala 


Phe 


Glu 


His 


Thr 


He 


Thr 


He 


Gly 






195 










200 










205 








Thr 


Arg 


Gin 


He 


Asp 


Asp 


Ala 


Tyr 


Gin 


Pro 


Cys 


Asn 


Gin 


Thr 


Val 


Ala 




210 










215 










220 










Asn 


He 


Ser 


Pro 


Met 


Pro 


Pro 


Pro 


Pro 


Val 


Ala 


Val 


Thr 


Pro 


Pro 


Leu 


225 










230 










235 










240 


Ala 


Thr 


Pro 


Pro 


Leu 


Pro 


Thr 


Pro 


Ala 


Thr 


He 


Trp 


Gin 


Ser 


Pro 


Asp 










245 










250 










255 




Ser 


Ala 


Met 


Glu 


Ser 


Thr 


Thr 


Ser 


Gly 


Ser 


Ser 


Asn 


Gin 


Leu 


Val 


Asp 








260 










265 










270 






Trp 


Gly 


Ser 


Gly 


Ser 


Gly 


Asp 


Gly 


Glu 


Pro 


Glu 


Asn 


He 


Pro 


Arg 


His 






275 










280 










285 








Arg 


Arg 


His 


Trp 


Gin 


Ser 


Ser 


Gly 


Phe 


Val 


Thr 


Leu 


Pro 


Thr 


Leu 


Arg 




290 










295 










300 










Lys 


Thr 


Leu 


Glu 


Glu 


Gin 


Asp 


Gin 


Lys 


Val 


Ala 


Asp 


Ala 


Ala 


Gin 


Arg 


305 










310 










315 










320 


Ser 


Gly 


Val 


Thr 


Glu 


Ser 


Leu 


Leu 


Gin 


Leu 


Trp 


Arg 


Arg 


Lys 


Arg 


Ser 








325 










330 










335 




Ala 


Glu 


Asn 


His 


Asn 


He 


Asn 


Asn 


Thr 


Asp 


Ala 


Phe 


Ala 


Phe 


Asn 


Val 








340 










345 










350 






Thr 


Glu 


Tyr 


Cys 


Gin 


Asn 


Val 


Thr 


Tyr 


Tyr 


Asn 


His 


Gly 


Leu 


Ser 


Glu 






355 










360 










365 








Leu 


Gly 


Tyr 


Asp 


Glu 


Leu 


Tyr 


Ser 


Tyr 


Leu 


Trp 


Asn 


Leu 


Phe 


Thr 


Leu 




370 










375 










380 










Leu 


Val 


Phe 


Val 


Val 


Phe 


Pro 


Leu 


Leu 


Leu 


Leu 


Ala 


Thr 


Phe 


Asn 


Ser 


385 










390 










395 










400 


He 


Leu 


He 


Leu 


Leu 


Val 


His 


Arg 


Ser 


Lys 


Asn 


Leu 


Arg 


Gly 


Asp 


Leu 










405 










410 










415 




Thr 


Asn 


Ala 


Ser 


Ser 


He 


Arg 


Arg 


Thr 


Lys 


Tyr 


Arg 


Lys 


Thr 


Val 


Arg 








420 










425 










4 30 






Glu 


Leu 


He 


Thr 


Gly 


Tyr 


Arg 


Tyr 


Arg 


Arg 


Arg 


His 


Ala 


Arg 


Asn 


Asn 






435 










440 










445 








Thr 


Ser 


Leu 


Tyr 


Val 


Pro 


His 


Thr 


Thr 


Thr 


Thr 


Leu 


Thr 


Gin 


He 


Asn 




450 










455 










460 










Gly Asp 


His 


Tyr 


Gly 


Ser 


Asn 


Tyr 


Gly 


Gly 


Ala 


Gly 


Ser 


Arg 


Arg 


Asn 


465 










470 










475 










480 


Arg 


Asn 


Thr 


Gly Arg 


Leu 


He 


Ala 



















485 



<210> 76 
<211> 3946 
<212> DNA 
<213> Drosophila 

<400> 76 

ataaatggcc gaatttgcaa gtcagttcca cttgctcttt aaaaagcgaa tctcaccgaa 60 

atttaagcaa aatttatcac cgaccaatgt gaaattccac taataatttt tcagcggtgt 120 

tgatggcttt attgacatcg ccagaaatga ttcgagctgt agcagcgaaa atttaatcaa 180 

agacctaacc cacgaacctg gacgttctca catagaaaat gaaaacgaaa gctgaaagac 240 
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agatggatcg 
gtaaaattaa 
gccgtcgcac 
agtggccgag 
aaaaggaaaa 
ataatgacag 
atgaaaccga 
ttccattgaa 
acaagtatgt 
atatatattt 
tctgctgatg 
ttttgctttt 
gcccctttgc 
gacagctgac 
tccgaccgcc 
acttaccacc 
tggatgcaga 
ggaagaacga 
ggtcctccac 
acatcaccaa 
tgtgccgctc 
ctccgatggc 
tgatggctag 
gctccgagaa 
tgtagatctc 
agttggtctg 
tggtgtatac 
gcagcgcaaa 
gactaatcag 
ccactccgaa 
ccaaactagc 
gcgacgggga 
gcatggtgag 
cgaagggcaa 
tcacccgtcc 
ctggtgtggt 
agaattccgt 
gatccaattg 
tgctacttgt 
tactcggctt 
cccgccgcca 
tcaactgcat 
gtgaccttcg 
cgcacactaa 
gcaggcccta 
gatcgaccaa 
tttgagacca 
ttgctaaccc 
tggggcgctt 
cctttgaatt 
cttcaacaac 
cgaaagctat 
ttccgtgcta 
ttccatttga 
ttttttcctc 
ggatccacac 
aatttcggtt 
atctattagc 
aacaaaaggc 
aatggtatat 



ttcgatgtga 
tataccaaac 
agtggctgca 
aaatgaaagc 
ttgctacaca 
ggcgcggttt 
gagtcccaca 
atcttatttg 
tggacacaat 
gaattcatgt 
ctcacagtgt 
ggaaagacca 
gcacatgagg 
aagggcagag 
tcgggcagaa 
tccattgttg 
gttgggcagt 
gggcgagtag 
gtacaggccc 
gaagaccgtg 
ggccacggcc 
cagcccaatg 
cacggcgatg 
ctgcgtcttg 
cggagtcccg 
tgaggccacc 
ctccggtggt 
gaggagcagc 
aaatacgcat 
gcggatggcc 
gcaggcgaac 
cgtccgccag 
ggtggccagg 
gtgggtggac 
agtggtgccg 
ggtggacgtg 
cgacatttcg 
tttggcagat 
agtacttgta 
aagcacctgg 
aatgttgttg 
gatcagcagc 
atgccctgga 
ccacgcactc 
gatcatgaat 
ctgcaactga 
gtttgacggc 
ggctaacact 
tatatgattt 
tgatacaaca 
taatgttcta 
ggtacttaat 
atccgctgga 
taattattcc 
tgcggttggt 
tcaaaccgac 
tatgggctct 
caaccgcttt 
caaatagcga 
cgatatacaa 



gctcttttta 
aattacacgg 
agccacgcaa 
gctcaaattt 
tactaattgg 
gcgaagctgt 
gcagagtaat 
gacatataaa 
tgataaggaa 
tgtcatgata 
ttgaaagtta 
agcaaatgga 
caagaaagaa 
gaatggggcc 
aatgtgacaa 
gcactggagc 
ccgcagccgc 
ccggaaccgg 
tccttgccca 
gcggatgtgg 
aggccgcata 
taggtggcct 
aagacgatca 
cagagcggta 
tccagcgttg 
gtggtgctaa 
tgggtgagca 
aatccctggt 
ttgaccagca 
ccgcaactga 
agaccgagaa 
gccttgaaga 
actagcaccg 
actatactgg 
gctgctcgcg 
gtgctgctag 
agacgggcag 
cgctgggtct 
gtcgtgctgg 
taggcgttgc 
ccattaacgc 
aggatcagca 
cctctgacaa 
gcgatcgcga 
tcttcttctc 
ctggcggctc 
agccaagtcg 
tacaggaacg 
tttgcaatgt 
ttcgttactc 
aatagctcaa 
ttcctattta 
acatatactt 
aattagattt 
tgcaacatca 
tgcgggatgg 
gatgctcgag 
tacggtctcc 
aatccgagca 
agtgttgagg 



cacttggctt 
cccggcaaaa 
attcgccgcg 
gcatactcgt 
aaaagaaatg 
ctaattattt 
ggatttttct 
ccatacgtag 
tctttagtac 
gctactcgaa 
agacttttgc 
taaacaagag 
aagaaagtta 
gaaatggaac 
atttggaggg 
agtgtttgga 
tcatcacgcc 
cgcgactcag 
tggccgccag 
ctaccagacc 
gcatccatcc 
cacggtagtt 
gaaagacaat 
ttagtacggt 
tgtaggacgt 
ggaagccact 
gccactgccc 
atggagccgg 
gggcggcaaa 
tcaacgaggg 
gcagcatctg 
gcacaaagat 
gcaccaccca 
gcggtgtgct 
ccttcgcccc 
aactgctggc 
cactggagac 
gaggagctgc 
tagtagtggt 
gatccgattc 
tgaggctgcg 
gcaggtggcg 
tataatcaaa 
ttgttcaggt 
gattctcgcg 
tgcaagttgg 
aactaacact 
caacatgatg 
agagtattgc 
gatcttctgg 
tgtcttgctg 
gagccagtaa 
taacacaaca 
catattgcgg 
actgcagacg 
agaacacttt 
ttggctggta 
tcgcggagcc 
caaaaggtga 
cgggaaaggc 



acacgacgac 
caattttaag 
aaaaattgcc 
actgtcaaat 
caaaaccaat 
atttgacatt 
ttcaactgct 
aatgataaga 
tttctcacct 
ccactgtgcg 
tttccttccc 
ttcaagaggt 
ttacaaaatt 
tgtatctagc 
gaaaagctcg 
gctgagtccc 
gtacttgatg 
ggaactgaac 
ttgcctgccc 
gaaggccacg 
cagccagatg 
gtccctgatg 
gtagatcagc 
gctaaccgcc 
ggggtaaaac 
gcccatcaca 
tccgatcgct 
cagatacacc 
gaccagggcg 
ctgtgcggtg 
gcccagaaag 
ctcgaacgct 
cggctccttg 
gttggagctg 
tgaactccga 
cgatagtgtg 
attgccaccc 
cggagtagtg 
ggtgctggtg 
ctggctggtc 
cgacgccggc 
tctaccggcc 
cgcggcacat 
tcgtaaattc 
ctaaaacgtc 
gctttggacc 
tgtttgagtt 
cgtgggaggc 
acgagcattt 
ctttaaggtt 
aagggctatt 
gaactcaatt 
tttacatacg 
gatattaaca 
cgatccacag 
cattcgttct 
ggaaagtgct 
tctctcctcg 
ttgtaaatca 
atcgccaata 



tgcggcgtcg 
cccagttaat 
cacaattgcc 
gaagcgtagc 
gcaaatgaaa 
ggcaatgaca 
taagttcgtt 
ataataaaaa 
aaaaaaggtg 
ttggctacag 
ccgacttggt 
tgtcagcccc 
aacagaggct 
ttgagttggg 
tatgagcggg 
tggcccgtgt 
ggcttgaagt 
tgctcctcgc 
ttgggcatga 
caagcgtcct 
gggatggctc 
ccgcgcgact 
gagaagagca 
gctatccggg 
agtgccgagt 
ggcacactgg 
acctggatga 
cctccattta 
taggccacgc 
atgatggccc 
aggtgcctac 
gccatcatga 
cgtagtcccg 
ctcccgggag 
cggggcgtgg 
ggtaccacaa 
ggaaaattga 
gtactagtgg 
gcattacggc 
accaggcgca 
gtggtggcga 
attggatgtg 
ctcttgcaaa 
ttcagtcctc 
cgctcgctcc 
ggcttagtcg 
cgatgtgcgt 
agttggggtt 
actaaagca't 
tagtatatag 
gatttcctct 
tatctttagc 
taagtttttt 
ttggacacaa 
acctagctca 
gcatatgcgg 
atgggccatt 
aaaacgagcc 
aaagagcgct 
gcgttagaaa 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 
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cggcaatagc aatagcaacc cgctctgatt ccttaggtct tattgtattc ggcgcgcgct 3900 
ttcagtcaac cgcagttaga attcagatca cctgaatgag tgggca 3946 

<210> 77 
<211> 1946 
<212> DNA 
<213> Drosophila 



<400> 77 

tcgatcggag 

gcctgcgagg 

tgtgcgtttg 

ggtcaccaca 

agttgatcgc 

ggcgggtgcg 

cgagtagccg 

gtagcaccac 

tggatctcaa 

aattctttgt 

caccagccac 

gggtgactcc 

ccttcgcggg 

ccatgctcat 

cgtcgcgtag 

gtttgggggc 

gagtgggcgt 

ttagtctaaa 

cgctgctcat 

acaccaccag 

ccaactactc 

tctacacccg 

cggagctgct 

ccatcaagtc 

tcggaggagc 

ggcacaagga 

tgatgttcat 

aggaccgcga 

tcttccactt 

catccaacac 

gtaagtcccg 

gtcggaccca 

gctgtcagcc 



cgagcggacg 
actgaagaat 
caagagatgt 
tccaatggcc 
caccacgccg 
cctggtgacc 
taatgccacc 
tagtaccact 
ttttccgggt 
ggtacccaca 
gccccgtcgg 
cgggagcagc 
actacgcaag 
gatggcagcg 
gcacctcttt 
catcatcacc 
ggcctacgcc 
tggaggggtg 
ccaggtagcg 
tgtgcctgtg 
ggcactgttt 
gatagcggcg 
cttctcgctg 
gcgcggcatc 
catccccatc 
cgcttgcgtg 
gcccaagggc 
ggagcagttc 
caagcccatc 
gggccaggga 
ctcatacgag 
actcaagcta 
tctgttaatt 



ttttagcgcg 
ttacgaacct 
gccgcgtttg 
ggtagacgcc 
gcgtcgcgca 
agccaggaat 
agcaccacca 
actccggcag 
ggcaatgtct 
ctatcggcca 
agttcagggg 
tccaacagca 
gagccgtggg 
ttcgagatct 
ctgggccaga 
gcacagccct 
ctggtctttg 
tatctgccgg 
atcggagggc 
atgggcagtg 
taccccacgt 
gttagcaccg 
atctacattg 
agggacaact 
tggctgggat 
gccttcggtc 
aggcaactgg 
agttccctga 
aagtacggcg 
ctcagctcca 
cttttcccct 
gatacagttc 
ttgtaa 



agaatcgaga 
gaacaatcgc 
attatattgt 
acctgctgct 
gcctcagcgt 
cggatcgcaa 
ctactaccag 
ctcctcagac 
ccagtgctgc 
gcagttctag 
cgaaggcgcg 
caccgcccag 
tggtgccggt 
ttgtgctctt 
tgctgcttct 
cgttgatcag 
ccgccctgct 
ctccatacca 
agtggctgct 
gcttccttag 
cctacacaac 
tactaatacc 
tctttctgat 
accgtgaggc 
ggatgctatg 
tggtagccac 
cggccatggg 
gtcgcgccgg 
tgatgagcgg 
aacactgctc 
ccaaatttgt 
catttcggcc 



agaagaattc 
gatcgcgagt 
cagaggtcca 
gatcctgctg 
taatggcaac 
cgcctaccag 
cacgactaca 
ccagcgatct 
ccgtctcgaa 
cagcaccacg 
agcagccggc 
tatagtgtcc 
gctagtcctg 
caaggcctgg 
cggtctgttc 
ttgcggggcc 
ggtcaaatgc 
gggattgctg 
cacccaacca 
caccacggtg 
gctggacggg 
gctctgcaag 
cgtcttcatc 
cacctacatt 
cggcctggcc 
atccgccacg 
caaggagggc 
ttccggctac 
ctgcggactg 
cagtgccaac 
cacattttct 
ccattcctct 



atgatctagg 
gcgtggttag 
gggcatcgaa 
ctgatcatgc 
aacatttggc 
gtgcttaagc 
agtactacaa 
gccaaacaat 
atgtcgacgg 
tccaccacca 
accactggac 
acccacttgc 
gccaccctca 
cggacgtccc 
gcctgcgcta 
atccgcttcg 
gtatttctga 
ctcctctttg 
ccggaggtat 
gcctcacaga 
actccggaga 
acgcagttct 
gccgtgctag 
gggctggcca 
gtggccgagc 
gtcttcttgg 
ctgtacgtgg 
tcgccctcgt 
cccaactctg 
aatggaggtg 
gcccgaggcg 
gcccttgtca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1946 



<210> 78 
<211> 583 
<212> PRT 
<213> Drosophila 

<400> 78 

Met Ala Gly Arg Arg His Leu Leu Leu He Leu Leu Leu He Met Gin 

15 10 15 

Leu He Ala Thr Thr Pro Ala Ser Arg Ser Leu Ser Val Asn Gly Asn 

20 25 30 

Asn He Trp Arg Arg Val Arg Leu Val Thr Ser Gin Glu Ser Asp Arg 

35 40 45 

Asn Ala Tyr Gin Val Leu Lys Pro Ser Ser Arg Asn Ala Thr Ser Thr 

50 55 60 

Thr Thr Thr Thr Ser Thr Thr Thr Ser Thr Thr Ser Ser Thr Thr Ser 
65 70 75 80 

Thr Thr Thr Pro Ala Ala Pro Gin Thr Gin Arg Ser Ala Lys Gin Leu 
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o c 
DD 










y u 










y b 




Asp 


Leu 


Asn 


Phe 


Pro 


Gly 


Gly 


Asn 


Val 


ber 


ber 


Ala 


Ala 


Arg 


Leu 


blU 








100 










105 










110 






Met 


Ser 


Thr 


Glu 


Phe 


Phe 


Val 


Val 


Pro 


Thr 


Leu 


Ser 


Ala 


Ser 


Ser 


Ser 






115 










120 










IOC 

125 








Ser 


Ser 


Thr 


Thr 


Ser 


Thr 


Thr 


Thr 


Pro 


Ala 


Thr 


Pro 


Arg 


Arg 


Ser 


Ser 




130 










135 










140 










Gly 


Ala 


Lys 


Ala 


Arg 


Ala 


Ala 


Gly 


Thr 


Thr 


Gly 


Arg 


Val 


Thr 


Pro 


Gly 


14 5 










150 










ICC 

1 b b 










lbU 


Ser 


Ser 


Ser 


Asn 


Ser 


Thr 


Pro 


Pro 


Ser 


He 


Val 


Ser 


Thr 


His 


Leu 


Pro 










165 










17 0 










175 




Phe 


Ala 


Gly 


Leu 


Arg 


Lys 


Glu 


Pro 


Trp 


Val 


Val 


Pro 


Val 


Leu 


Val 


Leu 








180 










185 










190 






Ala 


Thr 


Leu 


Thr 


Met 


Leu 


Met 


Met 


Ala 


Ala 


Phe 


Glu 


He 


Phe 


Val 


Leu 






195 










200 










205 








Phe 


Lys 


Ala 


Trp 


Arg 


Thr 


Ser 


Pro 


Ser 


Arg 


Arg 


His 


Leu 


Phe 


Leu 


Gly 




210 










215 










220 










Gin 


Met 


Leu 


Leu 


Leu 


Gly 


Leu 


Phe 


Ala 


Cys 


Ala 


Ser 


Leu 


Gly 


Ala 


He 


225 










230 










235 










240 


lie 


Thr 


Ala 


Gin 


Pro 


Ser 


Leu 


He 


Ser Cys 


Gly 


Ala 


He 


Arg 


Phe 


Gly 










245 










250 










255 




Val 


Gly 


Val 


Ala 


Tyr 


Ala 


Leu 


Val 


Phe 


Ala 


Ala 


Leu 


Leu 


Val 


Lys 


Cys 








260 










265 










270 






Val 


Phe 


Leu 


lie 


Ser 


Leu 


Asn 


Gly 


Gly 


Val 


Tyr 


Leu 


Pro 


Ala 


Pro 


Tyr 






275 










280 










285 








Gin 


Gly 


Leu 


Leu 


Leu 


Leu 


Phe 


Ala 


Leu 


Leu 


He 


Gin 


Val 


Ala 


He 


Gly 




290 










295 










300 










Gly 


Gin 


Trp 


Leu 


Leu 


Thr 


Gin 


Pro 


Pro 


Glu 


Val 


Tyr 


Thr 


Thr 


Ser 


Val 


305 










310 










315 










320 


Pro 


Val 


Met 


Gly 


Ser 


Gly 


Phe 


Leu 


Ser 


Thr 


Thr 


Val 


Ala 


Ser 


Gin 


Thr 










325 










330 










335 




Asn 


Tyr 


Ser 


Ala 


Leu 


Phe 


Tyr 


Pro 


Thr 


Ser 


Tyr 


Thr 


Thr 


Leu 


Asp 


Gly 








340 










345 










350 






Thr 


Pro 


Glu 


lie 


Tyr 


Thr 


Arg 


He 


Ala 


Ala 


Val 


Ser 


Thr 


Val 


Leu 


He 






355 










360 










365 








Pro 


Leu 


Cys 


Lys 


Thr 


Gin 


Phe 


Ser 


Glu 


Leu 


Leu 


Phe 


Ser 


Leu 


He 


Tyr 




370 










375 










380 










lie 


Val 


Phe 


Leu 


lie 


Val 


Phe 


He 


Ala 


Val 


Leu 


Ala 


He 


Lys 


Ser 


Arg 


385 










390 










395 










400 


Gly 


lie 


Arg 


Asp 


Asn 


Tyr 


Arg 


Glu 


Ala 


Thr 


Tyr 


He 


Gly 


Leu 


Ala 


He 










405 










410 










415 




Gly 


Gly 


Ala 


lie 


Pro 


He 


Trp 


Leu 


Gly 


Trp 


Met 


Leu 


Cys 


Gly 


Leu 


Ala 








420 










425 










4 30 






Val 


Ala 


Glu 


Arg 


His 


Lys 


Asp 


Ala 


Cys 


Val 


Ala 


Phe 


Gly 


Leu 


Val 


Ala 






435 










4 4 0 










4 4 5 








Thr 


Ser 


Ala 


Thr 


Val 


Phe 


Leu 


Val 


Met 


Phe 


Met 


Pro 


Lys 


Gly 


Arg 


Gin 




450 










4 55 










4 60 










Leu 


Ala 


Ala 


Met 


Gly 


Lys 


Glu 


Gly 


Leu 


Tyr 


Val 


Glu 


Asp 


Arg 


Glu 


Glu 


4 65 










4 70 










475 










4 dU 


Gin 


Phe 


Ser 


Ser 


Leu 


Ser 


Arg 


Ala 


Gly 


Ser 


Gly Tyr 


Ser 


Pro 


Ser 


Phe 










485 










490 










495 




Phe 


His 


Phe 


Lys 


Pro 


lie 


Lys 


Tyr 


Gly 


Val 


Met 


Ser 


Gly 


Cys 


Gly 


Leu 








c r\ A 
bUU 










505 










CI ft 

b i u 






Pro 


Asn 


Ser 


Ala 


Ser 


Asn 


Thr 


Gly 


Gin 


Gly 


Leu 


Ser 


Ser 


Lys 


His 


Cys 






515 










520 










525 








Ser 


Ser 


Ala 


Asn 


Asn 


Gly 


Gly 


Gly 


Lys 


Ser 


Arg 


Ser 


Tyr 


Glu 


Leu 


Phe 




530 










535 










540 










Pro 


Ser 


Lys 


Phe 


Val 


Thr 


Phe 


Ser 


Ala 


Arg 


Gly Gly 


Arg 


Thr 


Gin 


Leu 


545 










550 










555 










560 


Lys 


Leu 


Asp 


Thr 


Val 


Pro 


Phe 


Arg 


Pro 


His 


Ser 


Ser 


Ala 


Leu 


Val 


Ser 



no 
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565 

Cys Gin Pro Leu Leu He Leu 
580 



570 



575 



<210> 79 
<211> 3524 
<212> DNA 
<213> Drosophila 



<400> 79 

aaatttgtat 

atttaaatac 

catttaccat 

ccatgaaata 

ttttttggga 

ttactacata 

ctgacccaac 

tacgggcgtc 

acttttgcga 

ggagaagcaa 

gatattatta 

taagagggta 

attattaata 

tcagttaaat 

tgaattttaa 

taagttcttc 

aatattggct 

ctgcgcccga 

taatcaaatt 

ccgtggttcc 

tacccgcatt 

agccacccct 

cgttgatatc 

cacttaggaa 

gaagtaggaa 

tcagggccac 

aattacctga 

attgaaaaga 

tcagtaccag 

gtttgggacc 

acagcggtgt 

tcttgtacgc 

cagtgctggc 

ctgagaccac 

ccgctatata 

tggatgtgct 

gcattgtgta 

tcaggatata 

ccagggtgtt 

agtcagcaaa 

aaagggactc 

tggtatataa 

tcatttcccc 

ctaagcgctg 

cgctcgtttt 

tattttcaga 

gctattaaag 

taagtttaaa 

tacaagtttt 



aactttcagt 
acatacaaat 
atctgtatta 
ctgtttgtca 
aagattctca 
ttcaattaaa 
tggatgcatc 
ttggctggct 
ttttaattgc 
ccattttatt 
tatatcttta 
tattggcagt 
tgttattagc 
atattgctaa 
ttctgatctc 
attcatattt 
tatttaaagg 
tcctcttctc 
ctccggcgga 
ggtcttttcc 
cagtggcagc 
ctttttactg 
ctgcttattc 
ggcataaagt 
aatgagtatt 
cttggagatc 
gaaatccagt 
aggtgggaat 
tcgagtgacc 
caccgatcgc 
ggcgaatcct 
atctggccac 
ataaaggatc 
tttggcaata 
tcgatccgcg 
caccatgtag 
cagcagaaag 
tatattcgtg 
gccaaataat 
ggaattccgc 
ctctggctga 
gctctcgttc 
tttcacaaac 
tccagttcaa 
gttcttgcat 
cacagttgtt 
aaaaaaacct 
atgtactgta 
ataagatatt 



tcttatactc 
tttttataca 
ttaaaatact 
aacaaacaga 
tatatttaca 
tctcaggagg 
ctgcgccttt 
gtcttacctc 
ataaatcagc 
ttaggtaagc 
tttacttagt 
aaaatattta 
tgtcaaggca 
catatcgatg 
ggtttttatt 
ggataagaaa 
aaattcctac 
gcggtggtgc 
gctgtcactt 
gctgtcgtgg 
agtggaggaa 
ccctgtttgg 
atacaggtaa 
atgggattca 
tccagtcggg 
cagtggtgtg 
gagggagcca 
ctacaaatga 
ttcctgtgag 
agtttcctta 
aggaaaaatg 
attatgttgc 
acgggcagca 
tgcagagttc 
gacatgatga 
gctttgcaca 
ggtattccaa 
accgtttgca 
ccgataatgc 
gtggctatgc 
attgtggaac 
gtgttatacc 
tggaggacat 
atttaattat 
tttggcgttg 
ttggccaaac 
aaaaccttca 
acaattggtg 
ttaatataac 



agttattcca 
aagtttgctt 
tagcatagat 
tttcatcagt 
attataattt 
ctgacttttc 
cctcgcagaa 
tggctaagat 
ttagttattt 
actaagaaaa 
aaaaggatgg 
ttcaaattgt 
ataagaaaat 
tgcggttgta 
acgtagacac 
ggattataaa 
ttaaactggg 
aaaatgcctc 
tgccattgga 
tcgaggatgt 
tctgcggatt 
tgaaaacact 
aggccttgaa 
ccgccgaatt 
agaggtccct 
taaataaaaa 
acatagaatg 
attgaactga 
cccgcctctt 
taaccaagta 
tgtacagtat 
acgagtaatt 
tgaggaccgc 
gatatcgtgg 
gcagaaaaat 
taaactcccc 
tcaggaagca 
ttttggaaaa 
acacaaatcc 
aatgttgata 
taccactaat 
aatttgtggc 
ccatcatcag 
tttaaatagg 
ctgcaaaata 
ttgattagtt 
ctagtctagt 
ataagtattc 
tagattgcag 



ttaatctcta 
tattgactat 
tacagaattt 
ttttgttttt 
aaaatagtct 
cgcggacttt 
aagtatttta 
gcttatttta 
aattggttta 
agattggatc 
ggatttacgt 
gggttgtaca 
gtaatccatt 
cagactcatt 
atattaatac 
atcatttaaa 
ttacaaatct 
ctgctgctcg 
attgcatgat 
ggtggtcgat 
gctggtcaac 
gggctccagc 
gaagctcttc 
cgagtaaacc 
ttgcgcggga 
atcgaatata 
tatacactta 
ttatttcggc 
ctccttggac 
gtagaaactc 
gaaggtggtg 
gatgccatcc 
tgaagttgac 
cgaggaaatc 
cgacgaggtg 
gaatcgccag 
ctcgtctgcc 
gcgcaacacc 
gtagagcacc 
ggtgggcaaa 
ccatctatgg 
atttgcctct 
ccacgtaaac 
ttttacgcgt 
cattttgtta 
actaattgaa 
caatattttt 
aaatggaatt 
ctctttaatg 



atcccacgac 
ttttgaagca 
tcagcttatt 
ataatccttt 
ctactttagc 
tatggtggct 
tttcacattt 
tacaaatccg 
ttaatctacg 
ttaagattcc 
atattttatt 
gtttatatta 
tcagccccct 
ttcaaatctt 
tgcttagtct 
tgcatactat 
gtctgctgca 
ctcaaacata 
ttctgtgtcc 
gaattgttgt 
aggcgcttgg 
tggagttgag 
cggaagttct 
agtgccccca 
ttcgagtgaa 
ctttataatt 
tcacctgcgg 
attattaccg 
ttcgttcctg 
aggataaagc 
cccgaatgct 
tcctgctcca 
caggcaatcg 
gggtggcata 
aaggaggtga 
ctgcaaattc 
accgccagat 
acgtagatca 
acagtgaaca 
tcagtgccat 
ttcagttccg 
gaatcggctg 
atatttccac 
tgttgctccc 
ttacgtgaaa 
atactgitggt 
tattccaacg 
aagttaagtc 
catagtgtat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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atttttaata 
atctttgaat 
tttactacaa 
acaacttgta 
tgcttttgaa 
tatttaaatt 
tatattctga 
agcatttatg 
tatacagatc 
ttccatgcac 



tattatgagt 
cgtatgtacc 
cttttactga 
gaaaaacaaa 
aagtgagtta 
aaattgctca 
cctaactcga 
caaatatcag 
cccatggttt 
atcaaatatt 



ttacttttaa 
ccattcgtaa 
ctttctgaga 
cctttaaaaa 
ccacaaattt 
cagcttgaca 
cactttctgt 
gtgaaaatat 
tgtattagga 
gttgactgag 



gtggcagtca 
tgccacttca 
aatcccctcg 
tacgacgtat 
ggtttgtaag 
cgtccaacaa 
aaataaatta 
ttgtcttctc 
gattgaaatt 
ccaaaacaca 



attttatttg 
catacagatt 
aaaattgaaa 
gaaataaatc 
gcgaaggaaa 
acaagaaaca 
ggcgtaagga 
ggaagtcgga 
taagccgatt 
tgtg 



ctgtcttaac 
cttttgcaac 
actttcctga 
gtctgtttcc 
ttaacattta 
agaacttttt 
aaagctgggc 
aaactccagc 
tgtttgtgca 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3524 



<210> 80 
<211> 1356 
<212> DNA 
<213> Drosophila 



<400> 80 

atgacagccg 

accacggaac 

ctttatggca 

gacttgttca 

ctggtgatct 

ctgaatctgg 

atgcgaattt 

tccatcacct 

gcggtatgcc 

tcagcgattg 

actgtggagc 

aagaagcatt 

ctgtgcttta 

aaaccaggaa 

ctgacggtgg 

ctcattttcc 

tatgccttcc 

aagcaggata 

aaaaagaggg 

ccactgaatg 

aagaccggaa 

ccggagaatt 

agaggatcgg 



attcagaggc 
tgaaccatag 
ctgatttgcc 
ctgtggtgct 
acgtggtgtt 
cggtggcaga 
gcagctggcg 
ccttcacctc 
acccgatttc 
cctggtcaac 
aggaggatgg 
cgggcaccac 
tcctgagttt 
cgaagtccaa 
ccctgattca 
tacttctggg 
taagtgagaa 
tcaacgctca 
gtggctccaa 
cgggtaacaa 
ccacggggac 
tgattatatg 
gcgcagtgca 



aaatgccaca 
atggattagt 
cacctatcaa 
ctacggattt 
gcgcttttcc 
cgagtgcttc 
attcggggag 
gtcgattttt 
ctcgccacga 
ttcagcggtc 
catcaattac 
cttcatactg 
ctactacttg 
ggagaagagg 
ctcgaatccc 
ggcactggtt 
cttccggaag 
actccagctg 
gcgcctgttg 
caattcatcg 
acagaaatca 
tttgagcgag 
gcagacagat 



aattggtata 
ggtagttcca 
cattgcatag 
gtgtgcatta 
aaaatgcaaa 
ctgattggaa 
tttatgtgca 
ctgctcatca 
tatcgaactc 
ctcatgctgc 
tcgtgcaaca 
tacacatttt 
gttataagga 
cgggctcaca 
gcgcaaaggg 
tactcgaatt 
agcttcttca 
gagcccagtg 
accagcaatc 
accaccacat 
tgcaattcca 
cagcaggagg 
ttgtaa 



acacgaacga 
caattcagcc 
ccacgcggaa 
tcggattatt 
cggtcacgaa 
taccctttct 
aagcctacat 
tgtccgcgga 
tgcatattgc 
ccgtgatcct 
taatgtggcc 
tcctaggatt 
aactgcgatc 
ggaaggtcac 
acctctcccg 
cggcggtgaa 
aggcctttac 
ttttcaccaa 
cgcagattcc 
cctcgaccac 
atggcaaagt 
cattttgcac 



gagcttatat 
agaggagtcc 
ttcctttgct 
tggcaacacc 
tatatatatc 
gctgtacaca 
ggtgagcaca 
tcgatatata 
caaagtggtc 
ttatgccagc 
agatgcgtac 
cgccacaccg 
ggtgggtccc 
tcgactggta 
actggaaata 
tcccatactt 
ctgtatgaat 
acagggcagt 
tccactgctg 
gacagcggaa 
gacagctccg 
caccgcgaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1356 



<210> 81 
<211> 451 
<212> PRT 
<213> Drosophila 

<400> 81 

Met Thr Ala Asp Ser Glu Ala Asn Ala Thr Asn Trp Tyr Asn Thr Asn 

15 10 15 

Glu Ser Leu Tyr Thr Thr Glu Leu Asn His Arg Trp lie Ser Gly Ser 

20 25 30 

Ser Thr He Gin Pro Glu Glu Ser Leu Tyr Gly Thr Asp Leu Pro Thr 

35 40 45 

Tyr Gin His Cys He Ala Thr Arg Asn Ser Phe Ala Asp Leu Phe Thr 

50 55 60 

Val Val Leu Tyr Gly Phe Val Cys He He Gly Leu Phe Gly Asn Thr 
65 70 75 80 

Leu Val He Tyr Val Val Leu Arg Phe Ser Lys Met Gin Thr Val Thr 

85 90 95 

Asn lie Tyr He Leu Asn Leu Ala Val Ala Asp Glu Cys Phe Leu He 
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100 



105 



110 



Gly lie Pro Phe Leu Leu Tyr Thr Met Arg He Cys Ser Trp Arg Phe 







115 










120 










125 








Gly 


Glu 
130 


Phe 


Met 


Cys 


Lys 


Ala 
135 


Tyr 


Met 


Val 


Ser 


Thr 
140 


Ser 


He 


Thr 


Ser 


Phe 


Thr 


Ser 


Ser 


He 


Phe 


Leu 


Leu 


He 


Met 


Ser 


Ala 


Asp 


Arg 


Tyr 


He 


145 










150 










155 










160 


Ala 


Val 


Cys 


His 


Pro 
165 


He 


Ser 


Ser 


Pro 


Arg 
170 


Tyr 


Arg 


Thr 


Leu 


His 
175 


lie 


Ala 


Lys 


Val 


Val 
180 


Ser 


Ala 


He 


Ala 


Trp 
185 


Ser 


Thr 


Ser 


Ala 


Val 
190 


Leu 


Met 


Leu 


Pro 


Val 
195 


He 


Leu 


Tyr 


Ala 


Ser 
200 


Thr 


Val 


Glu 


Gin 


Glu 
205 


Asp 


Gly 


He 


Asn 


Tyr 
210 


Ser 


Cys 


Asn 


He 


Met 
215 


Trp 


Pro 


Asp 


Ala 


Tyr 
220 


Lys 


Lys 


His 


Ser 


Gly 


Thr 


Thr 


Phe 


He 


Leu 


Tyr 


Thr 


Phe 


Phe 


Leu 


Gly 


Phe 


Ala 


Thr 


Pro 


225 










230 










235 










240 


Leu 


Cys 


Phe 


He 


Leu 
245 


Ser 


Phe 


Tyr 


Tyr 


Leu 
250 


Val 


He 


Arg 


Lys 


Leu 
255 


Arg 


Ser 


Val 


Gly 


Pro 
260 


Lys 


Pro 


Gly 


Thr 


Lys 
265 


Ser 


Lys 


Glu 


Lys 


Arg 
270 


Arg 


Ala 


His 


Arg 


Lys 
275 


Val 


Thr 


Arg 


Leu 


Val 
280 


Leu 


Thr 


Val 


Ala 


Leu 
285 


He 


His 


Ser 


Asn 


Pro 
290 


Ala 


Gin 


Arg 


Asp 


Leu 
295 


Ser 


Arg 


Leu 


Glu 


He 
300 


Leu 


He 


Phe 


Leu 


Leu 


Leu 


Gly 


Ala 


Leu 


Val 


Tyr 


Ser 


Asn 


Ser 


Ala 


Val 


Asn 


Pro 


He 


Leu 


305 










310 










315 










320 


Tyr 


Ala 


Phe 


Leu 


Ser 
325 


Glu 


Asn 


Phe 


Arg 


Lys 
330 


Ser 


Phe 


Phe 


Lys 


Ala 
335 


Phe 


Thr 


Cys 


Met 


Asn 
340 


Lys 


Gin 


Asp 


He 


Asn 
345 


Ala 


Gin 


Leu 


Gin 


Leu 
350 


Glu 


Pro 


Ser 


Val 


Phe 
355 


Thr 


Lys 


Gin 


Gly 


Ser 
360 


Lys 


Lys 


Arg 


Gly 


Gly 
365 


Ser 


Lys 


Arg 


Leu 


Leu 
370 


Thr 


Ser 


Asn 


Pro 


Gin 
375 


He 


Pro 


Pro 


Leu 


Leu 
380 


Pro 


Leu 


Asn 


Ala 


Gly Asn 


Asn 


Asn 


Ser 


Ser 


Thr 


Thr 


Thr 


Ser 


Ser 


Thr 


Thr 


Thr 


Ala 


Glu 


385 










390 










395 










400 


Lys 


Thr 


Gly 


Thr 


Thr 
405 


Gly 


Thr 


Gin 


Lys 


Ser 
410 


Cys 


Asn 


Ser 


Asn 


Gly 
415 


Lys 


Val 


Thr 


Ala 


Pro 
420 


Pro 


Glu 


Asn 


Leu 


He 
425 


He 


Cys 


Leu 


Ser 


Glu 
430 


Gin 


Gin 


Glu 


Ala 


Phe 
435 


Cys 


Thr 


Thr 


Ala 


Arg 
440 


Arg 


Gly 


Ser 


Gly 


Ala 
445 


Val 


Gin 


Gin 



Thr Asp Leu 
450 



<210> 82 
<211> 7448 
<212> DNA 
<213> Drosophila 

<400> 82 

agcatgcggc tgaacaaaaa aaaactcgag gaatggccgc aatttcggtg agccatataa 60 

acaacataac atatagtttt cccatccatc tacgctgtct gcgccactgg ccaagtccaa 120 

gttgtcttga cggcccggca aaaagcaaca tccaaggaaa atgttgccat tttagatgcg 180 

ccgcccgctg agcggaaaag tcgcgcggaa attactaaat cgatacgtgc tgcgtatcag 240 

attttccatg ggttctgcat gcaaaaatac cttgaaatcc ttgcctgcag tcgcacacat 300 

acacgtattg ccatctttgt acagacaagc ttgactaagt cgaactttac tatgttattc 360 

tactaaaagg ttttaaagca aatgatctat actatactat actattaact tcaatagaat 420 
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tcctattttc 
gagatacgta 
caatggaaat 
tttatgtctg 
aagaggcttc 
aactgaaagc 
agaatcggaa 
acatggactg 
aatgaatatg 
ctgtgtttct 
tgttcgtggg 
agaaggccgc 
tgaaccaacc 
ggaacggggt 
ccacaatgct 
acagcgattt 
ccgcctggtg 
ccccatgcga 
cctggcccgt 
ggcccatgcc 
aggagggtcg 
cgccggaggg 
ctccaccccc 
gcatcatgtc 
gtctctggcg 
ccgctgccgt 
tgtcgatgat 
aaacggaaac 
tcatatactt 
catacccaaa 
acataaaact 
tttaagtgaa 
tgcagatgca 
aaagctcgac 
ttagcaacat 
tctctcgtta 
ttgaagtgaa 
gcgagggtat 
gtgccacagt 
tgactttcgc 
tattccgtat 
gtttgcaggc 
gagcttacct 
caggaaattc 
gccgagaaga 
cctggctcgt 
agtcaactaa 
tcagaggtcg 
tgtcgccagt 
tcggcgacat 
ttatgcaatg 
catttaagca 
ttacatttcg 
tgactgtgct 
gttagacaaa 
caaggaagtt 
caacttagaa 
ttatttcttc 
cagccagacg 
cgtgagcggt 



tctggctagc 
gattgtgtgg 
gtttatctcc 
caaataagag 
cctatagact 
agtggaattg 
gcaaaccctg 
tgggtgcaat 
ttaataatat 
gtttccaggc 
gaactgtggc 
ccggaagtcg 
cagccaggtg 
gatcagatag 
cagggtttga 
actgttggtg 
atgataatga 
atgcctaaag 
ttggcccggc 
actcgtcatt 
tgtgagttcg 
aagggccagc 
accacctgct 
ttggagctcc 
ctggcccacc 
gtagcgcttg 
aaataactaa 
ggaagtcacg 
taaagggaac 
tgccatgcgg 
gaatttcaga 
ttttaagcct 
attttcgaag 
ataaatgagt 
ttcataacta 
gaaaacggtt 
aaacttgttt 
ggtatttaaa 
ttgtgccgca 
tggtcgggcc 
ttgtgcacta 
aaacagaagg 
tgttgtgaac 
ttgcatacac 
tgacgtagcc 
acctgcggga 
tgcaggaagt 
caagagcagc 
cagtctgctc 
ttgtaatgct 
gcatcttttg 
aattaagctt 
gaattctgct 
aacgaagtta 
agtagagaga 
gatgtttagg 
gacttcttgc 
aagtgtactt 
attaggatca 
ctggtcacgg 



aaacaaattt 
ctcaaccttt 
aaagcttagg 
atattacgtg 
gtttgaaaat 
ctggtgggcg 
acgatgaaat 
tagggaccac 
gttatgcgcg 
gagtccagtc 
ttctggccgg 
acgctgtaga 
agggccttgg 
ttgataaaga 
gtggtcttgt 
ttggcactgg 
ggatggtggt 
ctgtgtgttc 
ggctgggact 
gtggtaatgg 
tagcacccat 
agggattgac 
cctccagcgc 
ttggctatcg 
gagctgtgct 
aatttctaga 
aatagagtca 
ctaggaaaag 
cacactggta 
ccattgttct 
ccggttttaa 
attttaaaat 
cgagcatcgt 
cgaatatgta 
acatttttga 
agttgaaaac 
tatagcggac 
tctgtgtgag 
cgtggagtgt 
agcgtttgtt 
tgttggcttt 
acatgcggac 
acttatgtcg 
ataaatcatg 
ctcgttctgg 
aatggaagtg 
tcccaattat 
agacaattgc 
ggaggtctcg 
taaaatggac 
acacaatttt 
taattatttt 
gcctacagga 
gatataaaca 
acaaaatttg 
ttaaaattcg 
tgcactcctt 
accatcccag 
tgatgagggc 
ccaaatatct 



ggatcataaa 
tatctatatt 
aaaat ttgga 
tatactgagt 
cctataatgc 
aggttcgtct 
aaactatacg 
acctttttcg 
tgcaaatttg 
caatccaact 
cgctgaagaa 
aggcgtagat 
ccagcatttg 
agggcagcca 
tctccttctt 
tgttcccgct 
gatggcttcg 
gcggtggctg 
gccattgcgc 
tcgttgaggc 
tcttggcgca 
agtgtgtgcc 
ccatcgccgc 
tctggttgga 
gctcctgctc 
cacgcaaagt 
atgggcggca 
ccgaagcttt 
agcagatggc 
atcaactgca 
tgcgtatgga 
gcacaccagg 
ttagtgttag 
attaccctgg 
aaagtattca 
tcattaattc 
actggaccgg 
ctcctggcat 
cctttaagcc 
gtttgcagct 
ggggttttgg 
attgggacat 
gtcatattgt 
actgccatgg 
ttgtatcggc 
agcaggatac 
ttgactacgg 
cagctaatta 
aaatttattc 
caaaaagata 
ctgcagtcaa 
tgtaagttaa 
acatttagta 
aagcacttta 
ttcttaaagc 
gtgcatctat 
cttcgttatg 
catgggggga 
caatcgtttg 
gtcggatgga 



aagattttca 
atttccataa 
agtatatata 
ttcgcttgag 
cttggcattt 
cctaagtggc 
agaaactgga 
ctgccgacgc 
ttcagggctt 
ctatcgcctg 
gcgtttgcag 
gaagggatta 
gctggcctgg 
gcaggcgata 
cagcgacgta 
agtcgatgtc 
cagtctctcg 
cgatagggag 
atccaggaga 
ggtggaagcg 
gcccatggag 
gccagtggca 
acaattgtcg 
gaacgttcgc 
cgacagctca 
gcacaaggcg 
aaaggcggca 
aaacactctt 
tttaagctga 
cgatgtgcta 
gctgcctgga 
tacacataca 
ctatttagtg 
tattcaagga 
actctaaagc 
agctatacgg 
aaattggcac 
ttgggaatta 
ggagtggttc 
gccgtggttt 
atggcgggag 
ggttgcggat 
cgtgcctgga 
gtatgaagaa 
actcgcgcag 
gcttaatggg 
agctgtgacc 
tattcgtttg 
taattaattt 
cccaaagtgg 
caaaattgtt 
ataaattaaa 
tatttcttca 
attaattctc 
tagcagatgc 
ttagtgtgtg 
gaaataaccc 
caggtgatgg 
gagcgccgtt 
cgtgaaggat 



tggcatttat 
atttaagcac 
ataaaaacct 
tttcggagtt 
ctaaaaggtc 
aaaatgatcc 
gaacaattac 
gtctaaagtt 
gaggggtttt 
atggacatgg 
gtgaggcgcc 
atggcgctat 
tgttcggcca 
aagccaccca 
atccgattgg 
gtggttacgc 
cggtcccgct 
tgggtctgca 
ctactgcctg 
cgcttcagcg 
ccacctactc 
ctacttgctc 
ctgtcgctga 
gatgtggcct 
ttctccacgt 
gttaagtcaa 
aagataaaag 
gtaggcatca 
tttaagacta 
atctctctaa 
cttgattcag 
aatgtatata 
atgctaactt 
tgtagcaaca 
gccaaactgc 
gctgccagag 
ataccccgat 
tgaggcaaac 
gcaatagaga 
tcctattcca 
acgtgcgaat 
ggctgcgacg 
tgcaatgaca 
ggagcccatg 
atcccttcgt 
ccaccaagtt 
ctcggagggg 
ttcagctggg 
acactcacac 
tccttcaaaa 
gatttaattt 
catctcttgg 
aaaaatctgt 
cttttagtta 
ggtataccaa 
gactgttgac 
tacatactga 
ccaacgcgag 
tacgggagta 
gcaaagattt 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
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acgatatcag 
atggcgcaca 
tcgcagagca 
tccgccagcc 
atttgattag 
cggccagctc 
ttctataccg 
gctttctcct 
gtctcccggc 
ggagctcatt 
taaggttgct 
ttggagtgaa 
tgtagtgtat 
tcatatcagc 
aaactataaa 
tactttggac 

tgggggcatg 

gatgaaaatc 
agaagggttt 
ttctgacgag 
ctttcgadat 
tttttcggct 
agaaattctt 
atttcaccta 
acacctcctc 
ttgggataca 
ttgctctaaa 
aaacattgct 
tgttctcagt 
atttaccgga 
ctttcgcgga 
acactcacac 
tgacgaagat 
tggtgcgaag 
tgaccacgat 
gatcgtagaa 
tcacgggcgc 
tgggatatcc 
catatacgat 
atccccgcag 
aattattcag 
gctagctccg 
cggagcagga 
aattaaagga 
gaggaaacaa 
attttttatg 
tgatttcaca 
attcattcgc 
tatacccgtc 
ggcctttgta 
taaacgttaa 
agcaataaat 
ttaaaatgta 
ataattcgtg 
ttgatttggt 
taatcattat 
caaagtggcg 
ttttattt 



gggaatacta 
ggctgagaat 
cccagcccag 
ttttgcagcg 
caacattttc 
ggcaaatttt 
aaaaagcacc 
cgtcggtggc 
gccaaatgaa 
acacggatat 
gaatttgctg 
ttagaaaaac 
gtgtatgttt 
actttcagtg 
tcccgaccgt 
aaaaatgaac 
agtaattatg 
ggaaaacaag 
agcatatttt 
aataattcat 
gccgttgcct 
ttttttttcg 
tcagttcaaa 
gcagcattta 
ctcgagttcc 
cttggagaaa 
cagaatctca 
actacatttt 
gcattcgctt 
atggcatctt 
aggactctaa 
tcatacacac 
gcccacgaga 
ccggcgagtg 
gaaggtggca 
gtgcgcccat 
cgtgggtccg 
acctcctact 
ggatatgttt 
ggaggaggag 
ctgcaagttg 
gcagcaggcc 
acacggccca 
aagttcgaca 
taaccatttg 
tatgcacgca 
gcattccata 
ttgcaattaa 
tgtgtaaatt 
ttgggttgcc 
tcctttagtt 
caaattaagt 
tgattccatt 
gaggtatttt 
gaagaaaatt 
atgaaaatta 
aaatgtatta 



tggatgtttt 
ggatgccgtg 
ttgccacgag 
cggagtgggg 
catggctttc 
ggcaatgttt 
gatttctcca 
ttgtcaagcg 
aagttgtttt 
tgcctatggg 
aatctccaag 
acaaggatta 
ttatctaaaa 
cctaccccaa 
gtgggtgtat 
tgaatgtggg 
aatgagattc 
cttaatgttc 
ccattcaact 
tccgattcgg 
ttaaagtatt 
tgctgaggaa 
gaatacaaat 
cataaataca 
tgttcgtgtc 
aatatatata 
ttgatattta 
tacttcatga 
ggttctgggc 
catcttgtaa 
tgccttaatg 
atagactcac 
aggtcggcca 
gtcaatatgg 
aagactgcaa 
tcggcggccg 
ggttcggtga 
cctcctactc 
tccggcagac 
gaggcagcca 
acagccatgg 
gcctaggccc 
actgtggtca 
tttagcaaag 
catatccctt 
ttatctgatt 
caatctcatt 
gcattcaaca 
acgttgaaag 
attaccgcga 
agtcatgctt 
ttaccttatc 
tgctttggat 
ctatcagaaa 
caagaaaatc 
caaatataat 
ccaataaatt 



ggataggcct 
cagaggagca 
cctgcaatca 
cagccaactt 
agcaaatgcc 
caggtcccct 
tttgacccca 
attaattttt 
tgtgaaaggc 
ccagcacact 
gactcggatt 
cttaattaat 
actggagtag 
tttcaactgg 
tacgtcctaa 
gcttggtaaa 
aaaaggtatc 
cataagacag 
tcacgagttc 
attgtcagtg 
tccacaaagt 
attgcacgag 
ccataaggcg 
cgcccccttt 
cgtgctgcta 
tatgctagtc 
caggttcgaa 
gatacaaatt 
gccaagcttt 
tcacgcgaaa 
gcgctgccac 
ctatgagatg 
ccgccaggct 
ccagtatgac 
tcaggatgat 
ccgccgactc 
gagccactgc 
ctcctccacc 
cacccactcc 
catacatgag 
atcggccagt 
atccatatcg 
ttgtgtccag 
tcctttgcat 
ttatgctgaa 
tattccttgt 
taaattgcaa 
atctcaaggc 
ttgtcaacgg 
tacagttaaa 
tcaacgaaaa 
atctagaagc 
taggtaccga 
attaaaaatt 
atcaacttcg 
gagctttata 
tgccaaggga 



atgctcacct 
cgtccaggga 
agattccaaa 
aaaattccgg 
tcacctaagg 
tcaacccgtt 
acacttcgag 
aaatcaagtg 
gctaggagcg 
tgggttggtc 
gaagcagtta 
tatgggaatg 
gaactggcac 
atgataaggc 
cttaatgata 
ttgtttgcct 
tccacagaga 
aagccaagtg 
agcaacaaag 
gtccggatga 
tgggctcggg 
agccgaggca 
agagagatac 
gccccgcccc 
ttcctgttcg 
ctcctagaaa 
agttccttaa 
acaaagactt 
tgttagcaat 
agcccggcta 
acactcacac 
gacggcgacg 
catcacaaag 
cagggtgttc 
gccctgccag 
gttgtagtaa 
atccgtaccg 
tgtcccattg 
cactcccgat 
cgacggataa 
tgggttcaca 
tgccatatcc 
ttgcacgtcg 
ggtaataaaa 
atctatttgt 
gacaatttcc 
tcgtcgaaat 
aaaaagtgca 
tgtcaaaaga 
tactttactt 
tgccgaagtg 
acccactggc 
acttagagac 
tttatacaaa 
gaattcctgg 
caaagtaagc 
atggctacat 



gtccacactg 
gatccaaatg 
ttaaaaactt 
tccaaagtca 
ccaattaaag 
agtggtttaa 
cttctacagt 
ccccttaatg 
aaggaaccaa 
tagttcaatg 
aaggcaaaat 
cttgttcagt 
aaagcagtcg 
tccaaaaagc 
tatgtgcgta 
cgaaaaaacc 
gaaactagtg 
agctcaaatc 
atcaactatt 
ttatgacaaa 
aattttcccc 
aaaaaatgaa 
atactcgcat 
accatctgac 
tgctactttg 
tccctacaca 
atacctcaag 
gtattggatt 
tttttgaggg 
gaggcgcctc 
acacacttac 
gcggggggca 
cagttggtaa 
ccgatgacgg 
gagagcacca 
ttaaatgtgg 
ttcggtccgc 
cccgcatcct 
cccagtccca 
atggcgctta 
atcgagtgtg 
ctggagttga 
ggttaactgg 
acgaagtcaa 
gtttgtttta 
gaaattaaat 
tgtcaatttc 
cagcattgaa 
ttcttaagat 
tctaagtaga 
cgctactgtg 
ttgatttaat 
ttaagaataa 
tattcacggt 
cgcattctaa 
actattacta 
tttgggaata 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7448 
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<210> 83 . 
<211> 1896 
<212> DNA 
<213> Drosophila 



<400> 83 

atggctgtca 

gctgcctcct 

ctgccggaaa 

gtaggaggag 

accgaacccg 

gccgccgaat 

gcagtctttg 

atattgacca 

gccgaccttc 

tcgtggcaac 

acggcatcca 

ccgctcacgt 

gtctggctgc 

cgaagggatc 

atgggctcct 

gtcattgcat 

cgctacacgg 

ggccagcgcc 

ctccaagaca 

ggtggtgggg 

gcccttccct 

ctcacacgac 

acgagtggca 

ggccaaacgg 

aggcattcgc 

tatcatcacc 

aacagtaaat 

accctgagca 

ctgatcaccc 

tggctgggtt 

ttccgggcgg 

cagttcccca 



acttgcagct 
cctcctccct 
acatatccat 
taggaggtgg 
gacccacggc 
gggcgcactt 
ccaccttcat 
ctcgccggct 
tcgtgggcat 
tgggctgggt 
ttctcagcct 
actcccgtaa 
tcgcgttggc 
tgcgcgagtg 
tcttcatacc 
ccaggcacga 
cagcggacgt 
agagacaggc 
tgatgctcag 
gtggaggagc 
ccggcggagt 
cctcctcgct 
tgggcccagg 
gccaggtgca 
atggggagcg 
aggcgggcgt 
cgctgtccaa 
ttgtggtggg 
cgttcctggc 
ggttcaatag 
ccttctggcg 
cgaacaccat 



gaataattta 
gcggggattg 
cgtatatgag 
atatcccagc 
gcccgtgacc 
ctacgatctg 
cgtggtcacc 
tcgcaccatt 
cttcgtcatg 
gctctgcgac 
gtgcgccatc 
acggcgctcc 
catcacctgt 
ccgatacaac 
catggcagtc 
caatatgacc 
ggagaatgag 
cacatcgcga 
cgacagcgac 
aagtagtgcc 
aggtggctcc 
gaagcgcgct 
cagtagtctc 
gacccactcc 
ggaccgcgag 
aaccacgaca 
tcggattacg 
tggctttatc 
cgaacaccag 
cgccattaat 
cctcacctgc 
gtccatcagg 



agcgccattt 
ggactgggat 
gatgcgggca 
ggaccgaacg 
acatttaatt 
gtgctctcct 
gtcatcggga 
accaactgct 
ccccccgccg 
atttggatct 
agtgtggaca 
aaacgattgg 
ccccccatgc 
cagaacgagg 
atgatttatg 
gacataagtg 
ctgtcggagc 
acgttctcca 
aattgtgcgg 
actggcggca 
atgggctgcg 
tccaccgcct 
ctggatgcgc 
ctatcgcagc 
agactgcgaa 
tcgactagcg 
tcgctgaaga 
gcctgctggc 
gccagccaaa 
cccttcatct 
aaacgcttct 
cgatag 



atccgtcgct 
cgggagtggg 
atgggacagg 
gtacggatgc 
actacaacga 
ggcagggcat 
acaccctggt 
ttgtgatgag 
tcgccgtcca 
ccctggacgt 
gatatttggc 
ccctcatcat 
tgggatggta 
gctacgtcat 
tgtatgcaag 
ttcacaacaa 
aggagcagca 
accagacgat 
cgatgggcgc 
cacactgtca 
ccaagaatgg 
caacgaccat 
aatggcagtc 
caccgcgaac 
gccatcacca 
ggaacaccag 
aggagaacaa 
tgcccttctt 
tgctggccaa 
acgccttcta 
tcagcgccgg 



catgtatgtg 
agtgggtggt 
tggaggagga 
agtggctctc 
gtcggcggcg 
catcctgatt 
catactggcc 
cctggcggtg 
tctcataggc 
gctcctctgc 
cgtgaccaga 
gatcctaatc 
cgagccagga 
cttctcggcc 
aatttcctgt 
gaaattcaag 
cagctcggtg 
agccaaggag 
tggaggagca 
atccctgctg 
gtgctacgaa 
taccacaatg 
ccagccgccg 
acacagcttt 
ccatcctcat 
tgccaacacc 
gaccactcaa 
tatcaactat 
ggccctcacc 
cagcgtcgac 
ccagaagcca 



60 

120 

180 

240 

300 

360 

420 

480 

54 0' 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1896 



<210> 84 
<211> 631 
<212> PRT 
<213> Drosophila 

<400> 84 



Met 


Ala 


Val 


Asn 


Leu 


Gin 


Leu 


Asn 


Asn 


Leu 


Ser 


Ala 


lie 


Tyr 


Pro 


Ser 


1 








5 










10 










15 




Leu 


Met 


Tyr 


Val 
20 


Ala 


Ala 


Ser 


Ser 


Ser 
25 


Ser 


Leu 


Arg 


Gly 


Leu 
30 


Gly 


Leu 


Gly 


Ser 


Gly 
35 


Val 


Gly 


Val 


Gly 


Gly 
40 


Leu 


Pro 


Glu 


Asn 


He 
45 


Ser 


He 


Val 


Tyr 


Glu 


Asp 


Ala 


Gly 


Asn 


Gly 


Thr 


Gly 


Gly 


Gly 


Gly Val 


Gly 


Gly 


Val 




50 










55 










60 










Gly 


Gly 


Gly 


Tyr 


Pro 


Ser 


Gly 


Pro 


Asn 


Gly 


Thr 


Asp 


Ala 


Val 


Ala 


Leu 


65 










70 










75 










80 


Thr 


Glu 


Pro 


Gly 


Pro 
85 


Thr 


Ala 


Pro 


Val 


Thr 
90 


Thr 


Phe 


Asn 


Tyr 


Tyr 
95 


Asn 


Glu 


Ser 


Ala 


Ala 
100 


Ala 


Ala 


Glu 


Trp 


Ala 
105 


His 


Phe 


Tyr 


Asp 


Leu 
110 


Val 


Leu 



16 
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WO 01/070980 



Ser 


Trp 


Gin 


Gly 


He 


lie 


Leu 


He 


Ala 






115 










120 




Val 


Thr 


Val 


lie 


Gly 


Asn 


Thr 


Leu 


Val 




130 










135 






Arg 


Arg 


Leu 


Arg 


Thr 


He 


Thr 


Asn 


Cys 


145 










150 








Ala 


Asp 


Leu 


Leu 


Val 


Gly 


He 


Phe 


Val 










165 










His 


Leu 


He 


Gly 


Ser 


Trp Gin 


Leu 


Gly 








180 










185 


He 


Ser 


Leu 


Asp 


Val 


Leu 


Leu 


Cys 


Thr 






195 










200 




Ala 


He 


Ser 


Val 


Asp 


Arg 


Tyr 


Leu 


Ala 




210 










215 






Ser 


Arg 


Lys 


Arg 


Arg 


Ser 


Lys 


Arg 


Leu 


225 










230 








Val 


Trp 


Leu 


Leu 


Ala 


Leu 


Ala 


He 


Thr 










245 










Tyr 


Glu 


Pro 


Gly 


Arg 


Arg Asp 


Leu 


Arg 








260 










265 


Glu 


Gly 


Tyr 


Val 


lie 


Phe 


Ser 


Ala 


Met 






275 










280 




Ala 


Val 


Met 


lie 


Tyr 


Val 


Tyr 


Ala 


Arg 




290 










295 






Arg 


His 


Asp 


Asn 


Met 


Thr 


Asp 


lie 


Ser 


305 










310 








Arg 


Tyr 


Thr 


Ala 


Ala 


Asp 


Val 


Glu 


Asn 










325 










His 


Ser 


Ser 


Val 


Gly 


Gin 


Arg 


Gin 


Arg 








340 










345 


Ser 


Asn 


Gin 


Thr 


He 


Ala 


Lys 


Glu 


Leu 






355 










360 




Ser 


Asp 


Asn 


Cys 


Ala 


Ala 


Met 


Gly 


Ala 




370 










375 






Gly 


Gly 


Ala 


Ser 


Ser 


Ala 


Thr 


Gly 


Gly 


385 










390 








Ala 


Leu 


Pro 


Ser 


Gly 


Gly 


Val 


Gly 


Gly 










405 










Gly 


Cys 


Tyr 


Glu 


Leu 


Thr 


Arg 


Pro 


Ser 








420 










425 


Ala 


Ser 


Thr 


Thr 


He 


Thr 


Thr 


Met 


Thr 






435 










440 




Ser 


Leu 


Leu 


Asp 


Ala 


Gin 


Trp 


Gin 


Ser 




450 










455 






Gin 


Val 


Gin 


Thr 


His 


Ser 


Leu 


Ser 


Gin 


465 










470 








Arg 


His 


Ser 


His 


Gly 


Glu 


Arg 


Asp 


Arg 










485 










His 


His 


Pro 


His 


Tyr 


His 


His 


Gin 


Ala 








500 










505 


Ser 


Gly 


Asn 


Thr 


Ser 


Ala 


Asn 


Thr 


Asn 






515 










520 




He 


Thr 


Ser 


Leu 


Lys 


Lys 


Glu 


Asn 


Lys 




530 










535 






Val 


Val 


Gly 


Gly 


Phe 


He 


Ala 


Cys 


Trp 


545 










550 








Leu 


He 


Thr 


Pro 


Phe 


Leu 


Ala 


Glu 


His 










565 










Lys 


Ala 


Leu 


Thr 


Trp 


Leu 


Gly 


Trp 


Phe 



580 585 



PCT/US01/09341 



Val 


Phe 


Ala 


Thr 


Phe 


He 


Val 








125 








lie 


Leu 


Ala 


He 


Leu 


Thr 


Thr 






140 










Phe 


Val 


Met 


Ser 


Leu 


Ala 


Val 




155 










160 


Met 


Pro 


Pro 


Ala 


Val 


Ala 


Val 


170 










175 




Trp 


Val 


Leu 


Cys 


Asp 


lie 


Trp 










190 






Ala 


Ser 


He 


Leu 


Ser 


Leu 


Cys 








205 








Val 


Thr 


Arg 


Pro 


Leu 


Thr 


Tyr 






220 










Ala 


Leu 


lie 


Met 


lie 


Leu 


lie 




235 










240 


Cys 


Pro 


Pro 


Met 


Leu 


Gly 


Trp 


250 










255 




Glu 


Cys 


Arg 


Tyr 


Asn 


Gin 


Asn 










270 






Gly 


Ser 


Phe 


Phe 


He 


Pro 


Met 








285 








lie 


Ser 


Cys 


Val 


He 


Ala 


Ser 






300 










Val 


His 


Asn 


Lys 


Lys 


Phe 


Lys 




315 










320 


Glu 


Leu 


Ser 


Glu 


Gin 


Glu 


Gin 


330 










335 




Gin 


Ala 


Thr 


Ser 


Arg 


Thr 


Phe 










350 






Gin 


Asp 


Met 


Met 


Leu 


Ser 


Asp 








365 








Gly 


Gly Ala 


Gly 


Gly 


Gly 


Gly 






380 










Thr 


His 


Cys 


Gin 


Ser 


Leu 


Leu 




395 










400 


Ser 


Met 


Gly 


Cys 


Ala 


Lys 


Asn 


410 










415 




Ser 


Leu 


Lys 


Arg 


Ala 


Ser 


Thr 










430 






Ser 


Gly Met 


Gly 


Pro 


Gly 


Ser 








445 








Gin 


Pro 


Pro 


Gly 


Gin 


Thr 


Gly 






4 60 










Pro 


Pro 


Arg 


Thr 


His 


Ser 


Phe 




475 










480 


Glu 


Arg 


Leu 


Arg 


Ser 


His 


His 


490 










495 




Gly 


Val 


Thr 


Thr 


Thr 


Ser 


Thr 










510 






Ser 


Lys 


Ser 


Leu 


Ser 


Asn 


Arg 








525 








Thr 


Thr 


Gin 


Thr 


Leu 


Ser 


He 






540 










Leu 


Pro 


Phe 


Phe 


He 


Asn 


Tyr 




555 










560 


Gin 


Ala 


Ser 


Gin 


Met 


Leu 


Ala 


570 










575 




Asn 


Ser 


Ala 


He 


Asn 


Pro 


Phe 



590 



1 17 
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He Tyr Ala Phe Tyr Ser Val Asp Phe Arg Ala Ala Phe Trp Arg Leu 

595 600 605 

Thr Cys Lys Arg Phe Phe Ser Ala Gly Gin Lys Pro Gin Phe Pro Thr 

610 615 620 

Asn Thr Met Ser He Arg Arg 
625 630 



<210> 85 
<211> 3836 
<212> DNA 
<213> Drosophila 



<400> 85 

gtgctcgaca 

ctggaccatc 

tatgcgtttg 

ggtctggagg 

ttcggccttt 

ttgtagcgga 

aacggcgacc 

gaggagcagc 

ttatgggttg 

ttctcgttcg 

gtggccagat 

ctcccgccta 

attataaatt 

gacgaatatt 

actaaatttg 

gtttgcccaa 

tcgggcaaag 

tttctgtaca 

caactactac 

tgatgacatc 

cggttcgatt 

tattcgaatc 

caaagaggag 

acgagtacca 

gatttacatc 

atgtggactt 

ggaaaaacag 

gtggataata 

aattattatt 

aaataactcc 

tttaattaca 

caatgatatg 

ttactggcaa 

tactagatca 

agatatctct 

tttttccgga 

ttgacgtcgc 

tggagcacag 

gatcccagta 

aaactttcaa 

agattggcac 

caatgtcgcc 

taagttcaca 

agatatgata 

ttacctacag 

tccgtattct 



tttcgcttgc 
agatcgtcga 
cgattcttat 
ctacgaactt 
tgttctcgaa 
ctttccgtgg 
tttccgcttg 
gagatggctg 
tgtactgcag 
gctttagaaa 
ggctcaaacg 
attttaaaat 
ataatgaatt 
tttcgtttta 
tttgagggct 
acggctattg 
ttcttagtgc 
gttctgcttt 
gacacggttg 
gaaattcctg 
acgccgattg 
tgctgcccag 
ctcgcccggt 
ctcaccgata 
agcgacttta 
atagtttggt 
gactactgct 
cgacaccgct 
tataacaaaa 
ttattggtgt 
ataagctggt 
ctttgtccta 
gtgcattgta 
tttaaatcta 
ggaagcttta 
tggcttccaa 
tcaatcgagt 
ccgcaatcat 
aatggaactg 
tgcgattgct 
ccatccgtct 
atgttcgccc 
aatgaggagg 
aaaccttgga 
ttcctgcggc 
gcacggctcc 



gtcccgtaaa 
ccaagtggga 
agttcaggta 
gttcctccaa 
taagagagct 
gcggccattt 
ggcagaagct 
caacgatctt 
ggttccagtg 
ctatataaac 
gaatttctaa 
ttacatcaaa 
gataatgaat 
acgataaaag 
ttcttaggct 
aaggctattt 
caagtgttgt 
taatttttac 
atatctcata 
ctagcttaac 
aaaagcattt 
ccaagaactt 
tcaaacccta 
tggctattat 
actatttctt 
tccaggatgg 
tgtaccggca 
gcacaagcca 
gtcatcaata 
gcaaactgct 
tttttatttc 
acaatcgcag 
tgctgtatag 
ttgaatggca 
aatgttatat 
cctctggctc 
cgaccctaac 
gacgggaagt 
gttgcctctg 
tttaaatcgt 
ggatctatat 
tgacagccat 
agggaaggat 
ccttggtaaa 
tctccatcgt 
acattttctg 



gaaattgtcc 
gcccacgaag 
tttaaccttc 
tctggccaaa 
ctgctcctcg 
ggcctgaacc 
ggccattcgg 
cagacgcttt 
cttatcagca 
aatacaattg 
atggattaat 
agtatcgaca 
tagacatttt 
gtcggaaatt 
ggtcagcagg 
aaactgccaa 
agtgctcagc 
gaacaactca 
cattgaaagg 
tggatactac 
aagggcctgt 
tttggccaac 
tatatacttc 
cagagatgag 
agaagaggtt 
aaaattttgg 
caactttgat 
catatctcct 
tcgttttcat 
gactgcattc 
ttatattttc 
tatatctata 
tttcgaggtt 
tttgctctcc 
cgttgacttc 
tccgtcatca 
tatcggttcc 
atttatatag 
gtcggtttta 
taccatggtt 
aagtggaccg 
ttacattagg 
taactgcata 
ctagttatcc 
gatgggcctt 
ggaatgggtc 



accacgatga 
ccagctcctc 
ggatacttgc 
ttctcccttg 
tacgccatct 
tcctcgacgc 
taaaggtaga 
ttggtggcag 
actgatatcg 
cgttatcagg 
aatctaacgc 
ggcgaacatt 
tgttggtgat 
gccaaaaaat 
gactactttc 
cgatgagcat 
atgaggcttc 
aatgcagata 
caaaacgatt 
gagttcaggc 
gtttgcagcg 
ggaaaatgcg 
acatacatgg 
ttctttgatt 
agcattcaaa 
gttaccgttg 
tcggattttc 
ggatccttag 
agctatatag 
tgtgtgccct 
tttttcagtt 
cattaagaag 
tatccagtgc 
agctggttag 
ttgcaggtta 
gctaccatac 
tgcggtacaa 
ttaatcagat 
ttcggtgctc 
acgtattgct 
tcgctggccc 
aaagtgaagg 
aactttgaca 
ttgccgtatt 
acttggatat 
gggataatat 



cctcgtgccc 
cggtaattag 
gcggtgtgct 
tgcgctggag 
gaaggctcct 
cgggaacgga 
ccaggagcag 
tcatcgtcgg 
ccacattgca 
aggtgccaga 
cagggaaaag 
ttgattttgg 
cttccccagg 
gagtttgaaa 
cgaataaagt 
cgatcgccag 
cttgggtgat 
ttcccgggtg 
cgtatttata 
agtttgggga 
tgcggccctg 
atgatggtct 
acctacaggc 
gtgatgaaat 
tcttcaataa 
acctcttcat 
caaagtccat 
agagtaattc 
aaagaaacga 
ttgcaagttt 
ctaattataa 
ctgcgaaatg 
ttgatcatga 
ttcgaattgg 
cagctcgcac 
gtggaaagtc 
cgccttcgtc 
ttgggaaaac 
ggtcaaaggt 
tttccatatc 
tgagcacttt 
ggggtataaa 
gccagacgta 
atccacatag 
tcaatgtcat 
ccgagtattt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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tcacagcgcg 
tctaatgatg 
ctttaaacaa 
tgaattttat 
caagaggtca 
ccactttcga 
tgtagttttg 
gcaaaatatg 
caaggaggtg 
ccaggacacg 
caagaatgag 
catttagagg 
ccaacctctc 
ttctgtgcaa 
aatacatatt 
attacgaata 
ttcattgaaa 
atgccttaag 



tttggaattg 
gattcttaag 
ttaaagcccc 
tgaactcaat 
ccgcgacaag 
ttctttaaaa 
cgatttcatt 
ccgaacgaaa 
gccgacgcat 
gcatcgacga 
gcttcagtgt 
ctcctcctgc 
taacttgaac 
ttactccaaa 
tgattttgat 
aaatatgtgt 
ccccaaaatt 
ctttttttgg 



ttttgttcgt 
cttgtttcta 
tcattttatt 
aaataaatat 
cttttcactt 
aacatcgata 
gcaattttaa 
acgcagagga 
cggaggcaat 
atgcagcagg 
ctgcgaatgc 
ggctgaaaaa 
tgctgtgcta 
gattagccct 
gaaatatgaa 
aatataagat 
gggcaaattt 
ggcagattag 



tctgctcgtc 
tataattcta 
aatatccttt 
agttttagaa 
cgccaccctc 
ggtatcgcca 
tttgctttac 
cctctcgggt 
gctcgaaaag 
aaacgaggat 
aagtaagttt 
atgaggttag 
ttcatttcaa 
aacggaaaaa 
tatacttaca 
ttctaccttg 
gttgtcatgg 
tcattccagt 



ctgaagcgca 
ataattgttt 
gctgaacgaa 
aacgatgtac 
tggtaacact 
aaaaaatcag 
tgttcaaaaa 
caagagctaa 
gttgttgccg 
gcggagcagc 
tcaatatgcc 
gttttgtcgg 
ttcaattttc 
taaacaactc 
tagatacaat 
ttgattccat 
gaaactagtc 
ttaaactaat 



gcacatggac 
acaattaaag 
ggaattgaat 
tttaaactgc 
gattctccag 
ctgctctgct 
gaaatacagt 
agcagaaggc 
gcttaaatat 
ctgatggtgc 
tgtgaaaagt 
gaactacagt 
aattttcatt 
ataaatcata 
tgagtttttt 
tgattgatgc 
cactgtcgtg 
cgaaaa 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3836 



<210> 86 
<211> 1536 
<212> DNA 
<213> Drosophila 



<400> 86 

atgaggcttc 

aatgcagata 

caaaacgatt 

gagttcaggc 

gtttgcagcg 

ggaaaatgcg 

acatacatgg 

ttctttgatt 

agcattcaaa 

gttaccgttg 

tcggattttc 

ggatccttag 

ctatacatta 

aggtttatcc 

tctccagctg 

atcagctacc 

ttcctgcggt 

atagttaatc 

tttattcggt 

tcgctggccc 

aaagtgaagg 

aactttgaca 

ttgccgtatt 

acttggatat 

gggataatat 

ctgaagcgca 



cttgggtgat 
ttcccgggtg 
cgtatttata 
agtttgggga 
tgcggccctg 
atgatggtct 
acctacaggc 
gtgatgaaat 
tcttcaataa 
acctcttcat 
caaagtccat 
agattctaat 
agaagctgcg 
agtgcttgat 
gttacagctc 
atacgtggaa 
acaacgcctt 
agatttggga 
gctcggtcaa 
tgagcacttt 
ggggtataaa 
gccagacgta 
atccacatag 
tcaatgtcat 
ccgagtattt 
gcacatggac 



tttctgtaca 
caactactac 
tgatgacatc 
cggttcgatt 
tattcgaatc 
caaagaggag 
acgagtacca 
gatttacatc 
atgtggactt 
ggaaaaacag 
gtggataata 
tataacaatg 
aaatgttact 
catgatacta 
gcactttttc 
agtcttgacg 
cgtctggagc 
aaacgatccc 
agattggcac 
caatgtcgcc 
taagttcaca 
agatatgata 
ttacctacag 
tccgtattct 
tcacagcgcg 
tctaatgatg 



gttctgcttt 
gacacggttg 
gaaattcctg 
acgccgattg 
tgctgcccag 
ctcgcccggt 
ctcaccgata 
agcgacttta 
atagtttggt 
gactactgct 
cgacaccgct 
atatgctttg 
ggcaagtgca 
gatcatttaa 
cggatggctt 
tcgctcaatc 
acagccgcaa 
agtaaatgga 
ccatccgtct 
atgttcgccc 
aatgaggagg 
aaaccttgga 
ttcctgcggc 
gcacggctcc 
tttggaattg 
gattct 



taatttttac 
atatctcata 
ctagcttaac 
aaaagcattt 
ccaagaactt 
tcaaacccta 
tggctattat 
actatttctt 
tccaggatgg 
tgtaccggca 
gcacaagcca 
tcctaacaat 
ttgtatgctg 
atctattgaa 
ccaacctctg 
gagtcgaccc 
tcatgacggg 
actggttgcc 
ggatctatat 
tgacagccat 
agggaaggat 
ccttggtaaa 
tctccatcgt 
acattttctg 
ttttgttcgt 



gaacaactca 
cattgaaagg 
tggatactac 
aagggcctgt 
tttggccaac 
tatatacttc 
cagagatgag 
agaagaggtt 
aaaattttgg 
caactttgat 
catatctcct 
cgcagtatat 
tatagtttcg 
tggcatttgc 
gctctccgtc 
taactatcgg 
aagtatttat 
tctggtcggt 
aagtggaccg 
ttacattagg 
taactgcata 
ctagttatcc 
gatgggcctt 
ggaatgggtc 
tctgctcgtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1536 



<210> 87 
<211> 426 
<212> PRT 
<213> Drosophila 



<400> 87 

Met Arg Leu Pro Trp Val lie Phe Cys Thr Val Leu Leu Leu lie Phe 
15 10 15 



119 
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Thr 


Asn 


Asn 


Ser 


Asn 


Ala 


Asp 


lie 


Pro 


Gly 


Cys 


Asn 


Tyr 


Tyr 


Asp 


Thr 








o n 

ZU 










£- 0 
















Val 


Asp 


lie 


Ser 


Tyr 


He 


GlU 


Arg 


bin 


Asn 


Asp 


ber 


Tyr 


Leu 


Tyr 


Asp 






OO 










a n 

*i U 










H 0 








Asp 


lie 


GlU 


lie 


Pro 


Ala 


O — 

ber 


Leu 


Thr 


Gly 


Tyr 


Tyr 


(j±U 


rfie 


Arg 


Gin 




OU 










03 










oil 










Phe 


Gly 


Asp 


Gly 


Ser 


He 


Thr 


Pro 


He 


Glu 


Lys 


His 


Leu 


Arg 


Ala 


Cys 


65 










7 0 




















o n 

ou 


Val 


Cys 


Ser 


Val 


Arg 


Pro 


Cys 


He 


Arg 


He 


Cys 


Cys 


Pro 


Ala 


Lys 


Asn 










85 










90 










95 




Phe 


Leu 


Ala 


Asn 


Gly 


Lys 


Cys 


Asp 


Asp 


Gly 


Leu 


Lys 


Glu 


Glu 


Leu 


Ala 








100 










1U0 










inn 
1 1U 






Arg 


Phe 


Lys 


Pro 


Tyr 


He 


Tyr 


Phe 


Thr 


Tyr 


Met 


Asp 


Leu 


Gin 


Ala 


Arg 






115 










120 










IOC 








Val 


Pro 


Leu 


Thr 


Asp 


Met 


Ala 


He 


He 


Arg 


Asp 


Glu 


Phe 


Phe 


Asp 


Cys 




130 










135 










14 0 










Asp 


Glu 


Met 


He 


Tyr 


He 


Ser 


Asp 


Phe 


Asn 


Tyr 


Phe 


Leu 


Glu 


Glu 


Val 


145 










150 










155 










160 


Ser 


He 


Gin 


He 


Phe 


Asn 


Lys 


Cys 


Gly 


Leu 


lie 


Val 


Trp 


Phe 


Gin 


Asp 










165 










170 










175 




Gly 


Lys 


Phe 


Trp 


Val 


Thr 


Val 


Asp 


Leu 


Phe 


Met 


Glu 


Lys 


Gin 


Asp 


Tyr 








180 










185 










190 






Cys 


Leu 


Tyr 


Arg 


His 


Asn 


Phe 


Asp 


Ser 


Asp 


Phe 


Pro 


Lys 


Ser 


Met 


Trp 






195 










200 










1 ft c 

205 








He 


He 


Arg 


His 


Arg 


Cys 


Thr 


Ser 


His 


He 


Ser 


Pro 


Gly 


Ser 


Leu 


Glu 




210 










215 










220 










He 


Leu 


He 


He 


Thr 


Met 


He 


Cys 


Phe 


Val 


Leu 


Thr 


lie 


Ala 


Val 


Tyr 


225 










230 










235 










24 0 


Leu 


Tyr 


He 


Lys 


Lys 


Leu 


Arg 


Asn 


Val 


Thr 


Gly 


Lys 


Cys 


He 


Val 


Cys 










245 










OCA 










255 




Cys 


He 


Val 


Ser 


Arg 


Phe 


He 


Gin 


Cys 


Leu 


He 


Met 


He 


Leu 


Asp 


His 








260 










265 










270 






Leu 


Asn 


Leu 


Leu 


Asn 


Gly 


lie 


Cys 


Ser 


Pro 


Ala 


Gly 


Tyr 


Ser 


Ser 


His 






275 










280 










285 








Phe 


Phe 


Arg 


Met 


Ala 


Ser 


Asn 


Leu 


Trp 


Leu 


Ser 


Val 


He 


Ser 


Tyr 


His 




290 










2 95 










300 










Thr 


Trp 


Lys 


Val 


Leu 


Thr 


Ser 


Leu 


Asn 


Arg 


Val 


Asp 


Pro 


Asn 


Tyr 


Arg 


305 










310 










315 










320 


Phe 


Leu 


Arg 


Tyr 


Asn 


Ala 


Phe 


Val 


Trp 


Ser 


Thr 


Ala 


Ala 


lie 


Met 


Thr 










325 










330 










335 




Gly 


Ser 


He 


Tyr 


He 


Val 


Asn 


Gin 


He 


Trp 


Glu 


Asn 


Asp 


Pro 


Ser 


Lys 








340 










34 5 










o c n 

350 






Trp 


Asn 


Trp 


Leu 


Pro 


Leu 


Val 


Gly 


Phe 


He 


Arg 


Cys 


Ser 


Val 


Lys 


Asp 






JJJ 










JuU 










J DO 








Trp 


His 


Pro 


Ser 


Val 


Trp 


He 


Tyr 


He 


Ser 


Gly 


Pro 


Ser 


Leu 


Ala 


Leu 




370 










375 










380 










Ser 


Thr 


Phe 


Asn 


Val 


Ala 


Met 


Phe 


Ala 


Leu 


Thr 


Ala 


He 


Tyr 


He 


Arg 


385 










390 










395 










400 


Lys 


Val 


Lys 


Gly Gly 


He 


Asn 


Lys 


Phe 


Thr 


Asn 


Glu 


Glu 


Glu 


Gly 


Arg 










405 










410 










415 




He 


Asn 


Cys 


He 


Asn 


Phe 


Asp 


Ser 


Gin 


Thr 















420 425 



<210> 88 
<211> 6238 
<212> DNA 
<213> Drosophila 

<400> 88 
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tgaacctgca 
cgctgttgcc 
gaaggcaacg 
ccgcggtcgc 
taaaccttaa 
cagatcaaat 
aatgaatgga 
acgacgaagc 
cgtattacct 
cattcaactt 
aacaaacacg 
aaatgagatt 
caattaatac 
ggtacagaga 
atacgaagtg 
tgctccgagg 
ttaagtactt 
tcgcctccgt 
ccggacgcca 
tggagcgact 
cggcgatgat 
tggcgatcta 
agatttgcat 
tttagaatgg 
tgctgaggaa 
ccattgtgac 
ttccgcttcc 
cgatgctgat 
ccgtgcgact 
cgattacaat 
gtaaccaatt 
acacgcccgg 
tgtatgcacc 
tgttgatgct 
tatttaatga 
aaattaaatt 
aatcaagttc 
gccataactg 
acgcaatcca 
gtcgtggtca 
ttgagagctc 
gaaagtaaat 
taataagcta 
taattaattg 
tatggttgta 
gattcgaacg 
aaaaaggaaa 
tattataaag 
cgttcatata 
atttttatct 
tttattggta 
tttggggcgc 
tgtagggctt 
gaacacaaaa 
gcatcgcaat 
atacctgatt 
gccaaaagct 
cgacgatttg 
tttactctag 
catgaaggga 



ccaccagcca 
agtcccgcag 
cctgctcgcg 
tacaatcgct 
tatacaagaa 
cgatctcttc 
aactgagaag 
ttcttgaaac 
acccgattcc 
atgtaatcat 
attcctgtgt 
cagtagcagt 
atttaatact 
ttaagtgagg 
ggatatattt 
gaggcggaaa 
gggggcgcgc 
catcgtcgta 
tccgctcctc 
cagcacatag 
gaagaacttg 
aaacggagat 
aatacggaat 
tatccaccca 
cgccattagt 
gctaacgctg 
ggtgctcagc 
ggcactgttt 
ccttccgatc 
gcacagcagg 
tgataggaac 
cgcatcgcga 
aaaaccaacg 
cttgcgaatg 
atgagttggc 
acactaagaa 
ataccttgtg 
ctgcaatgca 
ctcggatgaa 
gcatgcccag 
cgtaaatttc 
gcttttcctt 
cgaatataag 
ccacttttat 
caactatcga 
ccgaccaata 
tatctattat 
gatgctcctc 
tttaacattt 
acttagtgaa 
tcataaaata 
cgttgacaca 
ttttccaatg 
atgggattga 
gactttattc 
tcagcgaaca 
aaaatgtcgg 
cagtgcgaaa 
caacgggaaa 
taccctgatt 



agaccccagc 
ctgcggcaac 
ctgccgccgg 
actaagtaga 
aaaatttaca 
ccgcaatttg 
aagacggtca 
tgacgaggga 
cccctcacag 
ataaattcta 
gtcattgcgg 
gtatatatct 
tgatcttttt 
tgagaataca 
attatttaca 
cgcttttttg 
ggtatgaaga 
cgcatgcggc 
cagcgcttaa 
acatacggat 
ttcgatgcgg 
aattgttgat 
ggttgaagat 
gtactcacca 
ttggcgaact 
tgcctgtact 
tgcgtctggt 
ttgtccctgg 
acacagagca 
gtacctggaa 
ttaccgaaga 
tatcgtatac 
aatggcagga 
agctcgtagg 
aatgtttgtg 
taaaatgcat 
gtagataaag 
tccagaactg 
cgactgcagg 
tagagccacc 
ggatagctta 
gccattagcg 
ttgggtataa 
aaaccgaaac 
ttttggttat 
gcaatgaata 
atggatattt 
aaatttctcc 
gtatatttat 
atatgtgact 
ttgcatttgt 
ggaaactcat 
gaaatatatg 
caattcccgg 
atttttgttt 
agcgattaaa 
aactgttcat 
tgcaaaatcg 
atacacacag 
tatcgacacc 



caagactccg 
gtcggcagac 
agccaaggcc 
cgtcgaagga 
ttcacgccca 
acatttgcat 
acttttagtt 
gaacacaaac 
aacgcatacc 
taataaacaa 
gatatggaaa 
atgagggctg 
acacaagcgt 
tacatataag 
gcttacgaga 
agtgagatcc 
tcgagtggaa 
tctggtcgct 
tgcatcccag 
ccgaggtgaa 
gcacccgatt 
aaaggccatg 
actagacgtt 
tctgtggcat 
tgatctcgac 
gccttgccgg 
agagggtggt 
aaaccaggtc 
aggtgtgggc 
tacatcgatg 
ccataaagag 
acttaagctg 
acgtgatcac 
taatgtgcta 
atttcctgtg 
agaaaatgtt 
ggtctggcta 
agcccgaaaa 
acttcctttg 
agattgttgg 
gctttatggc 
cccaaattgc 
gatcgataaa 
tttcaatcaa 
ctaagacgtg 
attattttgg 
aaaaatttaa 
attgaatatt 
actaatataa 
gtcaatggtg 
ggggcacgaa 
ttgtctccgt 
tatatatata 
ggattggcct 
tcattatttg 
atgcatttta 
tacagggcta 
ttttgctaac 
ctattccgct 
gctcaagtgc 



accaaggcag 
gcctctccat 
acgccgcagc 
attcgtaatt 
ggaggaggat 
cccaatttgc 
gaaacagttg 
aagttaattg 
atgccagcca 
agaaacaaaa 
ttggctttca 
tacgatttcg 
tatatttaat 
gaaagccttg 
ctaggcgcta 
cactgagtcg 
tcagttgaat 
ctgggaacga 
caaggaccag 
gtgcagcgcc 
tggagctata 
taaatttatt 
tagtagttac 
ccagcagatg 
tggcgaggag 
ctgtacactg 
gccgctgctg 
gtagtgcata 
cacgaagagg 
gtcaaaccaa 
caccgcatac 
gtccggattg 
cacggcgatc 
tggcgagaga 
tgcacttgta 
atttattata 
gggccatcca 
atcgccaaac 
agagatacat 
tcgccaggca 
atttcgaagc 
ctacagttcg 
atgtaataac 
gatacctcaa 
actcggtcct 
ttcgcaaaat 
ttcatttttt 
gtgtatattc 
actaaagtac 
acttagatta 
tgtgtttcga 
gtgaaatgaa 
taaaacacac 
aacagtccgt 
tttgtgcaca 
tggtgcattg 
aaatgtctaa 
ttgtaaaggc 
tttcgttcat 
cattcaccag 



ctgctccagc 
ctccggccaa 
gacaacgcgc 
taacataaca 
gaagaaagaa 
atatacactg 
aatatattgt 
aatcgagaat 
gcgaaccaac 
ctagttgtaa 
atctgttgca 
ttttagcgca 
gctacaaatc 
atatagtaag 
ctcccaggaa 
ggaaatggga 
tccagagtat 
cgaagtccgc 
ttgatggact 
gtaagaacat 
gccaacggga 
atattatatt 
tatctgcaaa 
acgaacgaga 
tcggtggccg 
cgatggctgt 
ctctcgggat 
tgccgcttgg 
ttgcaggcca 
tatttgaaat 
gtcttgttcc 
gattcatcga 
atcagaatgc 
gatttgtgaa 
tcatttctta 
catatattgc 
tctttctgcc 
cacgagtccc 
tttcagcagc 
tctgaaatag 
tcataaatcg 
ttgggttctt 
gaattgcttt 
agctgctcca 
taaacgcttt 
gctcaattat 
tgaatgttta 
actttatatt 
agatccctgt 
agtgtttcct 
tttgattgtt 
actatatgaa 
gcttggaata 
gcatcgttag 
ttgcaccttt 
taaaagtttc 
atcttcaatt 
aaaatcattg 
agtcaaaaag 
cttcattcaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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tcaatcaaat tacagatcaa aggctctgca tgaaggaggc ctgatgcctg aagtggaatt 3660 
cgcgttaatt tatgcttcct tgacggctca aattgctggc gacagcaatt tacgcatatt 3720 
acgcatacgc actgtcgtct cagcaaaaaa aaaatgttgg gaaaggagca aatcgcaaac 3780 
aagctcactg aattcgccat gggccaatgc tgataaatgt aatataaatt gtgacaaata 3840 
tttggcttat gaactaaaag aatttccgtg tgtgtcatag attttatcac ttctttacag 3900 
tacacaccct accgaatgat taacacgatg tttaaaatat caacaaagtc tattatgagt 3960 
attcgtattg agaagcgaat attttacaag aaattattat tattatatta atattaaaag 4020 
gaatttggtg aaatacacaa gttttatttc actcatattt acaaaggaat agattatcat 4080 
ttttgagtat tattatatct accagttttt tgcacctaag caaagttaat ttaaatacat 4140 
attcatataa atcttttccc cagtatgcac atttgccaat caagtcaacc accgatcgcc 4200 
gtcttcagtt acaccataaa gcttttaatg cattcgcaat tggctgcatt cccacgtcgc 4260 
cgataagatt gacagaaaac agactggaat atggaatatg gacaaaatct caatgcccaa 4320 
aacacatggc caacgaacta attccgtcat ttgataagat atgcaaaaat aaaacagcaa 4380 
aaaccatgcc aagcaaaaca attcaaacta tgcaaatcag acccgcatag acatattata 4440 
gagcaccaaa gtggccgcag ttgagagcag gttgaataaa cttacatagc catgaactgg 4500 
acacaaacag tggcaaacat taactgcatg aaatgagtag ctacacgctg agaaaatatg 4560 
caaaatgtca gctcaataat attattaaca ataatactat atggttttgc atgcttaata 4620 
tttatgcaaa tcaaatagaa tttggtgaac tacacaacta taaattacta ctaagatggg 4680 
tacattttga aagattttag ggcaagcaaa acctactttt tgcattaata tattattatt 4740 
tccactatgc atagctgcac ttaccgcagc atgagcgtgt atttgctgtt tttgttgagc 4800 
ttcttgcgcg ccaaaatgaa aagggccagc gaattgccaa aaacgcccag aaccatgata 4860 
ataacaccaa taatcaggcg atttcgattt gcatagagcg ctttcggagg cgccacagta 4920 
gttgagttgt gcggcatcaa caggtcgagc acaggtgttg tggtttccat tttgccggta 4980 
tgataattgc gttcgattgc aggtgattca ggtgaaaatc taactggccg caaacgatat 5040 
ttctttttca taaatgagcc tgccgcaaag gcaaggcagc tcgcaaatgc actggtagat 5100 
ggaaacccgt ttcgatagcc aaattgggac ttttttccta agacacttaa ttaatgatta 5160 
gcgcacgcga cacttgatca accaacgacc gaccaattcg aaggcgattc gattcgatcc 5220 
gaggagcgcg tccgcagtca gccaggcgtc gatttagaat gactgccgac cgtgccgagc 5280 
agagcactcg tcggctgctg cccatagctc caagctatcc aagccatcat atacacgtgg 5340 
gggtggtggt aggggccagt tcggagcgag gccagttcgg ctggcgccgc cgccggtcgg 5400 
atcgcatcgg atgtagcata cagcacatag acacccccaa gcgaagcggc cacccacatc 54 60 
cacgcccagc gatggcgacc accccatcga aaattcacag ctcgtgccag cgcctttgcc 5520 
ttttggacgt gtgcgaacgg tcggaatagc acgacgaatt ttattgtttt atactctctc 5580 
cacctgcggc tcactttaat ttcactttca actggatgta atattaatta tgcagttttt 5640 
aagcttccat ttgattttat acagttgact aaattagaat aaaataatat actcaaaagt 5700 
ttacttcagc aatatattta ctcatgtttc tttgtcattg aaaaattaac aaaacaaaaa 5760 
aacagctatt tcatagatag attaattatt attattaaga gtcttcagat tcgtcatctt 5820 
tagtatatgg tgtgtcattt gaaaacctat ttttgaggaa tatatgaact gaaatctgcg 5880 
tatactttaa ttactgtaat gtattaaagt attgtttctt ttatacaaaa agtctaaaaa 5940 
tataagcaaa caaattattt tactgaaaat ttgtttttta tgccactttt aagaaatttt 6000 
gcgaattaaa catatatttt taaagctagg agcctggtaa ttatagtagt ttagctttaa 6060 
ctctttgctg aaaatatttc caactaattt atttgttttc aacagaatgc attacgaaag 6120 
ataaaaatta aatttgcagt tactaagcgt tgcagttgca ccttttctta ttcttatctt 6180 
cttagtatat agtactgttt tgttgggcaa aaatgtttat tgtttagttt caactgcc 6238 



<210> 89 
<211> 1507 
<212> DNA 
<213> Drosophila 



<400> 89 

actgcggacg 

atcaagtgtc 

ctatcgaaac 

ctcatttatg 

aatcgaacgc 

gatgccgcac 

cctgattatt 



cgctcctcgg 
gcgtgcgcta 
gggtttccat 
aaaaagaaat 
aattatcata 
aactcaacta 
ggtgttatta 



atcgaatcga 
atcattaatt 
ctaccagtgc 
atcgtttgcg 
ccggcaaaat 
ctgt.ggcgcc 
tcatggttct 



atcgccttcg 
aagtgtctta 
atttgcgagc 
gccagttaga 
ggaaaccaca 
tccgaaagcg 
gggcgttttt 



aattggtcgg 
ggaaaaaagt 
tgccttgcct 
ttttcacctg 
acacctgtgc 
ctctatgcaa 
ggcaattcgc 



tcgttggttg 60 
cccaatttgg 120 
ttgcggcagg 180 
aatcacctgc 240 
tcgacctgtt 300 
atcgaaatcg 360 
tggccctttt 420 
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cattttggcg cgcaagaagc tcaacaaaaa cagcaaatac acgctcatgc tgcgatgcct 480 

ggcgaccaac aatctggtgg ctctactggg catgctgacc acgacgctgc tgaaaatgta 540 

tctctcaaag gaagtcctgc agtcgttcat ccgagtggat tgcgtgggac tcgtggtttg 600 

gcgatttttc gggctcagtt ctggatgcat tgcagcagtt atggcggcag aaagatggat 660 

ggccctagcc agacccttta tctaccacaa gcacattacc tacgagctca ttcgcaagag 720 

catcaacagc attctgatga tcgccgtggt gatcacgttc ctgccattcg ttggttttgg 780 

tgcatacatc gatgaatcca atccggacca gcttaagtgt atacgatatc gcgatgcgcc 840 

gggcgtgtgg aacaagacgt atgcggtgct ctttatggtc ttcggtaccc tgctgtgcat 900 

tgtaatcgtg gcctgcaacc tcttcgtggc ccacaccttg ctctgtgtga tcggaaggag 960 

tcgcacggcc aagcggcata tgcactacga cctggtttcc agggacaaaa acagtgccat 1020 

cagcatcgat cccgagagca gcagcggcac caccctctac cagacgcagc tgagcaccgg 1080 

aagcggaaac agccatcgca gtgtacagcc ggcaaggcag tacaggcaca gcgttagcgt 1140 

cacaatggcg gccaccgact cctcgccagt cgagatcaag ttcgccaaac taatggcgtt 1200 

cctcagcatc tcgttcgtca tctgctggat gccacagatg atcgccatcc cgttggctat 1260 

agctccaaat cgggtgcccg catcgaacaa gttcttcatc atcgccgatg ttcttacggc 1320 

gctgcacttc acctcggatc cgtatgtcta tgtgctgagt cgctccaagt ccatcaaetg 1380 

gtccttgctg ggatgcatta agcgctggag gagcggatgg cgtccgggcg gacttcgtcg 1440 

ttcccagagc gaccagagcc gcatgcgtac gacgatgacg gaggcgaata ctctggaatt 1500 

caactga 1507 



<210> 90 
<211> 412 
<212> PRT 
<213> Drosophila 



<400> 90 
Met Glu Thr Thr 
1 

Thr Thr Val Ala 
20 

He He Gly Val 
35 

Ala Leu Phe He 
50 

Thr Leu Met Leu 
65 

Gly Met Leu Thr 

Leu Gin Ser Phe 
100 

Phe Phe Gly Leu 
115 

Arg Trp Met Ala 
130 

Tyr Glu Leu He 
145 

Val He Thr Phe 

Ser Asn Pro Asp 
180 

Val Trp Asn Lys 
195 

Leu Cys He Val 
210 

Leu Cys Val He 
225 

Asp Leu Val Ser 

Ser Ser Ser Gly 
260 



Thr Pro Val Leu 
5 

Pro Pro Lys Ala 

He He Met Val 
40 

Leu Ala Arg Lys 
55 

Arg Cys Leu Ala 
70 

Thr Thr Leu Leu 
85 

He Arg Val Asp 

Ser Ser Gly Cys 
120 

Leu Ala Arg Pro 
135 

Arg Lys Ser He 
150 

Leu Pro Phe Val 
165 

Gin Leu Lys Cys 

Thr Tyr Ala Val 
200 

He Val Ala Cys 
215 

Gly Arg Ser Arg 
230 

Arg Asp Lys Asn 
245 

Thr Thr Leu Tyr 



Asp Leu Leu Met 
10 

Leu Tyr Ala Asn 
25 

Leu Gly Val Phe 

Lys Leu Asn Lys 
60 

Thr Asn Asn Leu 
75 

Lys Met Tyr Leu 
90 

Cys Val Gly Leu 
105 

He Ala Ala Val 

Phe He Tyr His 
140 

Asn Ser He Leu 
155 

Gly Phe Gly Ala 
170 

He Arg Tyr Arg 
185 

Leu Phe Met Val 

Asn Leu Phe Val 
220 

Thr Ala Lys Arg 
235 

Ser Ala He Ser 
250 

Gin Thr Gin Leu 
265 



Pro His Asn Ser 
15 

Arg Asn Arg Leu 
30 

Gly Asn Ser Leu 
45 

Asn Ser Lys Tyr 

Val Ala Leu Leu 
80 

Ser Lys Glu Val 
95 

Val Val Trp Arg 
110 

Met Ala Ala Glu 
125 

Lys His He Thr 

Met He Ala Val 
160 

Tyr He Asp Glu 
175 

Asp Ala Pro Gly 
190 

Phe Gly Thr Leu 
205 

Ala His Thr Leu 

His Met His Tyr 
240 

He Asp Pro Glu 
255 

Ser Thr Gly Ser 
270 
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Gly 


Asn 


Ser 


His 


Arg 


Ser 


Val 


Gin 


Pro 


Ala 


Arg 


Gin 


Tyr 


Arg 


His 


Ser 






275 










280 










285 








Val 


Ser 


Val 


Thr 


Met 


Ala 


Ala 


Thr 


Asp 


Ser 


Ser 


Pro 


Val 


Glu 


He 


Lys 




290 










295 










300 










Phe 


Ala 


Lys 


Leu 


Met 


Ala 


Phe 


Leu 


Ser 


He 


Ser 


Phe 


Val 


He 


Cys 


Trp 


305 










310 










315 










320 


Met 


Pro 


Gin 


Met 


lie 


Ala 


lie 


Pro 


Leu 


Ala 


He 


Ala 


Pro 


Asn 


Arg 


Val 










325 










330 










335 




Pro 


Ala 


Ser 


Asn 


Lys 


Phe 


Phe 


lie 


lie 


Ala 


Asp 


Val 


Leu 


Thr 


Ala 


Leu 








340 










345 










350 






His 


Phe 


Thr 


Ser 


Asp 


Pro 


Tyr 


Val 


Tyr 


Val 


Leu 


Ser 


Arg 


Ser 


Lys 


Ser 






355 










360 










365 








lie 


Asn 


Trp 


Ser 


Leu 


Leu 


Gly 


Cys 


He 


Lys 


Arg 


Trp 


Arg 


Ser 


Gly 


Trp 




370 










375 










380 










Arg 


Pro 


Gly 


Gly 


Leu 


Arg 


Arg 


Ser 


Gin 


Ser 


Asp 


Gin 


Ser 


Arg 


Met 


Arg 


385 










390 










395 










400 


Thr 


Thr 


Met 


Thr 


Glu 


Ala 


Asn 


Thr 


Leu 


Glu 


Phe 


Asn 











405 410 



<210> 91 
<211> 5340 
<212> DNA 
<213> Drosophila 



<400> 91 

tacactcatg 

tacgcgggta 

catcgaactg 

tatttttgca 

tgaacgtttg 

caaaagggct 

atggcctacc 

agtattattc 

gtaaatgtgt 

tggttatcat 

cttagtgcca 

acgattctaa 

attaaatttt 

cagatattcc 

ggaggtgtaa 

aagtgtctgg 

aaccaccagc 

ccttgaggat 

gattcatcgg 

tgcagggcgg 

aggccacgaa 

ccgacttgga 

tgaagtgctt 

aactcttctc 

cgtcttgttc 

gcatcttgtt 

gtggtgttac 

tgggtgagag 

ccataggact 

acaggcagat 

tgtacgagac 

tctggttgtg 

tgggagtttg 

gtggtggagg 



aatattcaaa 
tgtatgctgg 
cttaacccca 
atattctaag 
aaatcaaaat 
tcgactggca 
cccttcatat 
tatattggga 
atatcttatg 
agattggtta 
ctctagacct 
ctatagcaca 
gaatttaata 
attaagcgga 
ggggtttggg 
tcctttgcgc 
agcttaacca 
ccgcgtaaag 
gctgttggcg 
atgccggcag 
gcagcccaaa 
cttctgccgt 
tcctatcgaa 
ctccctgttg 
cccgcctcca 
ggtattactg 
acccgccgcc 
tggcattggt 
ggaggcccgg 
ggcctcaatg 
gagaagacgt 
attcggattc 
actgaggctc 
caggactggt 



caaaactctt 
gttttgtgtt 
cgacggcaac 
tactataacc 
taggtccgat 
tatccctcaa 
gtctgctttt 
tatgctttag 
cttaatttat 
ttaacattaa 
atgtatgcat 
cacctttggc 
agtgtttgat 
gcactttcgt 
aatggaaaag 
aagtgcagtg 
cttcagatcc 
gtcttcttga 
tagcagagga 
aatccctcga 
atgaacgaga 
ccacccactc 
tgaatgaatg 
gcatcgtcat 
cccacaccgc 
tccgaggttt 
gactggggcg 
ggcatcgggg 
cgatggggac 
cagctctccc 
ttttcaatgg 
aggttgaggc 
tggttcaggt 
ggtttcccct 



caacttgtct 
cttaaaatcg 
cattcgtcga 
taccaaaata 
tttgcgtagt 
agggttaaca 
cattgatata 
agttttggtt 
tttattagct 
aatatttatt 
cccaaatcgg 
tacttacaga 
tctaatactt 
tgggtatgta 
tttgcaaggc 
caaaatggaa 
ccaaaccatc 
actgctgatt 
agtacgagaa 
cgagggccaa 
tcgtgcgaaa 
caaggatcag 
cccttttgct 
cccccgcgtt 
ccgtactgct 
ctcgctcctc 
ttcccgtttg 
aatagagtat 
agtccgcagc 
gcctcacctt 
ccaccgcctc 
tcaggctttg 
tcgattttgg 
ggacgctgtc 



ggcatatatc 
tatatctgct 
aaaagtttga 
actgtaaaag 
tttcgacaaa 
tgcttttctt 
ttctgcggaa 
tgtagggttt 
atcatatcat 
gtctaaaaca 
attgcttgtt 
taactcttcc 
tgctgatggt 
tagaaaactt 
gccagactct 
atctatcagg 
ttaggtcatg 
ggccagagcg 
catgtagagg 
cacatggtag 
cgctttgcga 
gggtagagcc 
ggagtttcgg 
ctttttggtg 
ggtggtgccc 
aatcctttcc 
tggcgttgaa 
ggcatccttg 
tagctttccg 
agcaaagtcc 
ctcgatactc 
gctgggactc 
tggatttcgg 
attcggggag 



catagacaat 
gcaggattga 
acacagttta 
aaagtttcta 
gtgtcaccgt 
tggtcttctt 
caacaaaaga 
tctagtctcg 
atactttgca 
tttcgccccg 
ttccccccca 
ctgtccaccg 
cgaaatccag 
atgggcctaa 
cggggctctg 
gatttaactc 
tgcagatccc 
tagcaaaaag 
tgttcattga 
ggtgtccagc 
gcacgattct 
ttcttgctct 
ctggccgcca 
gcaccaacac 
gaggtggagg 
agtggcaccg 
gtctttacct 
ctgctggtct 
ggcacatcca 
tcgattcctg 
tgactctgac 
tggttttggc 
cgtgcgggcg 
gtcgtcgagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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tggccgtcac 
tcatcaggtc 
ccgagtagtc 
tctgcaccag 
gggacagggg 
catgctcggc 
ggttcggctg 
gggagcgcat 
gttggtggtg 
gcaagttacc 
tcgatgaccg 
tgacggatga 
catccacggc 
tctgttccgg 
cggaaccact 
ccgcaagttc 
gacctgccat 
gcttcgcctc 
caaaaagcac 
gatccttgag 
gctcccatcc 
ccatatatat 
agcagtagcg 
agtcgacgga 
cgtatatcgt 
ataatttttt 
tgctagggag 
cagtagcacc 
tatcaggatc 
ggaactggat 
aaggttcgta 
ggaattgaag 
cgtgtgcgtc 
gctcctggaa 
tctctatctc 
ctaacaatgt 
tctaataggc 
atggtataga 
gacttgttaa 
gctccctgct 
ttcaagttgt 
ctccatggaa 
gcagccacag 
aatatatata 
tttatatatg 
ttttccatcc 
ggaatggaat 
aaccagtcgt 
tccgaataag 
tttgatttgg 
gacaaaaaaa 
aaatctattt 
ataatggaaa 
aaactttctt 
ttgcaaccgt 



actggccagc 
ataggtcagg 
atcggagagc 
accctgatgc 
atcactggtg 
tatctgggac 
gttttggtgg 
ctcttccttg 
gtggtactgg 
tttgccacga 
cttcctcatg 
ttcctcatcg 
tgcctcacgt 
tgtcatcttc 
taatctctta 
cactttggtc 
tccggacatc 
actgcgcttt 
aattagtgtg 
aaactgaacg 
aaaaatcgag 
cactctggtt 
atctatcgtg 
aagccagaga 
gtagaagggc 
ttttttgaat 
gcaataaagt 
aggatattac 
gtctgccaca 
cccgacatgg 
ttgttcacat 
tcactgctct 
aggactcgct 
gtgcgaaagt 
tatctctatc 
atctttttaa 
gcctttattc 
tggagtcgct 
gcgaccgcat 
ccccgcactt 
tcctccttaa 
tgccaagact 
tcttggttag 
tgcatataaa 
tatacataat 
aaaaattaat 
atttcgttga 
tttccatcca 
gaatggaata 
aaccatttat 
atatttttcc 
gaaaaataca 
ttaaattaaa 
ttcgtgtagt 
taactggcaa 



atggcggagc 
gccagttcac 
ggggtggggc 
acttgatcgc 
ttgagtaatg 
agtgtggagc 
tgctggtggt 
ctctgcgccc 
tgcatggcct 
gtggccagca 
ttgttcaatt 
gaatcgaagg 
tttttcgcct 
tccacttcct 
atcactgtgg 
tttaggattt 
gctccagcac 
tgcatatcat 
gtccaatagt 
gcgcactggc 
atgaagaaga 
ctcgtcctcc 
atcaacagca 
tcgcagagca 
atcgaaacgg 
ttaaggtaaa 
agttgctggg 
cgcccaccgt 
gtgcaaatgg 
tggaaccgcc 
cgaagttcct 
cgccatagat 
tgaaaatgaa 
aaatatatgc 
tctatctcta 
gatctccatt 
tattaagaca 
ggagttgcca 
ccttaaccgc 
cgcctctgcg 
tgtgctgtcg 
tgtctccggt 
cacttgaaaa 
atatgatata 
atgataaaaa 
tgttttttaa 
gatcgtaaat 
aaaattaatt 
tctcgttgag 
ttttttcacg 
gttttagttg 
ttgtcttcat 
tcaaaattag 
cttcgtggac 
ccggcgactg 



tctcgtccat 
tgttaccgga 
actgaaaggc 
tagggacctc 
ttggaggagc 
aggatgtgcg 
gcggttgttg 
gctgtaacgg 
cgatcgactt 
gggcggcctg 
tctgctgggt 
caggactgct 
cctcctggat 
cggtggccac 
tgcaggcact 
tctcctcaat 
tcagtgccac 
aggctgtttt 
agccaatgat 
caggaagtag 
gcagagcggg 
aactccggta 
ccgtgtactg 
tcggacccag 
ttcctgaaat 
atagttacta 
ctgccgaata 
aagaatgata 
tggcaaaaca 
atcagaggat 
gtaccattgc 
cttggccgcc 
gctgcacagc 
ttaattagag 
tctctaaact 
ttctaaaagc 
actcaccagt 
ttcaggccag 
tccgccctct 
ctccacaacc 
acggcggttg 
caggtgcagc 
aatatagagg 
atatatatat 
gtctgaattt 
tcgaattttt 
aaattttaaa 
gttttttaat 
atcgtaaata 
atgttgccac 
cgcgaccatt 
tagatggtgg 
taatgaaaat 
gtcttcgctt 
gcgatgcaat 



atagcgtagc 
actattccca 
gtcgggaggt 
gatgggtgcc 
ggacaggtcg 
cttgggcctg 
ctggttctgg 
actctggttg 
ggagttagtg 
tgccccaaag 
gatcaactgc 
ggatctctcg 
ttttgctgcc 
ggccagagga 
atccaaatcg 
cacaccaatg 
catggattgc 
gtagatcccg 
tagggccgta 
atcccgcttg 
aatgatccaa 
cttggcggca 
ggacaccaga 
gtcccaataa 
atgggtttat 
cctattagca 
ttcctgtcca 
cagatcgcca 
ggaccaagaa 
tgatttctaa 
agctgttcat 
tccaacagac 
ttatcccgct 
ttgacataaa 
aatgtccttg 
tttgccactg 
cggtgatcca 
tcagctcctc 
tcccgctctg 
tctgcggttt 
tctcggaatg 
atctgcaaga 
ttattaatat 
gcatataaaa 
cagaattctg 
ggtaataata 
tccgtgtgtg 
cgaatttttg 
aattttaaaa 
tttccaaaaa 
tttagccaaa 
aaaataatga 
caagtattta 
ccgccaagtt 
ttgcttgtgg 



tccgagttcg 
aagggacgat 
ggcaaaatgg 
aagtccacgg 
accaaacgat 
tccagcggtg 
tggtggtgca 
tgatggtgct 
gtgctcttct 
accagaccaa 
tgggcctggt 
ctcttttccg 
gacgctctcc 
tttgcctgac 
gtttgatctc 
ccagctgcat 
attttccgtt 
gcatataaga 
ttaaagatcg 
ccggtgaagt 
gtgattgtga 
atcttcacag 
cacacagtat 
cccttgagca 
agtacaatta 
tatccgtggc 
cgatgaaggc 
ggcatatggc 
tctccgagtc 
cagccaacag 
taaccggcca 
gtttgcactc 
cccttctaag 
actatctcta 
aaaacaaatt 
aaattactat 
cttgttcttg 
ctcgcccgcc 
gccacgtccc 
ttatggtaat 
cttacattgc 
ggcaacaaaa 
tatatgatat 
tatgatataa 
ccagaagtcg 
atccgaataa 
tttgatttgg 
gtaataataa 
tatatgtgtg 
ttcgaaaatt 
taatagtaaa 
aaatatgaaa 
ctaaataaag 
taaactaatt 
cagttgcttc 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



<210> 92 
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<211> 3060 

<212> DNA 

<213> Drosophila 



<400> 92 

atgcggtcgc 

agcgactcca 

cgggataagc 

ctgttggagg 

gaacagctgc 

agaaatcaat 

cttggtcctg 

gcgatctgta 

gacaggaata 

ctaataggaa 

ctgggtccga 

cagtacacgg 

taccggagtt 

cccgctctgc 

ctacttcctg 

atcattggct 

aaaacagcct 

gtggcactga 

attgaggaga 

agtgcctgca 

gtggccaccg 

atccaggagg 

agcagtcctg 

acccagcaga 

caggccgccc 

aagtcgatcg 

ccgttacagc 

caacaaccgc 

cgcacatcct 

gctcctccaa 

gaggtcccta 

gcctttcagt 

tccggtaaca 

atggacgaga 

gacagcgtcc 

ccaaaatcga 

caaagcctga 

gaggcggtgg 

aaggtgaggc 

gctgcggact 

atactctatt 

caaacgggaa 

gaggagcgag 

agcagtacgg 

aacgcggggg 

agcaaaaggg 

ctgatccttg 

tttcgcacga 

ttggccctcg 

ttctcgtact 

aatcagcagt 



ttaacaagtc 
tctataccat 
tgtgcagctt 
cggccaagat 
aatggtacag 
cctctgatgg 
ttttgccacc 
tcattcttac 
ttcggcagcc 
ccgtttcgat 
tgctctgcga 
tgctgttgat 
ggaggacgag 
tcttcttcat 
gccagtgcgc 
actattggac 
atgatatgca 
gtgctggagc 
aaatcctaaa 
ccacagtgat 
aggaagtgga 
aggcgaaaaa 
ccttcgattc 
aattgaacaa 
tgctggccac 
aggccatgca 
gggcgcagag 
accaccagca 
gctccacact 
cattactcaa 
gcgatcaagt 
gccccacccc 
gtgaactggc 
gctccgccat 
aggggaaacc 
acctgaacca 
gcctcaacct 
ccattgaaaa 
gggagagctg 
gtccccatcg 
ccccgatgcc 
cgccccagtc 
aaacctcgga 
gcggtgtggg 
atgacgatgc 
cattcattca 
gagtgggtgg 
tctcgttcat 
tcgagggatt 
tcctctgcta 
tcaagaagac 



ggcgggcgag 
caagaacaag 
cattttcaag 
ctatggcgag 
gaacttcgat 
cggttccacc 
atttgcactg 
ggtgggcggt 
cagcaactac 
gcccttctac 
tctctggctt 
cacgatagat 
aaccagagtg 
ctcgattttt 
cgttcagttt 
cacactaatt 
aaagcgcagt 
gatgtccgga 
gaccaaagtg 
taagagatta 
gaagatgaca 
acgtgaggca 
cgatgaggaa 
catgaggaag 
tcgtggcaaa 
ccagtaccac 
caaggaagag 
ccaccaaaac 
gtcccagata 
caccagtgat 
gcatcagggt 
gctctccgat 
cctgacctat 
gctggccagt 
accagtcctg 
gagcctcagt 
gaatccgaat 
acgtcttctc 
cattgaggcc 
ccgggcctcc 
accaatgcca 
ggcggcgggt 
cagtaatacc 
tggaggcggg 
caacagggag 
ttcgatagga 
acggcagaag 
tttgggctgc 
ctgccggcat 
cgccaacagc 
ctttacgcgg 



gaggagctga 
tggatcaccg 
cgagtcctga 
agcagtgact 
gtgaacaata 
atgtcgggat 
tggcagacga 
aatatcctgg 
tttattgcct 
acgatatacg 
tccgtcgact 
cgctactgct 
atatatatgg 
ggatgggagc 
ctcaaggatc 
gtgctttttg 
gaggcgaagc 
atggcaggtc 
gaacttgcgg 
agtggttccg 
ccggaacaga 
gccgtggatg 
tcatccgtca 
cggtcatcga 
ggtaacttgc 
caccaccaac 
atgcgctccc 
cagccgaacc 
gccgagcatg 
cccctgtccc 
ctggtgcaga 
gactactcgg 
gacctgatga 
gtgacggcca 
cctccaccac 
caaactccca 
cacaaccaga 
gtctcgtaca 
atctgcctgt 
agtcctatgg 
ctctcaccca 
gtaacaccac 
aacaagatgc 
gaacaagacg 
gagaagagtt 
aagcacttca 
tccaagtcgg 
ttcgtggcct 
ccgccctgca 
ccgatgaatc 
atcctcaagg 



ctggcctgaa 
actggagcct 
cgcacacgga 
tcaattcctg 
cgaaccttct 
ccagttccga 
tcctgatagc 
tgctactggc 
ccctagcagc 
tgctcaaggg 
atactgtgtg 
ctgtgaagat 
tcacaatcac 
acttcaccgg 
cgatctttaa 
tcttatatgc 
aacggaaaat 
atgcagctgg 
gagatcaaac 
gtcaggcaaa 
ggagagcgtc 
cggaaaagag 
accaggccca 
ttggtctggt 
agaagagcac 
agcaccatca 
tgcaccacca 
caccgctgga 
atcgtttggt 
ccgtggactt 
ccattttgcc 
atcgtccctt 
cgaactcgga 
actcgacgac 
cgcccgcacg 
gccaaaacca 
gtcagagtca 
caggaatcga 
tggatgtgcc 
agaccagcag 
aggtaaagac 
cggtgccact 
cctccacctc 
gtgttggtgc 
tggcggccag 
agagcaagaa 
agaatcgtgc 
gctggacacc 
tcaatgaaca 
ctttttgcta 
gggatctgca 



tggcaactcc 
tagaagggag 
gtgcaaacgt 
gccggttaat 
gttggctgtt 
ctcggagatt 
catatgcctg 
cttcatcgtg 
cacggatatg 
ttattgggac 
tctggtgtcq 
tgccgccaag 
ttggatcatt 
caagcgggat 
tacggcccta 
cgggatctac 
gcaatccatg 
cattggtgtg 
cgatttggat 
tcctctggcc 
ggcagcaaaa 
cgagagatcc 
gcagttgatc 
ctttggggca 
cactaactcc 
caaccagagt 
ccagaaccag 
caggcccaag 
cgacctgtcc 
ggcacccatc 
acctcccgac 
tgggaatagt 
gctacgctat 
ctccccgaat 
ccgaaatcca 
gagtcccagc 
gagtatcgag 
ggactttgct 
cggaaagcta 
caaggatgcc 
ttcaacgcca 
ggaaaggatt 
gggcaccacc 
caccaaaaag 
ccgaaactcc 
ggctctaccc 
tcgcaaagcg 
ctaccatgtg 
cctctacatg 
cgctctggcc 
catgacctaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



<210> 93 
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<212> PRT 

<213> Drosophila 



<400> 93 



Met 


Arg 


Ser 


Leu 


Asn 


Lys 


Ser 


Ala 


1 








5 








Asn 


Gly 


Asn 


Ser 


Ser 


Asp 


Ser 


He 








20 










Thr 


Asp 


Trp 


Ser 


Leu 


Arg 


Arg 


Glu 






35 










40 


Phe 


Lys 


Arg 


Val 


Leu 


Thr 


His 


Thr 




50 










55 




Ala 


Lys 


He 


Tyr 


Gly 


Glu 


Ser 


Ser 


65 










70 






Glu 


Gin 


Leu 


Gin 


Trp 


Tyr 


Arg 


Asn 










85 








Leu 


Leu 


Ala 


Val 


Arg 


Asn 


Gin 


Ser 








100 










Gly 


Ser 


Ser 


Ser 


Asp 


Ser 


Glu 


He 






115 










120 


Ala 


Leu 


Trp 


Gin 


Thr 


He 


Leu 


He 




130 










135 




lie 


Leu 


Thr 


Val 


Gly 


Gly 


Asn 


He 


145 










150 






Asp 


Arg 


Asn 


He 


Arg 


Gin 


Pro 


Ser 










165 








Ala 


Thr 


Asp 


Met 


Leu 


He 


Gly 


Thr 








180 










Tyr 


Val 


Leu 


Lys 


Gly 


Tyr 


Trp 


Asp 






195 










200 


Trp 


Leu 


Ser 


Val 


Asp 


Tyr 


Thr 


Val 




210 










215 




Leu 


Leu 


He 


Thr 


He 


Asp 


Arg 


Tyr 


225 










230 






Tyr 


Arg 


Ser 


Trp 


Arg 


Thr 


Arg 


Thr 










245 








Thr 


Trp 


He 


He 


Pro 


Ala 


Leu 


Leu 








260 










Glu 


His 


Phe 


Thr 


Gly 


Lys 


Arg 


Asp 






275 










280 


Gin 


Phe 


Leu 


Lys 


Asp 


Pro 


He 


Phe 




290 










295 




Tyr 


Trp 


Thr 


Thr 


Leu 


He 


Val 


Leu 


305 










310 






Lys 


Thr 


Ala 


Tyr 


Asp 


•Met 


Gin 


Lys 










325 








Met 


Gin 


Ser 


Met 


Val 


Ala 


Leu 


Ser 








340 










Gly 


His 


Ala 


Ala 


Gly 


He 


Gly 


Val 






355 










360 


Lys 


Val 


Glu 


Leu 


Ala 


Gly 


Asp 


Gin 




370 










375 




Thr 


Val 


He 


Lys 


Arg 


Leu 


Ser 


Gly 


385 










390 






Val 


Ala 


Thr 


Glu 


Glu 


Val 


Glu 


Lys 










405 








Ser 


Ala 


Ala 


Lys 


He 


Gin 


Glu 


Glu 








420 










Asp 


Ala 


Glu 


Lys 


Ser 


Glu 


Arg 


Ser 






435 










440 



Gly 


Glu 


Glu 


Glu 


Leu 


Thr 


Gly 


Leu 




10 










15 




Tyr 


Thr 


He 


Lys 


Asn 


Lys 


Trp 


He 


25 










30 






Arg 


Asp 


Lys 


Leu 


Cys 


Ser 


Phe 


He 










45 








Glu 


Cys 


Lys 


Arg 


Leu 


Leu 


Glu 


Ala 








60 










Asp 


Phe 


Asn 


Ser 


Trp 


Pro 


Val 


Asn 






75 










80 


Phe 


Asp 


Val 


Asn 


Asn 


Thr 


Asn 


LeU 




90 










95 




Ser 


Asp 


Gly 


Gly 


Ser 


Thr 


Met 


Ser 


105 










110 






Leu 


Gly 


Pro 


Val 


Leu 


Pro 


Pro 


Phe 










125 








Ala 


He 


Cys 


Leu 


Ala 


He 


Cys 


He 








140 










Leu 


Val 


Leu 


Leu 


Ala 


Phe 


He 


Val 






155 










160 


Asn 


Tyr 


Phe 


He 


Ala 


Ser 


Leu 


Ala 




170 










175 




Val 


Ser 


Met 


Pro 


Phe 


Tyr 


Thr 


lie 


185 










190 






Leu 


Gly 


Pro 


Met 


Leu 


Cys 


Asp 


Leu 










205 








Cys 


Leu 


Val 


Ser 


Gin 


Tyr 


Thr 


Val 








220 










Cys 


Ser 


Val 


Lys 


He 


Ala 


Ala 


Lys 






235 










240 


Arg 


Val 


He 


Tyr 


Met 


Val 


Thr 


He 




250 










255 




Phe 


Phe 


He 


Ser 


He 


Phe 


Gly 


Trp 


265 










270 






Leu 


Leu 


Pro 


Gly 


Gin 


Cys 


Ala 


Val 










285 








Asn 


Thr 


Ala 


Leu 


He 


He 


Gly 


Tyr 








300 










Phe 


Val 


Leu 


Tyr 


Ala 


Gly 


He 


Tyr 






315 










320 


Arg 


Ser 


Glu 


Ala 


Lys 


Gin 


Arg 


Lys 




330 










335 




Ala 


Gly 


Ala 


Met 


Ser 


Gly 


Met 


Ala 


345 










350 






He 


Glu 


Glu 


Lys 


He 


Leu 


Lys 


Thr 










365 








Thr 


Asp 


Leu 


Asp 


Ser 


Ala 


Cys 


Thr 








380 










Ser 


Gly 


Gin 


Ala 


Asn 


Pro 


Leu 


Ala 






395 










400 


Met 


Thr 


Pro 


Glu 


Gin 


Arg 


Arg 


Ala 




410 










415 




Ala 


Lys 


Lys 


Arg 


Glu 


Ala 


Ala 


Val 


425 










430 






Ser 


Ser 


Pro 


Ala 


Phe 


Asp 


Ser 


Asp 



445 
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Glu Glu Ser Ser 
450 

Leu Asn Asn Met 
465 

Gin Ala Ala Leu 

Thr Thr Asn Ser 
500 

Gin Gin His His 
515 

Glu Glu Met Arg 
530 

His Gin His His 
545 

Arg Thr Ser Cys 

Val Asp Leu Ser 
580 

Ser Pro Val Asp 
595 

Gin Gly Leu Val 
610 

Pro Thr Pro Leu 
625 

Ser Gly Asn Ser 

Glu Leu Arg Tyr 
660 

Ala Asn Ser Thr 
675 

Val Leu Pro Pro 
690 

Leu Asn Gin Ser 
705 

Gin Ser Leu Ser 

Gin Ser He Glu 
740 

Tyr Thr Gly He 
755 

Glu Ala He Cys 
770 

Pro His Arg Arg 
785 

He Leu Tyr Ser 

Thr Ser Thr Pro 
820 

Pro Pro Val Pro 
835 

Asn Thr Asn Lys 
850 

Gly Val Gly Gly 
865 

Asn Ala Gly Asp 

Ser Arg Asn Ser 
900 

Phe Lys Ser Lys 
915 



Val Asn Gin Ala 
455 

Arg Lys Arg Ser 
470 

Leu Ala Thr Arg 
485 

Lys Ser He Glu 

His Asn Gin Ser 
520 

Ser Leu His His 
535 

Gin Asn Gin Pro 
550 

Ser Thr Leu Ser 
565 

Ala Pro Pro Thr 

Leu Ala Pro He 
600 

Gin Thr He Leu 
615 

Ser Asp Asp Tyr 
630 

Glu Leu Ala Leu 
645 

Met Asp Glu Ser 

Thr Ser Pro Asn 
680 

Pro Pro Pro Ala 
695 

Leu Ser Gin Thr 
710 

Leu Asn Leu Asn 
725 

Glu Ala Val Ala 

Glu Asp Phe Ala 
760 

Leu Leu Asp Val 
775 

Ala Ser Ser Pro 
790 

Pro Met Pro Pro 
805 

Gin Thr Gly Thr 

Leu Glu Arg He 
840 

Met Pro Ser Thr 
855 

Gly Gly Glu Gin 
870 

Asp Asp Ala Asn 
885 

Ser Lys Arg Ala 

Lys Ala Leu Pro 
920 



Gin Gin Leu He 
4 60 

Ser He Gly Leu 
475 

Gly Lys Gly Asn 
490 

Ala Met His Gin 
505 

Pro Leu Gin Arg 

His Gin Asn Gin 
540 

Asn Pro Pro Leu 
555 

Gin He Ala Glu 
570 

Leu Leu Asn Thr 
585 

Glu Val Pro Ser 

Pro Pro Pro Asp 
620 

Ser Asp Arg Pro 
635 

Thr Tyr Asp Leu 
650 

Ser Ala Met Leu 
665 

Asp Ser Val Gin 

Arg Arg Asn Pro 
700 

Pro Ser Gin Asn 
715 

Pro Asn His Asn 
730 

He Glu Lys Arg 
745 

Lys Val Arg Arg 

Pro Gly Lys Leu 
780 

Met Glu Thr Ser 
795 

Met Pro Leu Ser 
810 

Pro Gin Ser Ala 
825 

Glu Glu Arg Glu 

Ser Gly Thr Thr 
860 

Asp Gly Val Gly 
875 

Arg Glu Glu Lys 
890 

Phe He His Ser 
905 

Leu He Leu Gly 



Thr Gin Gin Lys 

Val Phe Gly Ala 
480 

Leu Gin Lys Ser 
495 

Tyr His His His 
510 

Ala Gin Ser Lys 
525 

Gin Gin Pro His 

Asp Arg Pro Lys 
560 

His Asp Arg Leu 
575 

Ser Asp Pro Leu 
590 

Asp Gin Val His 
605 

Ala Phe Gin Cys 

Phe Gly Asn Ser 
640 

Met Thr Asn Ser 
655 

Ala Ser Val Thr 
670 

Gly Lys Pro Pro 
685 

Pro Lys Ser Asn 

Gin Ser Pro Ser 
720 

Gin Ser Gin Ser 
735 

Leu Leu Val Ser 
750 

Glu Ser Cys He 
765 

Ala Ala Asp Cys 

Ser Lys Asp Ala 
800 

Pro Lys Val Lys 
815 

Ala Gly Val Thr 
830 

Thr Ser Asp Ser 
845 

Ser Ser Thr Gly 

Ala Thr Lys Lys 
880 

Ser Leu Ala Ala 
895 

He Gly Lys His 
910 

Val Gly Gly Arg 
925 
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Gin 


Lys 


Ser 


Lys 


Ser 


Glu 


Asn 


Arg 


Ala 


Arg 


Lys 


Ala 


Phe 


Arg 


Thr 


lie 




930 










935 










y 4 u 










Ser 


Phe 


lie 


Leu 


Gly 


Cys 


Phe 


Val 


Ala 


Cys 


Trp 


Thr 


Pro 


Tyr 


His 


Val 


94 5 










950 










y oo 










you 


Leu 


Ala 


Leu 


Val 


Glu 


Gly 


Phe 


Cys 


Arg 


His 


Pro 


Pro 


Cys 


lie 


Asn 


Glu 










965 










970 










975 




His 


Leu 


Tyr 


Met 


Phe 


Ser 


Tyr 


Phe 


Leu 


Cys 


Tyr 


Ala 


Asn 


Ser 


Pro 


Met 








980 










985 










990 






Asn 


Pro 


Phe 


Cys 


Tyr 


Ala 


Leu 


Ala 


Asn 


Gin 


Gin 


Phe 


Lys 


Lys 


Thr 


Phe 






995 










1000 








1005 






Thr 


Arg 


lie 


Leu 


Lys 


Gly Asp 


Leu 


His 


Met 


Thr 













1010 1015 



<210> 94 
<211> 4172 
<212> DNA 
<213> Drosophila 



<400> 94 

gacatacgca 

cgattattca 

tcctgcaaat 

taaacaagag 

tagagctgtc 

agggcaaccc 

agaaggccaa 

gtggtacttc 

gaacaattaa 

tttatataaa 

aggagcctat 

tccaacttcc 

tagaaatcgt 

cccgtccctc 

atcggaatcg 

tttttttttt 

gcaatcagag 

ctcgctaggt 

cctgtgggag 

actgcatgtg 

cgtacccttc 

agcggccggg 

gaccaactac 

gccgctgggc 

cagaggtatt 

cttgtgcgcc 

tttcatagtt 

ttacttgggg 

tttaaaaagt 

ttttaaatct 

ataatgtaca 

aatccgaatg 

gtggcaaaga 

atttaatgaa 

atatttcgat 

gtctatgggc 

cctttgtagc 

tgccagcaag 

ataagaattg 

gatctggaaa 



ggcggagaat 
aagtcccttt 
caaatgcaat 
tgggcggagg 
ctatgactta 
gatcaaaacg 
tttggtggca 
taggaattca 
gttcaagtag 
tactaagtgc 
tcaaatagtt 
ggcgctttgg 
cctggtggcg 
gctcatttcc 
gaatcagaat 
ttgcgatctg 
tagagcgcac 
gcctacaacg 
gaggatgaga 
gccaggtgga 
gatgcgaaca 
aacatccttg 
ttccttatgt 
atccttacgc 
ttttgcagcc 
aggattggct 
tactctaacg 
caagggaaaa 
gacgaagaag 
taagatttct 
gggaattgtg 
ataatctctc 
cattcaggcc 
gctgctggtg 
caaagctggc 
cggcagggcg 
cagcctggcc 
atggcaacgg 
cattaggggc 
catctgccga 



tgatttgatt 
ttgccctgcg 
tatgtcgttc 
tggtctgaat 
tcgactttaa 
aaaaatcaaa 
attcctgttt 
ttaaatgtat 
ctatgtttag 
tattcttaat 
ttcattaatg 
tggcgtaaaa 
ccgtgagctc 
ggtgcgccgt 
cggcgcgagg 
cgttgcatat 
aataggaata 
acagcggtgg 
cgcagcgaac 
atgccaccgg 
actactgggc 
tctgcctggc 
cgctggccat 
tggtgaaagg 
accttttgtt 
tgtgcggcta 
aaaatgagtt 
aagagtgaga 
atgcaaggaa 
tatatttgca 
aaaaaatatg 
caagtggaag 
aggacctcct 
ggaatggcgt 
aaacgtaagt 
gagcggagct 
aggacgaacc 
aagcttccaa 
ggcaaacgag 
cacattgctg 



tctgttattg 
gcgaacggcg 
aatcgtctga 
aatttatgac 
gccaattgga 
gtgtgggtat 
gtgatacaat 
ccaatttgct 
tcatttcaaa 
taagcatctg 
ttttcattct 
tgtttgccac 
tcgtcctttt 
ttttacattt 
acatttccat 
gcgagggcta 
ataacgcgga 
cgacgattgg 
gactgctaat 
caatgcgacc 
cctattggcc 
catcgcctgg 
caccgatctc 
tgagtccttt 
gagggactca 
atcttagttg 
ggtggaaaac 
agttttgata 
acgcaaagat 
atttaaatgc 
gatatttacg 
agtataataa 
gcgatgtcgc 
tggcggtgtg 
aagcccggat 
ccatcgttag 
gctggcgcag 
agcctttgca 
cgctggtgaa 
tcctgcaatc 



ttgcttatca 
attgccctca 
atgctgaacg 
cgcccattaa 
gaggggggag 
cattgtgtat 
tcctggccgc 
gtcaaggtaa 
gttatgtctt 
cacttaaggt 
ttccctttct 
caatatccgt 
cctttcgcga 
gcaattcgaa 
tccttcccat 
attagcatgc 
atggaagagg 
agcagctcgg 
gccaccagtc 
atcagcgcga 
ctcgtcctcg 
gagcgccggc 
atggtcgccg 
gtcccattca 
atggacactt 
gaaaagttcc 
ttgatgaaaa 
gttgttaaat 
gtagggaata 
ggtaaatgcg 
gattcctttt 
tcaaagccag 
caacacgagg 
cagattgcat 
agaaaaatgg 
acaatttaat 
acagggaagc 
ggcgccggag 
tttcgctttg 
tgcactgaaa 



ctatcgattt 
gtttctgagt 
gtaaatcgag 
ttattactca 
taaggctggc 
ttcgtaaatt 
cattgtaccc 
tttacttggt 
tacaatttat 
tttctctata 
cgattgcagc 
tcggttgccg 
gtcctgaatc 
tcggattgaa 
tctcttttta 
ggcatttcca 
atgtgtatgc 
agcacctggt 
ggcataatca 
ccttcgagga 
tcctgggaac 
tgcagaatgt 
tgctggtcat 
aagtccggaa 
gagtaagtca 
ttttccataa 
ttaatattaa 
ttttaaggct 
tacaaaatta 
aatcaattga 
gctgcaactc 
cgacagaaac 
cacgagttgc 
tatgtctcag 
aatcgctggt 
aagcgcggcg 
ccagaaacaa 
caactgcagc 
acttagttcg 
aaaaaaacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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aatatattta 
gtataatttt 
atacttacat 
ccagacagga 
tccatttatg 
acgaatggct 
cctttttgtg 
tgggagccat 
gaaagggcga 
atgtagcaag 
cagtggcagt 
aggcaagaaa 
gacagaagct 
aattcaaagg 
attgcatcgg 
ttctattctc 
tcatgaccaa 
gtaatatcaa 
gtaataataa 
aatctaacaa 
tatttttagg 
attaatgcta 
actttcaacg 
aattaaacga 
cgcgtggaaa 
gttaattaat 
gacagtaaat 
agctcgcaaa 
aacagccgca 
gaaaattaat 



ttcaattatt 
tttctttcga 
tttttttaag 
aatggaaaag 
tgagtgcttc 
gatttgagga 
ccaacccctg 
tacaaacaca 
cgcacacaca 
aaggaggacc 
gacaggagca 
ggacgaaatc 
gaaatctcca 
cacaatggcg 
ttacgtaagt 
tgttgataag 
taggcgtatg 
aattaatcta 
tacaaattta 
actattgaaa 
acttaagttc 
ttatgctaaa 
aattttccac 
catgcgttga 
acataattaa 
aacagtaaac 
tgaattacaa 
tgtttgcaac 
aatataaaaa 
tgagcgcaaa 



tctatactta 
ttttttcgat 
ctaagtaact 
ctatttccgt 
ttccggtggc 
ctgcatttat 
cagatgttgc 
aacacaaaca 
cgcaaacaca 
aacaaatggc 
gttgttgctg 
gggatttgcc 
gcgaacagca 
caagtagcaa 
aacatgtggc 
tatgcaaatt 
agtaatatta 
acaaaatcat 
aattttcgta 
tggaatcaaa 
actgaaggct 
ctctgttgag 
ttgatcaatc 
tgccatttta 
tgcataatta 
gcaattactg 
cgtcccgtca 
ttaataaatt 
gaaaagccaa 
ttggtgaagc 



aagtaaaagt 
tgagcaataa 
tgaagctagc 
gactttcaaa 
agttaaacgg 
ctgagtgctg 
aacactgatg 
caaacacgag 
cacctggagc 
tgcttgttat 
tagttgcacg 
tgctgtcaag 
ggcattagat 
ctggatatga 
atacaatgtg 
caactttcaa 
gtagactaaa 
tcatggcttt 
aaaacaattt 
tagctgaaat 
tagcgggaaa 
tttccagatg 
gaccttttgg 
agcgcttgca 
atttcatttt 
catagcatag 
atggtcgtga 
atacttggga 
gcgaaaaaag 

tg 



catcaagttt 
aatatggttc 
tacacaaata 
ggcgttgctt 
tatttcttcg 
cttccgcctc 
acagttttgt 
cacgggcacc 
aatccttggc 
atcaagtgtg 
ggcacacgcc 
ggctttgcag 
cccaaggatt 
gtaagccgtg 
gcattgaaat 
tgaaagctga 
tgtttaaaaa 
ggttgttgac 
gatccgtacg 
aatatttata 
taagttttta 
catatatcct 
aaacgaaaca 
cttgcagtga 
aaaatcagca 
ccatcaaaag 
atatttaagt 
cgaataatga 
gcgcgaggca 



aaagctaatt 
atatatgtta 
ttataacata 
atttcgttgc 
attttcgaaa 
ctgctgccgt 
ttgtgcgctt 
aatacaagca 
aaagtgggca 
aaaaatattg 
cgccaactgg 
agggtcctgc 
aagctcttac 
cagtacatta 
ggcctatata 
actaatatgt 
cgctaaaggt 
tttctggagt 
tacacgcaca 
aatgaatata 
aggatgaaat 
ttgattttga 
agacccatgc 
attatgatta 
ttaaacgcca 
ccataaaaaa 
aaacgcgaaa 
taaacaagag 
cgagaagctg 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4172 



<210> 95 
<211> 1014 
<212> DNA 
<213> Drosophila 



<400> 95 

atggaagagg 

agcagctcgg 

gccaccagtc 

atcagcgcga 

ctcgtcctcg 

gagcgccggc 

atggtcgccg 

gtcccattca 

cgtggcaaag 

catttaatga 

gatatttcga 

tttgggagcc 

cagaaagggc 

caatgtagca 

tgcagtggca 

ggaggcaaga 

gcgacagaag 



atgtgtatgc 
agcacctggt 
ggcataatca 
ccttcgagga 
tcctgggaac 
tgcagaatgt 
tgctggtcat 
aagtccggaa 
acattcaggc 
agctgctggt 
tcaaagctgg 
attacaaaca 
gacgcacaca 
agaaggagga 
gtgacaggag 
aaggacgaaa 
ctgaaatctc 



ctcgctaggt 
cctgtgggag 
actgcatgtg 
cgtacccttc 
agcggccggg 
gaccaactac 
gccgctgggc 
cagagtggaa 
caggacctcc 
gggaatggcg 
caaacatgtt 
caaacacaaa 
cacgcaaaca 
ccaacaaatg 
cagttgttgc 
tcgggatttg 
cagcgaacag 



gcctacaacg 
gaggatgaga 
gccaggtgga 
gatgcgaaca 
aacatccttg 
ttccttatgt 
atccttacgc 
gagtataata 
tgcgatgtcg 
ttggcggtgt 
gcaacactga 
cacaaacacg 
cacacctgga 
gctgcttgtt 
tgtagttgca 
cctgctgtca 
caggcattag 



acagcggtgg 
cgcagcgaac 
atgccaccgg 
actactgggc 
tctgcctggc 
cgctggccat 
tggtgaaagg 
atcaaagcca 
ccaacacgag 
gcagattgca 
tgacagtttt 
agcacgggca 
gcaatccttg 
atatcaagtg 
cgggcacacg 
agggctttgc 
atcccaagga 



cgacgattgg 
gactgctaat 
caatgcgacc 
cctattggcc 
catcgcctgg 
caccgatctc 
tgagtccttt 
gcgacagaaa 
gcacgagttg 
ttatgtctca 
gtttgtgcgc 
ccaatacaag 
gcaaagtggg 
tgaaaaatat 
cccgccaact 
agagggtcct 
ttaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1014 



<210> 96 
<211> 337 
<212> PRT 
<213> Drosophila 
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<400> 96 



Met 


Glu 


CjJLU 


Asp 


1 








Gly Asp 


Asp 


Trp 








20 


Glu 


Thr 


Gin 


Arg 






35 




His 


Val 


Ala 


Arg 




50 






Phe 


Glu 


Asp 


Val 


65 








Leu 


Val 


Leu 


Val 


Ala 


He 


Ala 


Trp 








100 


Met 


Ser 


Leu 


Ala 






115 




Leu 


Gly 


He 


Leu 




130 






Val 


Arg 


Asn 


Arg 


145 








Arg 


Gly 


Lys 


Asp 


Arg 


His 


Glu 


Leu 








180 


Val 


Cys 


Arg 


Leu 






195 




His 


Val 


Ala 


Thr 




210 






Tyr 


Lys 


His 


Lys 


225 








Gin 


Lys 


Gly 


Arg 


Trp 


Gin 


Ser 


Gly 








260 


Cys 


Tyr 


He 


Lys 






275 




Cys 


Cys 


Cys 


Ser 




290 






Gly 


Arg 


Asn 


Arg 


305 








Ala 


Thr 


Glu 


Ala 



Asp 



Val Tyr Ala Ser 
5 

Ser Ser Ser Glu 

Thr Thr Ala Asn 
40 

Trp Asn Ala Thr 
55 

Pro Phe Asp Ala 
70 

Leu Gly Thr Ala 
85 

Glu Arg Arg Leu 

He Thr Asp Leu 
120 

Thr Leu Val Lys 
135 

Val Glu Glu Tyr 
150 

He Gin Ala Arg 
165 

His Leu Met Lys 

His Tyr Val Ser 
200 

Leu Met Thr Val 
215 

His Lys His Lys 
230 

Arg Thr His Thr 
245 

Gin Cys Ser Lys 

Cys Glu Lys Tyr 
280 

Cys Thr Gly Thr 
295 

Asp Leu Pro Ala 
310 

Glu He Ser Ser 
325 



Leu Gly Ala Tyr 
10 

His Leu Val Leu 
25 

Ala Thr Ser Arg 

Gly Asn Ala Thr 
60 

Asn Asn Tyr Trp 
75 

Ala Gly Asn He 
90 

Gin Asn Val Thr 
105 

Met Val Ala Val 

Gly Glu Ser Phe 
140 

Asn Asn Gin Ser 
155 

Thr Ser Cys Asp 
170 

Leu Leu Val Gly 
185 

Asp He Ser He 

Leu Phe Val Arg 
220 

His Glu His Gly 
235 

Gin Thr His Thr 
250 

Lys Glu Asp Gin 
265 

Cys Ser Gly Ser 

Arg Pro Pro Thr 
300 

Val Lys Gly Phe 
315 

Glu Gin Gin Ala 
330 



Asn Asp Ser Gly 
15 

Trp Glu Glu Asp 
30 

His Asn Gin Leu 
45 

He Ser Ala Thr 

Ala Leu Leu Ala 
80 

Leu Val Cys Leu 
95 

Asn Tyr Phe Leu 
110 

Leu Val Met Pro 
125 

Val Pro Phe Lys 

Gin Arg Gin Lys 
160 

Val Ala Asn Thr 
175 

Met Ala Leu Ala 
190 

Lys Ala Gly Lys 
205 

Phe Gly Ser His 

His Gin Tyr Lys 
240 

Trp Ser Asn Pro 
255 

Gin Met Ala Ala 
270 

Asp Arg Ser Ser 
285 

Gly Gly Lys Lys 

Ala Glu Gly Pro 
320 

Leu Asp Pro Lys 
335 



<210> 97 
<211> 9268 
<212> DNA 
<213> Drosophila 



<400> 97 

tataaatgaa 

tggaatcaaa 

gccatcaata 

gatcaaagtt 

tcgatccatc 

ttttcccagc 

ccaaatctca 



ttctgaaaaa 
aatttcccaa 
tttattagaa 
atacttaaag 
ttcccccgaa 
ggccatacgg 
gggcacacgc 



tgtataaaat 
agaaatggtt 
cattgacctt 
tcgtgcagtt 
cttatcccaa 
attcatgttc 
aacaacacag 



gaaatggact 
ttacatttca 
aaagcaattt 
gcaactgaaa 
tttctgttgg 
gaccagattc 
taaacacaag 



catcattagg 
gaaagtgacc 
gtatagtcgt 
ctgattcatt 
ctatctcgtt 
tggattcaca 
tatcactcac 



tcaatttgtt 60 
cttaattttc 120 
cactttaaca 180 
ctacgcctta 240 
aggaaattcc 300 
aatcccgcgc 360 
cgaatgtgtg 420 
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ggagattctc 
aatggaaaat 
aaatcttcta 
ttttcgtagt 
ttatgacgtc 
gtgggccagt 
cagctcacga 
ggcggggagt 
caatcgtaaa 
agtcggccat 
ttcttttaca 
tgtactcttc 
atcgctacga 
aattgttttc 
caaggtgagt 
tatagtaccc 
ctagttaacc 
attaaaatcc 
tacccaaatt 
ggtgctggtg 
tgtatgcttc 
tttggcccgg 
atgggccagt 
ttctgacccc 
aatatgattt 
tgaaaacttt 
gcaaggggaa 
catgtgccgg 
aagccgcagc 
gcgggggggg 
caaccttaca 
ccatgaatcg 
gctcgactag 
gccccgttgc 
aaccactttt 
tttaaacatg 
taaattgctt 
ttggctaaaa 
gattcaattt 
ttattcagcg 
agtaattttc 
aatatttaat 
tggtgatggt 
cgtgcattta 
gcctcagctg 
gccaagtgcg 
tgtaaagcgt 
acgcagagaa 
caagatattt 
ctaggagaag 
gtttattagc 
aaactgtata 
cgccttttga 
tcaacggata 
tgagctatcg 
ttatgtagca 
gagatgctga 
cccaccaaat 
cagcgatatg 
ggagtttacc 



ggccacacaa 
aaattaattt 
ggcatggttt 
gcacacacaa 
caacagctgt 
tggcctgaaa 
atgaactgtg 
attaaataaa 
tcgtgtgtac 
ggaacattta 
ggctactttc 
tgcacggcca 
tacccaatgc 
gtttggcttc 
aataatgcta 
ttttggcatt 
tagtactcaa 
aaagcgagac 
aattccgtgc 
aatggaactt 
tacattccac 
cagcgccaga 
ggatggctcg 
cactctcacc 
tcagtttgcc 
acacgacact 
agtgctcaaa 
atactccccg 
aatgcacaca 
gggggggcca 
aataggatga 
agggccagct 
ttttaaaacc 
gtagggtagt 
tgaaatttcc 
tagctaataa 
cctgcatccc 
acaatccaag 
attttaagca 
ctcccatttc 
tgggggacaa 
ggaatgcttc 
gactgcagtc 
ggggcgattc 
caggcattag 
actgtgttag 
ccgcatttcg 
aaatcttcgc 
aatgcaagtt 
aatttctctc 
gaatttgtga 
tccgtttaac 
aaccgggtcc 
cggatctcag 
agtttgcgag 
taaaggagcc 
aactatcccg 
ccgaattttc 
ccgccaccca 
accagcaggc 



gctacagaga 
tattagttct 
gtaaatcatc 
ttgtattttt 
gcaaaacgat 
tatgttgcat 
gctcccggga 
aatggctaga 
acataggtag 
caagctgacc 
ctttgggctc 
gcatcatgca 
gatttggccg 
tgagcatcgc 
ttcttttgtt 
ctatttccct 
aatctaaagt 
aagtgtctga 
tcgccctgct 
gccagatacc 
tgggcgtgat 
atctgggtgg 
gccaggcacc 
cagaagtgtc 
taaaatttgc 
cgacaaaagt 
agtggggagg 
aatccgtgtc 
aaatcgacat 
acaccccagg 
aaatccagcc 
acgtaaatgg 
gaagttttaa 
ttttaaaaac 
ataacacttt 
ataaaaagtg 
taattacccc 
atccttcgct 
taactaagtg 
tcatccaatt 
gctgtggatc 
tggtggtggc 
gaatcgcgtg 
gcaggacgcg 
ggtgtgtatg 
tgccgtggat 
gtcaggcacg 
atttattaca 
agaatagctt 
agtgcctatg 
cttttcgtag 
cccgtgttcg 
aggtaacgcc 
cactttggac 
ccagtgttgc 
aacgcccacg 
aggcctggag 
cctatcgaac 
ccagcagtcc 
atcgcccaaa 



aacaaaccca 
ttataagata 
cttatatctt 
atgggaatag 
ggccacagca 
aagctccccc 
aaatatggcc 
gaacattgca 
aaagtattgc 
aagaaataca 
ggagcactgc 
cctgtgcacc 
gaataaaaca 
catgagcttg 
tatattctat 
taatctaata 
gaaaattaca 
aatattttgc 
atctttcctt 
ggatccggtg 
gctgctgaca 
tggacagcag 
ggccttgggt 
cacgaagagc 
attttgttct 
agagagtagg 
ggtgacaaca 
cctgggcttt 
tgtcgtcgac 
acaatcaccc 
catatgtgta 
cttgtaatgc 
gcaccaactt 
tcaagagctg 
tcagcgcatc 
aacagtctgt 
tctcgttact 
tccattatcc 
gctcctaatg 
ggattgcctt 
tctggatgtt 
gaatggtgaa 
cactgcaggg 
gtgtccccga 
tatgtatgtg 
ttgggcagtt 
tagtcggatt 
caaagaatca 
caagcactat 
ttgaaggggc 
cgcactattg 
tgctccacag 
tcctcggtgc 
aaccacgagc 
agtgcaaatt 
accatgacgg 
tcggtggcct 
cacctgcagc 
caccaccaca 
ggtcgccacg 



attcaaacta 
ctaagactta 
tcgatattat 
ggcgagagtg 
tcgggaatag 
ctcttggcaa 
aaaaaaaata 
ttgtttgccg 
agccacttaa 
atgaagaatc 
ctcacctgga 
atctccgtgg 
cgtcggcggg 
cccttgagct 
attccaacaa 
gtgaaaaccc 
aacactgtgc 
gcatacatat 
tgattccaga 
tacaagctgg 
tattgcctga 
acggcggcgg 
aaggatgcta 
tgctgcttct 
ccggattgca 
aaatgggcgg 
ctagacaaac 
ctggagccca 
attgaaccgg 
attccaccca 
gatagctaat 
tctaagagct 
ttcaacgaag 
cgttaacttt 
tacaaaaaag 
tattttagat 
agcccagtta 
cttcacaatt 
ccattgtctg 
gagccccaaa 
gaaagtcaat 
tgctgaatgg 
catttttgcg 
ttcgtctaag 
tgggtgcgca 
attttaagcc 
tgccgtattg 
atgtgtctga 
ttcataaggt 
atttaagtgt 
gactgaatgc 
aacgacgatg 
tgcacgcaca 
tctggctgcc 
accatatata 
ccttgcatca 
cgtcatccac 
tgcagtaccc 
accaccagca 
gaaccacagt 



aaaatttcta 
tactagttgg 
tctaagactt 
gtgcgctgag 
ccgctcatat 
tatcatttgg 
aaaaaaaaaa 
ccgaccaagc 
tacagtcata 
ggtttttttt 
tctgcctgga 
acagatattt 
tcactctcaa 
tgatgtactc 
tgcgtcaaaa 
attaaattcc 
caagtcaata 
gtcctttact 
atcacgcctc 
tcggctccat 
ccgtccgact 
ccactcccgg 
gacaccacac 
gctgcagctt 
ctttttgcac 
atcaggatca 
aatgcagaga 
aagtgcgact 
atggaatggg 
cttgcacaca 
ttccacgttg 
cttccatagt 
cctcgttaca 
tagttttatt 
aggaggaatc 
ttaaagagtc 
actggagctt 
caatagacca 
cccgcagaat 
ggggacagag 
acagttgggg 
taaatgctga 
aaaatcgtag 
tggatttgaa 
tctgtgagcg 
aaatctctgc 
tactttttac 
ttaatataag 
ctgaactgga 
gctcgtgcga 
aatttaatgg 
cacctggcgg 
ctcggccaat 
cgattccagg 
ttattcatgt 
gttcggtgcc 
caccggctca 
cagcagccca 
gcatccggcc 
tctgggtctg 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 
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tccaccacca 
atgggcggcg 
aataatgggg 
caattttctc 
agaatctctg 
cttttgcggg 
tcgggcaggt 
aacagctgcg 
ttttgttgtc 
tgaccacacc 
tctccttttc 
ttattgtcct 
ctacgttatc 
aggcagaggc 
tgaggccttt 
gctgggataa 
atattttgaa 
aagtaaatag 
t tgcgtatga 
aggcactcaa 
atttagtctg 
gatatcttga 
tgactgcaca 
atatcatata 
agcactatgt 
tggctttggc 
cgaaaagaaa 
ctcatgcggc 
agacttaaaa 
cgaaggcggt 
gggaaacaat 
ggccccaaat 
ttatgcgcac 
ccttatcctg 
ccacttggcc 
cgccccacac 
cggcaggaca 
caaaatgagg 
taaagctaca 
aaaatcccgc 
aataatgcaa 
gcggtaaacc 
aagaaacgag 
actgcattgt 
taattaaaaa 
actaaatcaa 
aaccacatgg 
gaccactttt 
ttatatttat 
cgattgcgag 
cgtgccgggc 
cacaagcgcc 
gagagcctcg 
acgaagtcga 
tgccacatag 
tgtccgtatt 
atttcgtccg 
atggagctac 
ggcataacag 
cactctgctc 



cgctgggcat 
aacagagcga 
tctaaaaata 
aagatgtaat 
ttgtcacaaa 
ccacaatttt 
accttggaaa 
atgggcaaac 
gctgtctcag 
aaaagccaag 
catttattta 
tccgccttcg 
ctttttcttt 
actgcctttt 
tgccaaaatg 
gtgatgcgac 
attaccgaga 
ggcgcttcat 
tatgaaattc 
cccaattcga 
caaaatccat 
atcccccgtt 
ttttatctcg 
tcattgtgac 
gccacattgt 
tccggcttcc 
cgggacgagg 
aaagttgaca 
tcagctcaac 
atcctgcgat 
aacaaagttg 
accacctccc 
gtctcgtgca 
cgtcttggca 
ttctgtctgc 
cgtcttggcc 
gaaaaccgca 
cggtacagaa 
catatgagct 
cagtaaagta 
gaatttttca 
acaaagagca 
ctgaccacaa 
tttaggcacc 
tgtacgacgc 
taaatcggtg 
ttccttttgt 
ttaaaatgaa 
ttaatcagta 
cgtataaaaa 
atgaggacgc 
ataacagtct 
atgacgacga 
agatctgcca 
ccgggcagaa 
ttggggacag 
ccttccgagt 
tcatgtgcag 
gtggaccggc 
acaatcaggg 



cgagcgggag 
tgggtgagtt 
tactatctaa 
atgatgctca 
taatattaac 
cactagaatt 
aacgtaaagt 
cgttttgcca 
ccggttttcg 
aaaatgctca 
gttttctttt 
acttgttgtg 
gtgtcggtgc 
tatttggcat 
tgtttttcat 
gagaaattta 
atttaataag 
gtttatgggc 
tatggaagag 
tattgtacac 
ccaataaaag 
cgattttaag 
ctcatatttt 
tgtgcgcctt 
ttgacagttg 
tattgtccgc 
tcaaagtcga 
aatgatgctg 
acttcgccac 
gtcctgatga 
gcatcagctc 
cactccccca 
aacaaatcaa 
gcaccgagca 
taacgccttc 
aaataaagtt 
aaggcagctc 
aaaaaatcga 
aagtatttct 
cagtaagccg 
agtgtaccac 
aacacaaaaa 
agcgtgctgg 
tcaccttcga 
cgaaggggag 
gggtatgaca 
tctagtctgc 
atgaaaataa 
aagttctttt 
gcgacgaaga 
agacgaggat 
gcccaagaac 
cgacgaggag 
ctcggacact 
tcatcagtcg 
accgctccag 
gaccaccacc 
tagtggcaat 
aggagcggct 
cagctcgctg 



agcaccagga 
cataaatagt 
aagtctggaa 
aacataatta 
acagacaatt 
gtgtcaaatc 
tttcgtggct 
gagtgatcct 
agtgccagta 
cctcgagtcc 
ttcgttagct 
aaatttttcg 
aaactgtttt 
tcgagccgct 
taatttccta 
attaaaaaat 
aactggaaac 
attcaggtgg 
cctggcaaat 
gcattttatt 
catgaatttt 
cttgaataaa 
tcccatttaa 
tcagccaaat 
aacgaatttg 
aatggatggg 
gtggttttta 
gcgatattct 
ataagagtat 
gcttggcagc 
ggctttgttt 
ttgcttagcc 
aaacgcgaag 
tcccgatccc 
attagcagct 
ggcgcgcatt 
gcaaatttat 
tgtacaatat 
gcaattaata 
aactaacata 
aaacaacgga 
atgttgccta 
cttttgttca 
atagagtttt 
aaggtgaaag 
tagacatatt 
tttaatttga 
tcatttaagg 
cctttcagga 
gcttcgtcag 
gtggaaacgc 
gcactctatc 
gaggagagcg 
gagctagatc 
gactacctgc 
aattgtgtga 
acgacggcgg 
aaaaagtcat 
ccagccacag 
acagttcaga 



attccctggc 
aatataaata 
gccaaccaaa 
aattgcttca 
aaattctttt 
ccttgaaaag 
cttcacctga 
ttctagtcgc 
ctgcgcttgg 
tttttgtgac 
catttttgtg 
taataattaa 
atttcttttt 
caagtgattt 
aatttagtct 
tgagataata 
atggctaaac 
aactaatatt 
tctaaaaata 
ttgatttaat 
tcactgcaac 
aggcagctca 
ccaatccctt 
gcttggtatt 
cacaatcaat 
atgccacttg 
attgttgcac 
aatttttgaa 
atattatatg 
tgtttaattc 
cacttgcata 
ctgttccaac 
gatgcataat 
tgtcctccct 
aaccacgccc 
tgcatgcagg 
gcgctttatt 
gtttatatat 
atataaaatt 
ccaaatataa 
gcaagaaaaa 
cttcccggca 
attgccgtta 
gccaattggc 
aaaatggctc 
ttatatattt 
ttcaggaaaa 
actttgactg 
ctctttcaca 
ctgttcctgg 
ccacgtcaac 
cgcgacacac 
cgcaacagaa 
ctccgcagaa 
agctgccctc 
aatcggcgga 
tgagcagttc 
tgacctccag 
ttggctccac 
gcgatggtag 



cagcagtcgc 
agtgagtcca 
attatggtta 
ccacttaacc 
aactttatcc 
catagacaat 
caaaaggaca 
tcgcaggacc 
ccaagtgtgc 
tcgtttactt 
gcattgcctt 
ttttttttcg 
tttggccaaa 
tggcgtcgac 
actggcttat 
gaagacgaat 
tggcatattc 
aactatttaa 
cttttatttt 
tggttatcat 
tgcgaacatt 
acttgcgaaa 
atgcatgttt 
gggtattggg 
aaaatggatt 
gcataaagga 
aatttgccgt 
cttggctggg 
tatgtgtatg 
ctttgtgagg 
atgagtagct 
gcatctcaat 
gagctgacat 
gcagccacca 
ccaacgccac 
gcaaaacggg 
aaaatgaatg 
aactttaaat 
aaaaaaaaaa 
tatagtaaat 
atgttagaag 
aaacaggtga 
gcgagtgttc 
tgcgcaattt 
ggcattgggt 
tcattattaa 
tcgtttaata 
tagtatgtca 
gctttcccaa 
acgtgagaaa 
gctgcgccgc 
cactgtacag 
ggatcagcag 
ggcgaagcag 
ggtttgcact 
ggtgaagatt 
gcccagcgag 
tgtgagtgcc 
cctcagtcca 
tgggtacttg 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
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gctgctccgg 
gtggtcacgt 
acatccctgc 
agaagatcgg 
ggtggcaaga 
cgatctcatc 
atcatcaggc 
atcctctggt 
gaacgaatta 
gtgaacccca 
ctgctgtgcc 
aaaccgggca 
actgctccgg 
cggcaaaggc 
gaaaggacga 
gtccttgggg 
ttgaaaacac 
gtcggatccc 
acgagcatgc 
gaagatgcac 
tactttatag 
aatctgttga 
aaaataaact 
aattcttaaa 
gtttggcatt 
aagaatttca 
gacttttgtt 
cattcatttt 



gaactccttg 
gggat tcaag 
tgttgacgcc 
ccaccttgag 
ccaggacgac 
actcgcggaa 
tggagcagaa 
cgccattctt 
gccactgggt 
tcttctacac 
gctattcgag 
aactgaagac 
tgcggggcgt 
cggtcatttc 
taatgcccac 
cttaaattct 
atgtcccagg 
cgcaggattg 
tactcacccc 
ttaattgcgg 
caacgctctt 
tatttgatag 
tgaaaattca 
aataaagcgc 
tggcaattat 
tcggtatcca 
gtttgtttgt 
gttttatttc 



tcctggccga 
tcgtcatcgt 
caccaaaaca 
gagtcaccag 
aacatcctca 
ctccagcgtg 
ggccaccaag 
cgtcctcaat 
cttcgacgtg 
catcttcaac 
cacgagtgcc 
gaatacccac 
tctgaaaaga 
gcctgcggtc 
ccgcactcca 
tatgtgcctc 
cgtaagaagt 
ccaagggtcg 
tggtggttcc 
aatgtttgct 
cagttatcga 
cattttccgc 
cgtaaattta 
caatgtcgct 
gcattaaatt 
ttaaagaaac 
gtggttggac 
tgtggcca 



cgaaagttga 
cggcggggca 
gcgaccacct 
aatatgaact 
ccctgcatgt 
atctccagaa 
gtgctgggag 
ctcctgccca 
gtcacctggc 
aaggtgttcc 
tggcgaccga 
aagtgcgcca 
agtgtggcag 
caccgcacaa 
caccacgcac 
taattaattt 
ggggccaagt 
gggattaaac 
ttccatttcc 
ttcgagcaac 
caaagttctg 
gaaattcctt 
aagattgtat 
tcgcaaactt 
tatggcaaat 
cgacacgtac 
cggcagattg 



gcatctcaaa 
gcagttttgg 
ccaactcatc 
accagggagc 
tgcagcgtca 
actcctcgcg 
tggtgttctt 
cggtgtgcgc 
tgggctacgc 
ggcaggcgtt 
gcagataaca 
gtgtggattt 
tggcaagtga 
cgtgtcgtat 
tccgcactgc 
caaattcaat 
gaattagata 
tagccacaga 
ctcagctctc 
gtaataccgg 
gctcaccgaa 
ttagattcac 
atataaagtt 
cgatcgataa 
tcactgcccg 
acaaatccac 
ctatttttat 



gaccgcctcg 
cggagtgcga 
cacaccgttg 
gggcgaatgt 
gcagacggtg 
ccacggcagg 
tacgttcgta 
ggagtgcgag 
tagctccatg 
caagaaggtc 
gacgcccgtt 
tcaggattcc 
catataaata 
acgcaatctc 
tcacacgtgt 
ttgcatgcca 
ttgaagtgga 
caacagcata 
agcaaacggt 
caaacacatt 
agttcatgtc 
tacgtactcg 
caacttttga 
acgctatttg 
ccaatgcaca 
ggcaatttgt 
ttgaaaagtt 



7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9268 



<210> 98 
<211> 1251 
<212> DNA 
<213> Drosophila 

<400> 98 

atgaagaatc ggtttttttt ttcttttaca 
ctcacctgga tctgcctgga tgtactcttc 
atctccgtgg acagatattt atcgctacga 
cgtcggcggg tcactctcaa aattgttttc 
cccttgagct tgatgtactc caagaatcac 
ataccggatc cggtgtacaa gctggtcggc 
gtgatgctgc tgacatattg cctgaccgtc 
ggtggtggac agcagacggc ggcggccact 
gcaccggcct tgggtggacc ggcaggagcg 
ccacactctg ctcacaatca gggcagctcg 
ttggctgctc cgggaactcc ttgtcctggc 
tcggtggtca cgtgggattc aagtcgtcat 
cgaacatccc tgctgttgac gcccaccaaa 
ttgagaagat cggccacctt gaggagtcac 
tgtggtggca agaccaggac gacaacatcc 
gtgcgatctc atcactcgcg gaactccagc 
aggatcatca ggctggagca gaaggccacc 
gtaatcctct ggtcgccatt cttcgtcctc 
gaggaacgaa ttagccactg ggtcttcgac 
atggtgaacc ccatcttcta caccatcttc 
gtcctgctgt gccgctattc gagcacgagt 

<210> 99 



ggctactttc ctttgggctc ggagcactgc 60 
tgcacggcca gcatcatgca cctgtgcacc 120 
tacccaatgc gatttggccg gaataaaaca 180 
gtttggcttc tgagcatcgc catgagcttg 240 
gcctcggtgc tggtgaatgg aacttgccag 300 
tccattgtat gcttctacat tccactgggc 360 
cgacttttgg cccggcagcg ccagaatctg 420 
cccggatggg ccagtggatg gctcggccag 480 
gctccagcca cagttggctc caccctcagt 540 
ctgacagttc agagcgatgg tagtgggtac 600 
cgacgaaagt tgagcatctc aaagaccgcc 660 
cgtcggcggg gcagcagttt tggcggagtg 720 
acagcgacca cctccaactc atccacaccg 780 
cagaatatga actaccaggg agcgggcgaa 840 
tcaccctgca tgttgcagcg tcagcagacg 900 
gtgatctcca gaaactcctc gcgccacggc 960 
aaggtgctgg gagtggtgtt ctttacgttc 1020 
aatctcctgc ccacggtgtg cgcggagtgc 1080 
gtggtcacct ggctgggcta cgctagctcc 1140 
aacaaggtgt tccggcaggc gttcaagaag 1200 
gcctggcgac cgagcagata a 1251 
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<211> 416 
<212> PRT 
<213> Drosophila 



<400> 99 



Met 


Lys 


Asn 


Arg 


Phe 


Phe 


Phe 


Ser 


Phe 


Thr 


Gly 


Tyr 


Phe 


Pro 


Leu 


biy 


1 








5 










10 










15 




Ser 


Glu 


His 


Cys 


Leu 


Thr 


Trp 


lie 


Cys 


Leu 


Asp 


Val 


Leu 


Phe 


Cys 


Thr 








20 










25 










30 






Ala 


Ser 


He 


Met 


His 


Leu 


Cys 


Thr 


He 


Ser 


Val 


Asp 


Arg 


Tyr 


Leu 


Ser 






35 










40 










45 








Leu 


Arg 


Tyr 


Pro 


Met 


Arg 


Phe 


Gly Arg 


Asn 


Lys 


Thr 


Arg Arg 


Arg 


Val 




50 










55 










60 










Thr 


Leu 


Lys 


He 


Val 


Phe 


Val 


Trp 


Leu 


Leu 


Ser 


He 


Ala 


Met 


Ser 


Leu 


65 










70 










75 










80 


Pro 


Leu 


Ser 


Leu 


Met 


Tyr 


Ser 


Lys 


Asn 


His 


Ala 


Ser 


Val 


Leu 


Val 


Asn 










85 










90 










95 




Gly 


Thr 


Cys 


Gin 


He 


Pro 


Asp 


Pro 


Val 


Tyr 


Lys 


Leu 


Val 


Gly 


Ser 


He 








100 










105 










110 






Val 


Cys 


Phe 


Tyr 


He 


Pro 


Leu 


Gly 


Val 


Met 


Leu 


Leu 


Thr 


Tyr 


Cys 


Leu 






115 










120 










125 








Thr 


Val 


Arg 


Leu 


Leu 


Ala 


Arg 


Gin 


Arg 


Gin 


Asn 


Leu 


Gly 


Gly 


Gly Gin 




130 










135 










140 










Gin 


Thr 


Ala 


Ala 


Ala 


Thr 


Pro 


Gly 


Trp 


Ala 


Ser 


Gly 


Trp 


Leu 


Gly 


Gin 


145 










150 










155 










160 


Ala 


Pro 


Ala 


Leu 


Gly 


Gly 


Pro 


Ala 


Gly 


Ala 


Ala 


Pro 


Ala 


Thr 


Val 


Gly 










165 










170 










175 




Ser 


Thr 


Leu 


Ser 


Pro 


His 


Ser 


Ala 


His 


Asn 


Gin 


Gly 


Ser 


Ser 


Leu 


Thr 








180 










185 










190 






Val 


Gin 


Ser 


Asp Gly 


Ser 


Gly 


Tyr 


Leu 


Ala 


Ala 


Pro 


Gly 


Thr 


Pro 


Cys 






195 










200 










205 








Pro 


Gly 


Arg 


Arg 


Lys 


Leu 


Ser 


He 


Ser 


Lys 


Thr 


Ala 


Ser 


Val 


Val 


Thr 




210 










215 










220 










Trp 


Asp 


Ser 


Ser 


Arg 


His 


Arg 


Arg 


Arg 


Gly 


Ser 


Ser 


Phe 


Gly 


Gly Val 


225 










230 










235 










240 


Arg 


Thr 


Ser 


Leu 


Leu 


Leu 


Thr 


Pro 


Thr 


Lys 


Thr 


Ala 


Thr 


Thr 


Ser 


Asn 








245 










250 










255 




Ser 


Ser 


Thr 


Pro 


Leu 


Arg 


Arg 


Ser 


Ala 


Thr 


Leu 


Arg 


Ser 


His 


Gin 


Asn 








260 










265 










270 






Met 


Asn 


Tyr 


Gin 


Gly 


Ala 


Gly Glu 


Cys 


Gly Gly 


Lys 


Thr 


Arg 


Thr 


Thr 






275 










280 










285 








Thr 


Ser 


Ser 


Pro 


Cys 


Met 


Leu 


Gin 


Arg 


Gin 


Gin 


Thr 


Val 


Arg 


Ser 


His 




290 










295 










-inn 

JUL) 










His 


Ser 


Arg 


Asn 


Ser 


Ser 


Val 


He 


Ser 


Arg 


Asn 


ber 


Ser 


Arg 


His 


Gly 


305 










310 










315 










320 


Arg 


He 


He 


Arg 


Leu 


Glu 


Gin 


Lys 


Ala 


Thr 


Lys 


Val 


Leu 


Gly 


Val 


Val 








325 










330 










335 




Phe 


Phe 


Thr 


Phe 


Val 


He 


Leu 


Trp 


Ser 


Pro 


Phe 


Phe 


Val 


Leu 


Asn 


Leu 








340 










345 










350 






Leu 


Pro 


Thr 


Val 


Cys 


Ala 


Glu 


Cys 


Glu 


Glu 


Arg 


He 


Ser 


His 


Trp 


Val 






355 










360 










365 








Phe 


Asp 


Val 


Val 


Thr 


Trp 


Leu 


Gly 


Tyr 


Ala 


Ser 


Ser 


Met 


Val 


Asn 


Pro 




370 










375 










380 










He 


Phe 


Tyr 


Thr 


He 


Phe 


Asn 


Lys 


Val 


Phe 


Arg 


Gin 


Ala 


Phe 


Lys 


Lys 


385 










390 










395 










400 


Val 


Leu 


Leu 


Cys 


Arg 


Tyr 


Ser 


Ser 


Thr 


Ser 


Ala 


Trp 


Arg 


Pro 


Ser 


Arg 










405 










410 










415 





<210> 100 
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<211> 8877 

<212> DNA 

<213> Drosophila 



<400> 100 

gacctttaag 

tcattaaact 

ctatgggctg 

aaatggaggt 

tggaaagaat 

tcctgtcagc 

tgttcgcaga 

aactgttgca 

tttgccatat 

ttttgagtac 

aaaccgcaat 

catttattta 

gttacattca 

gctctcagct 

tgcttgtttg 

attagtacta 

gaaagttcag 

gtccttgctg 

tctggttctc 

ggaactcatg 

gtggagtgaa 

cgattcccgc 

agatatcccg 

gggggaacca 

tgcgcagaaa 

gcaatctagc 

atgtacaact 

tattcttaag 

ttaagctgaa 

acctccgagt 

aacagatctt 

gagtaatacc 

agccagcacc 

agctttccta 

ctaaaagaga 

atgtcggatg 

gggtgattca 

tctttttttc 

cagcagaaat 

cacatgggtc 

gaagactact 

tgtaggtgac 

ctgccagcag 

cagcctatgg 

agcatgtaca 

tggaacaggg 

ttaagtggta 

gactagcacc 

cagtctcata 

gtacgtgaag 

caaagacaaa 

ccagggtatc 

aaaacaatca 

atggtgggca 

gcgcaggcat 



cagactaagg 
tcccaccgtt 
gtttttcata 
gtgagaaaga 
gaatattctc 
ttcataaaat 
ttatcgtgaa 
tacttttagg 
ttgtagccat 
atcaaagccg 
gataattccc 
gatgctagaa 
acagctacag 
gctcacatat 
ctccgcttgg 
gctaactaat 
tctcttaaga 
gaattcgaga 
accctcatca 
gcatttgccg 
agtggggcat 
ttgccacgcc 
tcacttctta 
ccgcctttcg 
aagttagtgt 
attgccacac 
tagttgtttc 
tgtacttaca 
tggtgcgcgt 
tgaggaaaca 
tagaaaaaaa 
gctcgcagga 
accaggtagg 
tactgacgag 
agcaacccga 
ataaattgta 
taaatgtgag 
gcatatcaat 
gtgagattga 
tcctgcatcc 
ctccaggggt 
cgtcagactc 
gagaaagact 
aggcgtagat 
ggtacaaacc 
tgatcagagc 
gccattagtc 
cctttcaggc 
atgctcacct 
aacaagttgg 
tacaaaggac 
ggggtcacca 
gcgccagcga 
gctccacgat 
cgtagttggt 



ggttgaggca 
ttgtggagcg 
atcaagcaaa 
gcctttatta 
agtgagcaag 
gttggttttg 
agagttttga 
aagttgcaag 
aacagctaca 
ataaagaaaa 
cagagaaaaa 
tttcttgttt 
ttggagttca 
acgacattta 
tacttggttt 
cgcactcgca 
tgcacttgct 
atgattattg 
ggggactaat 
ccaaagtggg 
gctgacccac 
catagtagct 
gccacgattg 
agaagctttt 
acaatagtga 
tttatgatag 
ctatataagt 
agcagactta 
gtcccgccac 
gtagaacagc 
gtggctaaga 
cggcaaacat 
tgatgccaaa 
tctccactgt 
taatctggct 
aattcataca 
tttttgcatc 
attttttctg 
ttattagcac 
aggggcagtt 
ctcttcaaat 
ttggccacag 
taattggctg 
attaaagcgt 
ctcgaccagc 
aatgcagctg 
ttaagctgtc 
acgagtatga 
ggacggataa 
cgtggatggt 
ggggggtaag 
gggagggggc 
gagcgatacc 
gacctcgaac 
gtacttctcc 



atcctttttt 
tcggacccaa 
atttcaatta 
cgaaggactc 
accagataag 
atgatggaaa 
gctccgtttc 
gagcggaaca 
catctccgag 
tcccccagac 
cgccgatcga 
atacagagaa 
tagcatggag 
acagtttaaa 
cgtcctttca 
gatctgcgac 
agggatttgt 
gatggtctgt 
ggcccggggc 
catcgtgtgc 
cagcgtggtg 
ggtcaaaggt 
ccacataact 
cgtggtataa 
aaatattttt 
gtttaacatt 
ttgcagtttt 
aaagtttgga 
gtggaaatgt 
gacaccgaga 
ggattgacaa 
gtgtcccatt 
aagtgggatt 
attggccgag 
cagtattttc 
atagctttaa 
catgttaact 
tgtaaattcc 
cgcacagaca 
gatttcgtac 
cccaaaatag 
cccaggtgac 
gcacacacaa 
aaattgtccc 
atccaaaaga 
cccactccac 
agcgtctgtt 
tttaataagc 
caagagtatc 
gttgcgcagg 
tcgggggtta 
tggggtgggt 
aggctgaggg 
tccggcacgg 
caagttccat 



cccaactcgt 
ttagctttca 
attaaaatgt 
tattacacat 
ctgccaacat 
cttttgccca 
gctgtattgt 
tgtgtttgcc 
gaaggtattt 
aatggaagct 
gtacactgaa 
ttaatccatt 
ttggttggat 
acgattgacc 
tcgcactcca 
acggaagtgg 
ctacaccgag 
atgagtgaga 
attacgctca 
agggcattgt 
gtggtggccg 
ctgcaacgga 
taccgcatac 
ctgccaagtt 
aatagctgcc 
aaaggcattt 
ctcttctcga 
ctacttaccg 
ggtgtctcag 
atccctgtgg 
acgaaaccca 
aggccagatt 
agcaccagca 
cgcagctttg 
ggcaatttac 
agtgtttcaa 
tgttcaatct 
atacgcaccc 
ggtccctggt 
tgtcggatgg 
acataccttt 
aacaaacaag 
tcgggcattc 
cggagaaagt 
agttggtcag 
ttcggatgga 
tatagtttcc 
cagcaattgc 
cagaacaaag 
caacgcagct 
ggcaaaaata 
acactcactt 
tgtatccaat 
attctggggc 
tggcgtggca 



gtcaattaac 
attataccgt 
aaccgagcta 
taaagtttta 
attttctcta 
tgaaactttt 
tggcttgcca 
atgtcaactt 
taggaatttt 
taccgaaaga 
gtaaatagac 
tgcacattga 
tgttaatggg 
taacttatgt 
gcctttaata 
aaagttcaat 
ttctcctcga 
agagctgaat 
gagtgctgcc 
tggagccccg 
aacttacgca 
gtggatgtca 
tctcggctcg 
tacgaagaaa 
attcctagta 
gaggagaaat 
gtagtgtatt 
tcggttctgg 
tgcatttcgc 
cacacacaca 
cctgagtgct 
ccgagggacc 
gtgccttagc 
tgattagaac 
agtcggaaaa 
gaaactaatt 
tctctgatgt 
acatgatacg 
atatccaatc 
gaaacaaaag 
tcgggggtgc 
gccgggccac 
acacttaccc 
cttaaccacc 
ggtgaagaag 
gatctgtaga 
tttcactttg 
aattatctgg 
ccgacatgat 
ccctggtgaa 
aatgccatac 
gaagtaggcg 
gtagtagata 
gggcaggtgg 
gaacctcgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 
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gcattctctg gtcaaataca aaaacgcacg cagcgaaaaa tgcaaacagt ttttaaatag 3360 
actagtccta tacacttgga tctatattgg gaatatgcct tctctgtttg gacatcgact 3420 
taaaaaattc caatttaaat tcataactta gtgcaaagct aacagttttt ttcaaacgct 3480 
tagtttaaat ataagtctaa tgttaaagta taaggtcaaa ggggtactca ctgctgctat 3540 
cgtaatgaat tccctgcaac tcgtccatgc actggagcac cgccagggta ccgcgagctg 3600 
tcctcggcca gcacaggatc gagtcgaact gggtgaggca gtggccacta tcgttgccat 3660 
aggtggatgc ctcgatgtgc tcctgcacca ggcactgcaa ctccacgctc tcaccgatgc 3720 
catctaggtt gtgcagatcc aggagcggat cactcccgct ggcgttcacc gaatcgatgt 3780 
ggttgtggtc actcattcta caacatcctg gcacttggca tctctggtac tgaaataagt 3840 
cagtttaaag ttgaggcaaa gattgtaaaa tacttgaagc agtgggaaaa tttaaaatat 3900 
tttagaatca ttatagagtt ttaatacatt tcatttaaga accgtcggac aataacgttt 3960 
tagaaaaatg gtaagcttta atcggaaata aaatattata gactgatacg aaaatattgc 4020 
tgtcatttcc tgaccatttc cctacgaatt gacaacgttc ataatactcc ccagctgaat 4080 
tcgttcaatc agaaagttat aatttcctca attgttaggg gattccccgg cggttggctg 4140 
aaactaagct gtcgggcaaa gttcaattgc caacccttct gctcgctcca ccgcagccag 4200 
tggcatttcg cctgtgtgcg ttaattgcat tggccatgtt tcaccgacaa cgggccacgc 4260 
ccacatcgaa ctcggcttaa agatccatca actcggcggg aagtatgctt taagccaaag 4320 
accatttcta gggaaaacag ggtggctaga gataagtaaa tgcacaggcc cagtgccata 4380 
aaaattcatt agttttgcca tcaaaggcgg acagaagcgt cagcaaatca gagaagcggc 4440 
aggcaacctt ttattctgat atgtgcccat aaaaagcacg tcaaaggacc caagtctggg 4500 
gtgatggtga tagaaattag aaaagctaac ttccggccaa agattgtggc ctttgtggcc 4560 
tggttctgcc tggttctgcc tcagagacga aatctcgcat ttatgttctg ataataaatg 4620 
gcataaatga tgcatcgtta cattaatgca cgacataaaa cggggcggca agtaagacac 4 680 
acacacaaac aatacatatt aaaaatgtag' atgggcctgt ttgtacagtg agctagaaaa 4740 
gtaaaagtgc gctctttaaa atttaagcgg aagtggcttt cactttctac ttaacagtga 4800 
aataaattag gcaaacaagc cagcttgaat tccaaggaga agccataagc caaggcagga 4860 
atttaaatcc actcggagtg aaaatggtta tctcttaagt gatgcccaac ccgaaccaat 4920 
tactttctgt ggaatttccg attttctcgg acggcacatt ccccaggcga ttttctaatt 4980 
ggcacaggaa tttctaacca caagccatga aatattaaac ccgaagggaa attgaaaatt 5040 
ctgccccaag gcttgggtgc tccacttctg gctgccactg tatgctcggc ttcgtatttt 5100 
tctttccatt gtttctgctg tttgtttttc tcgtaaatta acgcaatgca tgtacataaa 5160 
acaaaggctg gctgattatt actgcacggc acatgcctgc acgtcctttc cggagtttgc 5220 
caaggtcctg ggtcctggtt cattagccgc tcgcttactt ataacacata cgccccgttg 5280 
ccttcatttt ccagcttcgc acgctcacag caacgcagat gagttaagac agcaatatac 5340 
gcgccgcaat tttaagttga aaagcatatt atttgcaaaa aagggaaaga acggcgaagg 5400 
accgcctcta attacctcat aaataacgtc gcacgaaatt agctggccta aattatggcc 5460 
ataactgcta attgcatatt tgatttcaat tagttcagct gacaggggcc caaaaaccca 5520 
agaaggacac atttgaaatc agcgtgcgaa tgcaaatacg ttctgacagc tcgcttggac 5580 
aaatctcatt tatggccagc caatttgacg aatgtccgcc cgccggggaa acaggcaaaa 5640 
caaattcagc ttggccgaat catcgccggc ataaaaagga cattaccaga aatacataca 5700 
tatacggagg cataagaaat ggttggtcct ggacgaagac caacggacaa atcgcacatc 5760 
gtcctttctg ctcctctgcc tctttgcagc atgctattat agaagattgt atgtaatctc 5820 
gtacgattgc acacaatgga tttgccaaag gcgacacaaa aaaaaaaaaa aaaaaaaaaa 5880 
taggggattg cgtgggggga gtttgcttgt gggtccttta ctttttttct cccccatgtg 5940 
agaggttctg caatttcctg ctgctataat gcttataaat gttcacccgc aatcgccggg 6000 
gatttcttta gggaattgaa aacataataa cagtgcgaat agtggcaagc attcgcattt 6060 
tccacttatc caactaaata tgcagactaa aaattaaatt ccagtttttt caagccetcg 6120 
aaggggattt ttattttacg cccttatatg tgtacctatg caaaaattcc gcaattccgc 6180 
gggacaccgg ggaaaagtgc tattttcggt tctcattaat gtagctcatt aaattttata 6240 
tgcataagct gccagctgca aacaaaaaat gtaagcattt aaaatttata agcgtgccac 6300 
acaagagacc aacagcacat tacgtatacg cagcatggac aaagccagct tggaaagcca 6360 
gcacgcaggc ataagcgtat agaaagtttg taattaaatt gctcatttaa gcgtggaaac 6420 
aatgaattcg atcgggaata cagggagtgg ggagtgttct gaggaaatat gtaatattca 6480 
gcgtgtaatc cgaagaaaat atattactaa ttttattcat ccgccttggg cgcttaaatt 6540 
ttattgttcg gacttttgtg gactctcctc ggaggccgat aaggaaaagc cagtctagac 6600 
aagccaacat cataatttgc aggcacgcac tgccatttcc atttccgccc aagcttgagc 6660 
ctgatctgct ggccactttc catgcttaag tcagcacata tcctttagca ggaagatgga 6720 
atgggatggg atggggctga cacttcagcg attgccaaca agtgttgcgt tttattgtct 6780 
ggcctttaac agctctcctt tgccatccgc ccattgtgtg tgcggcacgt gcaggcgcct 6840 
gcatttggag ctacgtgagg gatctccact cctgtggata gtccgagcat aaaaccataa 6900 
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aaatccgccg 
aaaaatttgg 
cttggcttgc 
agtttattag 
ttcccaatcg 
tctcggctca 
gtgagcgtgt 
ccctgaaaac 
tcgccactca 
gactacggac 
cccgtcctaa 
tgctggagcg 
caaaacttta 
tattcgaagc 
atttgttatt 
taaaaattag 
aattccgaaa 
atgtgtctta 
gaaaagctcg 
tggaacatcc 
cttctaatgc 
ttgcccagca 
caacaggatt 
tgaagtcgca 
ggcctaatca 
atatggctag 
aaacaagggc 
acaatgataa 
ttgcctgtgt 
accaaactgt 
ttccaattta 
tcgagtccat 
agttttctgc 



caaaaaaaga 
tttaatgcat 
aaagggaagt 
agtggtcaag 
atactgcaaa 
ttcagccaca 
gtatgcactg 
gcttaccttc 
cactcacaat 
cgcaaaagcg 
atgcgtttaa 
gttatttttg 
ttgagttcac 
actcaaagcc 
tcttttgcgt 
tttccacttt 
cgaaccgcac 
acaaggctcg 
caaagctgca 
tgacgggagg 
ctgaaaatct 
attcgctggc 
cggcaggatt 
acctgattta 
aattaaactg 
gctccgggaa 
aacattttta 
acatttagac 
ggatttttgg 
ttgattttta 
aaaatgcttt 
ccactcgttc 
tcgcattaaa 



aacgaacctg 
tacaagttgc 
ccttaaggcc 
tgcatggagt 
cacgcaaaac 
gagttttcgt 
gcattttaat 
tactcctagc 
ttgcgttgga 
ccaggaggcg 
cgccttccaa 
ctagttgcct 
atttatatgc 
aacgactaga 
ttgctaatta 
aacggctctg 
tcgtacacat 
ccttcttacc 
ttccacaagc 
ttgggtaaaa 
gcctatttga 
tgtttgttta 
cgccaggata 
attgtgattt 
aagtggcagc 
aatggtttcc 
ctcactggca 
ggatgtcctg 
gcgcaaggat 
aacaaatgct 
ctctcgatgc 
tttttctctc 
gcgttcgtcc 



gccaccaaat 
cgaaatgcca 
atggcaacaa 
ggggggcatt 
aatgcctctg 
tttagtaacc 
taaccacaat 
caagtttcta 
attcttggga 
ggtcctttgt 
gctgcgaaat 
tggttattga 
gggattattt 
aaagttattc 
aaattttcat 
aggaggacaa 
cccgctgata 
aaaaggcgcc 
tgttgaaaaa 
ggaccccaaa 
tttatgcaaa 
gggcatgcgg 
ttgtttgtga 
tcggagcccg 
gcctcgggtc 
cggaaaatgt 
tcttttctca 
gctgcttgga 
atgggcgggg 
aatgtcaact 
caaaagcaaa 
agattctccg 
tgtctcatgc 



ctgttattta 
cggagattgc 
gtccttcctg 
tcggatttcg 
ctgccaacta 
gctggcattg 
ttgtttctgc 
tctgcagaat 
ttacttgcgt 
cctttcgacc 
gattatgtta 
cacacgccca 
gttatagcga 
gacattaatg 
ttttctcttc 
taggttttcg 
actgcgggga 
aattctgggt 
gctttccgcc 
cagaaaccga 
tatgtgccgg 
caaaagggat 
aactccttgc 
ttcaaaacac 
tgacagttat 
gcgaaattaa 
tgcctgccaa 
atgcgtgcga 
tggtgggggc 
gccactcaga 
gacatgttcg 
attctctgac 
atatttaatg 



ttggcccaat 
cggagattta 
gttagcagtt 
ggctgaaatt 
atatccatac 
ttttcagcag 
attccaaacg 
gagattcctt 
ttagcactcg 
acctacttga 
aagcggtttt 
tttacgtggg 
cccttgaact 
cgcggcactt 
gcacattatt 
caagttttga 
tacccgaagg 
tccacaaagt 
gggcaacagg 
ttcgattgca 
agtctcatgt 
accccggact 
aggccgtctt 
ctctcacgtg 
tgaaatatca 
attgtagaga 
attttatatc 
cacatgaggg 
gatatctcca 
ggcatttaaa 
ttgtcctgtt 
ccgtttctcc 
tcacacg 



6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8877 



<210> 101 
<211> 2089 
<212> DNA 
<213> Drosophila 

<400> 101 

cggttcgttt cggaatttca aaacttgcga aaacctattg tcctcctcag agccgttaaa 60 
gtggaaacta atttttaaat aatgtgcgaa gagaaaaatg aaaattttaa ttagcaaacg 120 
caaaagaaat aacaaataag tgccgcgcat taatgtcgaa taacttttct agtcgttggc 180 
tttgagtgct tcgaataagt tcaagggtcg ctataacaaa taatcccgca tataaatgtg 240 
aactcaataa agttttgccc acgtaaatgg gcgtgtgtca ataaccaagg caactagcaa 300 
aaataaccgc tccagcaaaa accgctttaa cataatcatt tcgcagcttg gaaggcgtta 360 
aacgcattta ggacgggtca agtaggtggt cgaaaggaca aaggacccgc ctcctggcgc 420 
ttttgcggtc cgtagtccga gtgctaaacg caagtaatcc caagaattcc aacgcaaatt 480 
gtgagtgtga gtggcgaaag gaatctcatt ctgcagatag aaacttggct aggagtagaa 540 
gtaccagaga tgccaagtgc caggatgttg tagaatgagt gaccacaacc acatcgattc 600 
ggtgaacgcc agcgggagtg atccgctcct ggatctgcac aacctagatg gcatcggtga 660 
gagcgtggag ttgcagtgcc tggtgcagga gcacatcgag gcatccacct atggcaacga 720 
tagtggccac tgcctcaccc agttcgactc gatcctgtgc tggccgagga cagctcgcgg 780 
taccctggcg gtgctccagt gcatggacga gttgcaggga attcattacg atagcagcaa 840 
gaatgcaacg aggttctgcc acgccaatgg aacttgggag aagtacacca actacgatgc 900 
ctgcgcccac ctgcccgccc cagaatccgt gccggagttc gaggtcatcg tggagctgcc 960 
caccattatc tactacattg gatacaccct cagcctggta tcgctctcgc tggcgctgat 1020 
tgttttcgcc tacttcaagg agctgcgttg cctgcgcaac accatccacg ccaacttgtt 1080 
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cttcacgtac atcatgtcgg ctttgttctg gatactcttg ttatccgtcc agatctccat 1140 

ccgaagtgga gtgggcagct gcattgctct gatcaccctg ttccacttct tcaccctgac 1200 

caacttcttt tggatgctgg tcgagggttt gtacctgtac atgctggtgg ttaagacttt 1260 

ctccggggac aatttacgct ttaatatcta cgcctccata ggctggggtg gcccggcctt 1320 

gtttgttgtc acctgggctg tggccaagag tctgacggtc acctacagca cccccgaaaa 1380 

gtacgaaatc aactgcccct ggatgcagga gacccatgtg gattggatat accagggacc 1440 

tgtctgtgcg gtgctaataa tcaatctcac atttctgctg cgtatcatgt gggttctaat 1500 

cacaaagctg cgctcggcca atacagtgga gactcgtcag tataggaaag ctgctaaggc 1560 

actgctggtg ctaatcccac tttttggcat cacctacctg gtggtgctgg ctggtccctc 1620 

ggaatctggc ctaatgggac acatgtttgc cgtcctgcga gcggtattac tcagcactca 1680 

gggattctcg gtgtcgctgt tctactgttt cctcaactcg gaggtgcgaa atgcactgag 1740 

acaccacatt tccacgtggc gggacacgcg caccattcag cttaaccaga accgacgtta 1800 

taccacgaaa agcttctcga aaggcggtgg ttccccccga gccgagagta tgcgaccttt 1860 

gaccagctac tatgggcgtg gcaagcggga atcgtgcgta agttcggcca ccaccaccac 1920 

gctggtgggt cagcatgccc cactttcact ccaccggggc tccaacaatg ccctgcacac 1980 

gatgcccact ttggcggcaa atgccatgag ttccggcagc actctgagcg taatgccccg 2040 

ggccattagt cccctgatga ggcaaggact cgaggagaac tcggtgtag 2089 

<210> 102 
<211> 504 
<212> PRT 
<213> Drosophila 



<400> 102 



Met 


Ser 


Asp 


His 


Asn 


His 


He 


Asp 


Ser 


Val 


Asn 


Ala 


Ser 


Gly 


Ser 


Asp 


1 








5 










10 










15 




Pro 


Leu 


Leu 


Asp 


Leu 


His 


Asn 


Leu 


Asp 


Gly 


He 


Gly 


Glu 


Ser 


Val 


Glu 








20 










25 










30 






Leu 


Gin 


Cys 


Leu 


Val 


Gin 


Glu 


His 


He 


Glu 


Ala 


Ser 


Thr 


Tyr 


Gly Asn 






35 










40 










45 








Asp 


Ser 


Gly 


His 


Cys 


Leu 


Thr 


Gin 


Phe 


Asp 


Ser 


He 


Leu 


Cys 


Trp 


Pro 




50 










55 










60 










Arg 


Thr 


Ala 


Arg 


Gly 


Thr 


Leu 


Ala 


Val 


Leu 


Gin 


Cys 


Met 


Asp 


Glu 


Leu 


65 










70 










75 










80 


Gin 


Gly 


He 


His 


Tyr 


Asp 


Ser 


Ser 


Lys 


Asn 


Ala 


Thr 


Arg 


Phe 


Cys 


His 










85 










90 










95 




Ala 


Asn 


Gly 


Thr 


Trp 


Glu 


Lys 


Tyr 


Thr 


Asn 


Tyr 


Asp 


Ala 


Cys 


Ala 


His 








100 










105 










110 






Leu 


Pro 


Ala 


Pro 


Glu 


Ser 


Val 


Pro 


Glu 


Phe 


Glu 


Val 


He 


Val 


Glu 


Leu 






115 










120 










125 








Pro 


Thr 


He 


He 


Tyr 


Tyr 


He 


Gly 


Tyr 


Thr 


Leu 


Ser 


Leu 


Val 


Ser 


Leu 




130 










135 










140 










Ser 


Leu 


Ala 


Leu 


He 


Val 


Phe 


Ala 


Tyr 


Phe 


Lys 


Glu 


Leu 


Arg 


Cys 


Leu 


145 










150 










155 










160 


Arg 


Asn 


Thr 


He 


His 


Ala 


Asn 


Leu 


Phe 


Phe 


Thr 


Tyr 


He 


Met 


Ser 


Ala 








165 










170 










175 




Leu 


Phe 


Trp 


He 


Leu 


Leu 


Leu 


Ser 


Val 


Gin 


He 


Ser 


He 


Arg 


Ser 


Gly 








180 










185 










190 






Val 


Gly 


Ser 


Cys 


He 


Ala 


Leu 


He 


Thr 


Leu 


Phe 


His 


Phe 


Phe 


Thr 


Leu 




195 










200 










205 








Thr 


Asn 


Phe 


Phe 


Trp 


Met 


Leu 


Val 


Glu 


Gly 


Leu 


Tyr 


Leu 


Tyr 


Met 


Leu 




210 










215 










220 










Val 


Val 


Lys 


Thr 


Phe 


Ser 


Gly 


Asp 


Asn 


Leu 


Arg 


Phe 


Asn 


He 


Tyr 


Ala 


225 








230 










235 










240 


Ser 


He 


Gly 


Trp 


Gly 


Gly 


Pro 


Ala 


Leu 


Phe 


Val 


Val 


Thr 


Trp 


Ala 


Val 










245 










250 










255 




Ala 


Lys 


Ser 


Leu 


Thr 


Val 


Thr 


Tyr 


Ser 


Thr 


Pro 


Glu 


Lys 


Tyr 


Glu 


He 






260 










265 










270 






Asn 


Cys 


Pro 


Trp 


Met 


Gin 


Glu 


Thr 


His 


Val 


Asp 


Trp 


He 


Tyr 


Gin 


Gly 






275 










280 










285 
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It X. (J 


Val 


Cys 


Ala 




290 










Val 


Leu 


305 








r\x y 


Gin 


Tyr 




PhP 


Gly 


Tip 
lie 


Thr 








340 


Leu 


Met 


Gly 


His 






355 










Oct 




370 






Arg 


Asn 




Leu 










lie 


bin 


Leu 


Asn 


biy 


u.iy 


biy 


Ser 








420 


Tyr 


Gly 


Arg 


Gly 






4 35 




Thr 


Leu 


Val 


Gly 




450 






Asn 


Ala 


Leu 


His 


465 








Gly 


Ser 


Thr 


Leu 


Gin 


Gly 


Leu 


Glu 



500 



Val Leu lie lie 
295 

lie Thr Lys Leu 
310 

Lys Ala Ala Lys 
325 

Tyr Leu Val Val 

Met Phe Ala Val 
360 

Val Ser Leu Phe 
375 

Arg His His lie 
390 

Gin Asn Arg Arg 

405 

Pro Arg Ala Glu 

Lys Arg Glu Ser 
440 

Gin His Ala Pro 
455 

Thr Met Pro Thr 
470 

Ser Val Met Pro 
485 

Glu Asn Ser Val 



Asn Leu Thr Phe 
300 

Arg Ser Ala Asn 
315 

Ala Leu Leu Val 
330 

Leu Ala Gly Pro 
345 

Leu Arg Ala Val 

Tyr Cys Phe Leu 
380 

Ser Thr Trp Arg 
395 

Tyr Thr Thr Lys 
410 

Ser Met Arg Pro 
425 

Cys Val Ser Ser 

Leu Ser Leu His 
460 

Leu Ala Ala Asn 
475 

Arg Ala lie Ser 
4 90 



Leu Leu Arg lie 

Thr Val Glu Thr 
320 

Leu lie Pro Leu 
335 

Ser Glu Ser Gly 
350 

Leu Leu Ser Thr 
365 

Asn Ser Glu Val 

Asp Thr Arg Thr 
400 

Ser Phe Ser Lys 
415 

Leu Thr Ser Tyr 
430 

Ala Thr Thr Thr 
445 

Arg Gly Ser Asn 

Ala Met Ser Ser 
480 

Pro Leu Met Arg 
4 95 



<210> 103 
<211> 3731 
<212> DNA 
<213> Drosophila 



<400> 103 

catcgattaa 

aggactggta 

aegagegtet 

atttagacaa 

tgtggcgaag 

tcgtgggaaa 

tttaatctga 

tctacatgta 

tataggactt 

cagaaacata 

aataaacata 

gttggcacga 

ggatgggtgg 

agactttegg 

ttaattggtt 

aatatcagee 

agecagggtt 

agtcgcccct 

tattcctget 

taaggaaaca 

tgtttatgga 

tggcctcgaa 

tgatgccaaa 

tttccagggt 



ccactatccc 
tcgactttat 
ccagcggtac 
aatctgtgat 
gaegcagaca 
caactaaaag 
tctcttataa 
tgtegtatae 
gttgatgtca 
agaaaataag 
acattaagaa 
gtgtgttagt 
atgtcaaaga 
ttcgegatet 
aaatgacaac 
atagecacat 
atataaatat 
caatgattgg 
ctcccgccac 
gtagaacaga 
aagactgtca 
gagattacga 
gagaggtatc 
atgagcagaa 



atttattgee 
aggacctacg 
gagttggact 
atcgatactg 
gataacacat 
aatataaaaa 
tgtaaactat 
gtatattttt 
gtctacaata 
taagaataaa 
ctgatgtagg 
atatttagag 
agtagaaagc 
cagegagtag 
aacaacaaca 
caactaggta 
aatccaacaa 
ttagcccacc 
egggtgaate 
gecacaaaga 
cctaatctat 
ctgataccct 
agcaccagca 
cgtaatttag 



tgcccgtcta 
acaccgaggg 
gaattaatgc 
tggtagcgtt 
tcctccgtgc 
tggcccgttg 
catgeaatae 
gtgtggtgtt 
aacgataaac 
aaggatatct 
gaaatgcacg 
aacgggtgca 
atatacatga 
tcttttgagg 
acaacatcga 
gtgeaaatag 
caataatgtg 
tgegattett 
catgcctcag 
agecctaaag 
atacctgegt 
gttccgggcc 
gcgccttcga 
tgatgagtac 



cacaaategg 
atttgttttc 
cctttttcag 
gcagagatcc 
tgaaaatcaa 
gtttataaaa 
ttagggacga 
cattcatagt 
agaaaaagga 
aaagccaacc 
gttttaagta 
atttaacttt 
geattgegta 
cgcgatgcct 
cgacaatagc 
tgaaaageca 
etagaatgee 
gatggaactg 
cgtctggcgc 
aatggatttt 
gcttatgaga 
tgttagcacc 
ggccttgtaa 
ctgtacgata 



cacgtgaacg 
gggcagtttt 
aaagaataag 
caatcgegtg 
ttataattag 
aatataatac 
acggatgtgg 
gtccagtggg 
aaaccccaat 
aaccatgeag 
gatcacttta 
ggaattattt 
tacgcaccgt 
gaaatggaaa 
gaacaactac 
atatgtcgac 
aaatgagtgc 
ttccgatgga 
acctctgagt 
cgaagatcaa 
aaggctctta 
aacagatagg 
tactgccgcg 
tgatataata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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gcatgaagct 
tgaggaatac 
tgtgagattc 
agcaagcata 
agatgcggag 
cagccgatga 
agcgtgtaca 
accgcctttg 
aaacatgatt 
taattaatta 
tgtgagtatc 
gtttcgcagg 
gtggcattac 
agctcaggaa 
cgggcaccac 
cccacgttcc 
ttaagatcta 
tcctcgaaac 
gaatcgaacg 
gattggcaag 
aaacgcaact 
ttctgactcg 
gcatgatttt 
agccaaatat 
atttatatat 
gtacatatat 
gttcgtagcg 
ttgatatttt 
gaagaattca 
agaacctttt 
cgagtaagcc 
attttcactc 
ttttcgagga 
cctcaagtgg 
tatgtgctgc 
gtttacttgc 
ggattgaagc 
gaaattcctc 
tatttattgg 



aactcactta 
tagattaacc 
acgcatccat 
tctaggtagg 
caaatgcctt 
gggcgtagat 
gatagaggcc 
aacccaccct 
accaacgtta 
gtggagtggg 
cagaggagtg 
caacgaagat 
aactcactta 
ataaccgccg 
cggtatggag 
gtttggaaag 
aacccaatct 
agggtagtac 
agctgggaca 
gtaaacttcg 
gacctagccc 
gtataataat 
ctttaatcaa 
cgatgtagtc 
ttgcaatggc 
gtatattaac 
ccaaattaat 
ttttattgac 
ccaaattggc 
tgcgacagct 
aaacactgga 
attagcctcg 
acaacgccct 
atttccttgg 
attttccgga 
ctggtttttc 
caattatttc 
aatcggcaaa 
c 



agccagaagt 
agcaaagcca 
gcacaatcaa 
tggcaattgg 
ggcaatggac 
aataaagcta 
tataaaaagc 
ccacaaacat 
tgcagccagc 
atatatatat 
cggatgtgat 
ctctgaagag 
aagctgagga 
gcatatatga 
cccgagttct 
caaaagcggg 
ggaataccca 
agccagactg 
ttgtaaaatc 
ttgagcattt 
aaaacccatt 
gatgccttca 
atttagtttc 
ccaaaaaaaa 
gtgctaaaca 
gttaagtata 
cgcgacgcgc 
cttatcaggt 
aattctttta 
aatgcgatcg 
gactggaaac 
tcatttggac 
tgatcgtgca 
gtctagtttt 
aataaaatgc 
atcgtatcgt 
ggaacagaag 
agagatgaat 



taaccggggg 
gtgatgcagg 
tttccagctt 
tatcataccc 
cacaccaaaa 
atgttatcac 
aaatgataaa 
ccagaaaaag 
ctgactagac 
tccagcttat 
gtaggtgagg 
ttttaagcat 
atatgatgag 
tggccggcag 
ggtgacagcg 
tggcattgtc 
cctgtggtgt 
ccggcgtttg 
tgtaagaaat 
actgcagtat 
aaatcagagt 
ttgcggatcc 
cagactttac 
aataaagaaa 
atttcgtgca 
cgacgacccg 
tatggtagcg 
gacgccgaga 
tggccggtct 
gacttagcca 
tccggcaata 
acagtcatag 
ttcggtataa 
gtttacaggc 
caaaataatt 
acttaaagtg 
atggatcaat 
tcattggcca 



actcacccac 
taccttgaat 
ttgaaaaatc 
cattgaagtg 
tgcatacggc 
tggagaatgt 
accgaggaaa 
ttggttaggt 
tctgtggtta 
cttaccactt 
aacaaattgg 
gaaaatgatg 
agccaccacc 
ttcgacgttg 
gtcatagtcc 
tgcaaatata 
cgtagtgcac 
tgcgtggcca 
aacaattgag 
aattcgtgtt 
acataagatt 
tggtcattta 
acacacacat 
gaatcaccag 
tacggataca 
gcgattgttt 
gtgttagaaa 
gcgcggagcg 
tcccattact 
cgcgacaagt 
tttgtttctt 
tacgaataca 
acaaaaatgg 
gagaggaata 
taatccgaaa 
aaagagattg 
tagcgggtgc 
aacaaagggt 



atgatgcgta 
atccagtcaa 
agaagaggtt 
ctcgttctcg 
tggacagccc 
ttgcaccacc 
ttcaattgat 
aaaagtactg 
tctaagtgaa 
gcaggaacag 
catgaatggt 
caactcatgt 
aaggtggcga 
ggtgagaagt 
gaatagtgat 
ttgattttat 
gcccttgaat 
gcacaggacc 
aatatacaat 
tcccctctgt 
catgccccgt 
tagctctgac 
acgcagcgtg 
ctaccgcttt 
caagttatat 
acccagttca 
cagagcttca 
gagactcgtc 
gataagcccg 
gagctgtcaa 
tgctcttctg 
ttttatgaca 
ctaaatgcag 
cataaataaa 
gcgatcgaca 
acgggcggat 
gattaattgg 
aattagggaa 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3731 



<210> 104 
<211> 1167 
<212> DNA ' 
<213> Drosophila 



<400> 104 

atgaaggcat 

tgctggccac 

ggcgtgcact 

gatcactatt 

gacttctcac 

ttcgccacct 

cgaaacacca 

ctcacactgt 

gtaatcatgt 

tatctgtaca 

gccctcatcg 

tttgctccgc 

cgtgaatctc 

ctagtattcc 



cattattata 
gcacaaacgc 
acgacaccac 
cggactatga 
ccaacgtcga 
tggtggtggc 
ttcatgccaa 
tcctgcaagt 
ttcagtactt 
cgctggtggt 
gctggggctg 
atctcgagaa 
acattgactg 
tcatacgcat 



ccgaatttta 
cggcagtctg 
agacaatgcc 
ccgctgtcac 
actgccggcc 
tctcatcata 
tttgttcctc 
gataaccaca 
ttacctaacc 
gcaaacattc 
tccagccgta 
cgagcacttc 
gatattcaag 
catgtgggta 



caatgtccca 
gctgtactac 
acccgctttt 
cagaactcgg 
atcatatatg 
ttcctcagct 
acctacatca 
gagtctagtc 
aactttttct 
tccagtgata 
tgcattttgg 
aatgggctgg 
gtacctgcat 
ctcatcacta 



gctcgttcga 
cctgtttcga 
gctttccaaa 
gctccatacc 
ccggcggtta 
ttaaagatct 
catccgcact 
aggctggctg 
ggatgtttgt 
acattagctt 
tgtggtccat 
aaattgattg 
cactggcttt 
aattacgttc 



ttcggtcctg 60 
ggaattcaag 120 
cggaacgtgg 180 
ggtggtgccc 240 
tttcctgagc 300 
tcgttgcctg 360 
cctctggata 420 
cataacgttg 480 
ggagggcctc 540 
tattatctac 600 
tgccaaggca 660 
tgcatggatg 720 
gctggttaat 780 
tgctcatacc 840 
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ctggaaacgc ggcagtatta caaggcctcg aaggcgctgc tggtgctgat acctctcttt 900 

ggcatcacct atctgttggt gctaacaggc ccggaacagg gtatcagtcg taatctcttc 960 

gaggccataa gagcctttct cataagcacg cagggcttct ttgtggctct gttctactgt 1020 

ttccttaact cagaggtgcg ccagacgctg aggcatggat tcacccggtg gcgggagagc 1080 

aggaatatcc atcggaacag ttccatcaag aatcgcaggt gggctaacca atcattgagg 1140 



ggcgactgca ctcatttggc attctag 



1167 



<210> 10S 
<211> 388 
<212> PRT 
<213> Drosophila 



<400> 105 



Met 


Lys 


Ala 


C ft 

ber 


Leu 


Leu 


Tyr 


Arg 


Tift 

He 


Leu 


Gin 


Cys 


Pro 


Ser 


Ser 


Phe 


1 








5 










10 










15 




Asp 


Ser 


Val 


Leu 


Cys 


Trp 


Pro 


Arg 


Thr 


Asn 


Ala 


Gly 


Ser 


Leu 


Ala 


Val 








20 










25 










30 






Leu 


Pro 


Cys 


rne 


blU 


GlU 


Phe 


Lys 


Gly 


Val 


His 


Tyr 


Asp 


Thr 


Thr 


Asp 






35 










40 










45 








Asn 


Ala 


Thr 


Arg 


Phe 


Cys 


Phe 


Pro 


Asn 


Gly 


Thr 


Trp 


Asp 


His 


Tyr 


Ser 




50 










55 










60 










Asp 


Tyr 


Asp 


Arg 


Cys 


His 


Gin 


Asn 


Ser 


Gly 


Ser 


He 


Pro 


Val 


Val 


Pro 


65 










70 










75 










80 


Asp 


Pne 


Ser 


Pro 


Asn 


Val 


blU 


Leu 


Pro 


Ala 


He 


He 


Tyr 


Ala 


Gly 


Gly 










Q C 
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90 










95 




Tyr 


Phe 


Leu 


Ser 


Phe 


Ala 


Thr 


Leu 


Val 


Val 


Ala 


Leu 


He 


He 


Phe 


Leu 
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1 Uo 
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Cft v- 

ber 


rne 


Lys 


Asp 


Leu 


Arg 


Cys 


Leu 


Arg 


Asn 


i nr 


Tift 

lie 


Ll i ft 

HIS 


Ala 


Asn 


Leu 






115 
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Phe 


Leu 


Thr 


Tyr 


Tift 
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C ft V 

ber 
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Leu 


T ft ii 

Leu 


Trp 


Tift 

lie 


Leu 


Thr 


Leu 


Pne 




1 JV 










1 "3 C 

loo 










i a n 
14U 










Leu 


bin 


vai 


T 1 « 

lie 


lnr 


1 nr 


Pill 
blU 


C ft 

ber 


Ser 


bin 


Ala 


P 1 . F 

bly 


Cys 
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Thr 
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14 0 










i c n 
13U 










ICC 

loo 










i c n 
1 DO 
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Tl « 
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inr 


Asn 


DK ft 
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Trp 


Met 
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1 ^ c 
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Leu 


Tyr 
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i nr 
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Vdl 


bin 


i nr 


rne 


C ft ^ 

ber 


C ft 

ber 








ion 
18U 










IOC 

loo 










190 






Asp 


Asn 


Tlrt 

± ±G 
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rne 


T "1 a 

lie 


Tift 

lie 


Tyr 


/iia 


Leu 


lie 
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Trp 
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Cys 


Pro 






1 b»D 
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T 1 <= 
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215 










220 










Leu 


Glu 


Asn 


Glu 


His 


Phe 


Asn 


Gly 


Leu 


Glu 


He 


Asp 


Cys 


Ala 


Trp 


Met 


225 










230 










235 










240 


Arg 


Glu 


Ser 


His 


He 


Asp 


Trp 


He 


Phe 


Lys 


Val 


Pro 


Ala 


Ser 


Leu 


Ala 










245 










250 










255 




Leu 


Leu 


Val 


Asn 


Leu 


Val 


Phe 


Leu 


He 


Arg 


He 


Met 


Trp 


Val 


Leu 


He 








260 










265 










270 






Thr 


Lys 


Leu 


Arg 


Ser 


Ala 


His 


Thr 


Leu 


Glu 


Thr 


Arg 


Gin 


Tyr 


Tyr 


Lys 






275 










280 










285 
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Ser 


Lys 


Ala 


Leu 


Leu 


Val 


Leu 


lie 


Pro 


Leu 


Phe 


Gly 


He 


Thr 


Tyr 
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Leu 


Leu 


Val 


Leu 


Thr 


Gly 


Pro 


Glu 


Gin 


Gly 


He 


Ser 


Arg 


Asn 


Leu 


Phe 


305 










310 










315 










320 


Glu 


Ala 


He 


Arg 


Ala 


Phe 


Leu 


He 


Ser 


Thr 


Gin 


Gly 


Phe 


Phe 


Val 


Ala 
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Cys 


Phe 


Leu 


Asn 


Ser 


Glu 


Val 


Arg 


Gin 


Thr 


Leu 


Arg 


His 








340 
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Gly 


Phe 


Thr 


Arg 


Trp 


Arg 


Glu 


Ser 


Arg 


Asn 


He 


His 


Arg 


Asn 


Ser 


Ser 






355 










360 










365 








He 


Lys 


Asn 


Arg 


Arg 


Trp 


Ala 


Asn 


Gin 


Ser 


Leu 


Arg 


Gly 


Asp 


Cys 


Thr 
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370 

His Leu Ala Phe 
385 



375 



380 



<210> 106 
<211> 7564 
<212> DNA 
<213> Drosophila 

<400> 106 

tttccgctgc catctgcagc tgtagctgcc taacaaatgc aattaaaatc tttgcgcaag 60 
aaaattgttg cagcaggcac gcagagatac tcgtactgta tgtgctgcaa tagtttggtg 120 
gcagcaacgg ggcggcagat aacatattgc ccacgtccgg ctgaaattgt acggacaagg 180 
cgaactcaaa aaaaaaaaaa aaaaaaagga atccctcaaa aggagaacca agaaaaatgg 240 
gatttatctc tggccaagct ccagccatac aaatctaatt tagaccacac gacggcttca 300 
gatattttat attccccact cagttgcggt ccatcctgat ctcctccagc ttgcttgatg 360 
agcccagccg ttggccaaat gaaattattt atgggcagtc ctaggtcagt gagaatttgg 420 
ttttttaagg gtttgaactt tgcggctcaa caatgcctgt gatttttttt ttttttatta 480 
tttttttgca ttgcatccat tagagaaagc actcctttgg cattctactc agctacttgc 540 
cacataaata tctacggagg gtagctaact agtccttatc tcaacttccc catttatcct 600 
cgccgcaagt gcacttttcc tgctgctaat taagttgcaa ggcagctttc tcagcaggcg 660 
ttgcacgaag tagataaaag gaaaagcccc cgatttgcag acactatgct gggaaagata 720 
tatatatata tatattttgc cagcattcgc taatcggctt agggcagcca ccatgcgacc 780 
acaacaacaa caacaacagc agcagcagtg tgctccatgc aatatttaat cacattagtt 840 
gatggggtaa cggccatgta cgcaacaatt tacaacttgc attaattata cgtctagttg 900 
gacaatggcg cctctaacta tcccactcta aatgtatctc tatctctatc cttatttgca 960 
ggggacaccc gttgataata aattattaat gcatctgctg ggaaggcggc actgcacggc 1020 
tgtccttggc agcaaatcca gttgcaacat cggtgagtga cgggcggcaa tcacaaagga 1080 
cttgcaacag ttgccgttgc ttctgctatt gctgttgctg ttgctgttgc aacaatggcc 1140 
agtggcaaca atgaaactga gccgctctac tgcggcagcg gcatggataa ttttcataca 1200 
aggtaagcga ctttcaccaa tgtcacgatc aaggccaagg ggcttttcca ccagacatgg 1260 
cgacaacaat accataagcc aacagttggg ggcgtggtcg ggggcggagg cgatggtggc 1320 
aacatcatag ccattagctg ctggcgcaag ggaaccgttc aaaaatcgat tatcgcccca 1380 
tttcggggga gcttctattt tgatttgccg tacaattttc tcgggcgatt aaacgacgaa 1440 
gcagaacgaa aacaaaaaac agatttgtca acagcaaggt caacaattga tggctgaaat 1500 
caatttaatt gaccatatcc tacgggccct ccaagtggcc atctgctgca cctataaaaa 1560 
agtgaatccg gtctgcgatt atttatatat tcgttgcatg gcaggcggtc gtaaaacctc 1620 
gagatgatga ttaaaagcgg ccctaaaaac ttaatggcgg tttaggaaat tcaattcctg 1680 
taatttaagc cgagtcacca ttcttcgaag ttcttacatg taagcgataa taaatagtta 1740 
agtcaattgg ccaataaacc tattaatatt gtgcatttac cacgattaga ctttgattaa 1800 
agtgacaatg ctgatttctg tagaggaaat ctagttctag tcttcccaca aagctattta 1860 
gttactcttg aataaatatg ttacttttct tttgccaaaa ccaacagaat tttaaattta 1920 
ataatttgga ttttttgcaa taaactgtac tgattaatgg gccacacaaa aatgtctagt 1980 
ttattatgga gctcttggtt tcataaatta agaacataat ccaatcggca tataaatcat 2040 
tgatagcaat ttattttccg tgatgaaact gtgctccgtg tgaactgcga attactcatt 2100 
ctacggttgc aaaaaaagcc accaacggtc aacatttaga ccaggacttt tagttttaat 2160 
tagagccagc ctggccaaca gcagtgttaa tgaccacaaa gtggctggcc acaggatcag 2220 
catcccagaa tgcgatgccg catttgcttt aattaaaggt agtagctgga gtttgaaaga 2280 
tgactgtatg gcaattagat gtgtagccag aacacttggc catttacttt tgtgtcaaag 2340 
tcgtgccaaa ttgccagcgg aggcgacact tgacgctgtc acgccccaga cagacgcaga 2400 
ccggcccaaa agcacccact cagccgtctc caggcgccac tcaagcggca aaggaacgcc 24 60 
aaaacactag gacacagaac gccagaagac tcgaaaaaaa agtataaaat tcaaacgcgt 2520 
atacaattca atgtgtcaca tcgaagaaga accctttctc tggcggcaac gtgacgtata 2580 
cgcaatatta atggcaaagc gaaaagggcc tcaaaagcta acgaaaaacc gcagcataac 2640 
acgcacaaaa caagcaatta acataaataa taaagaacgc acaaaggcgt tcggctctcc 2700 
gttgccaaag agaaaagcca acgaaaggaa taaaaaacgc caaaaagccc gaaaaaaaaa 2760 
ccgaagaccc aaaaaatgtg taaaattgat ttcggaaaat atttaattta ccaaatgcat 2820 
gctcgctttt tgcggccgtg tacgccccga acaaataaaa caaataggca cacatcgtct 2880 
ggcatttcat gttaaaggct tctcatgctg atggacagga tacagaaaat atatatacga 2940 
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agtgaggtgt 
gcgacacttt 
tccattcatc 
gatttcacct 
gataatccat 
tggaattgtt 
tattggccca 
taggcaaaac 
acctaaaagg 
gtatgaaatt 
ggataccggt 
agagttttat 
acattttcct 
cggaaaaatt 
gctctcatct 
gtgtgggcta 
tgtcgttcgg 
acagcaataa 
gtcactccat 
ggacttgcat 
ttggctgcaa 
ccaaccgaaa 
gcacataaat 
aagcgctcga 
tgggcgtggc 
ctgctgcttg 
cgagaaagcg 
cagaaacatc 
gtgtctttat 
tgggaaattg 
tttaatggat 
aaatattgtt 
ctacaagaac 
taccttgaat 
cacgggtctg 
cgactacatc 
cttcatcaag 
ggtgaccctg 
gtgcggtatg 
ggtgtcgccg 
gaatggcaaa 
gttactgagt 
aactgtgtgg 
gccatcgcca 
gcacaatgcc 
accatgcata 
gcggcggcgg 
atcaccggcc 
ggaaaagcag 
cactgaattt 
tctgcagtgc 
ttccacagat 
ccaggtgacc 
tgctccatga 
gacaagtggc 
gtggcaccgg 
accaacatca 
ctcagtcgcc 
ggcgtgggag 
gccatcgtgg 



gtgtctacgt 
ttggcggcac 
aaactcacaa 
ggctaaaaat 
gttgatgaat 
ttgcaccagg 
gaaatcaggc 
aaacagtagt 
cttggctttt 
aattcccttt 
tttttttttt 
gccaagttgg 
cccactacaa 
tggttcgcaa 
atgtagattg 
acatgactaa 
ccaggaatcc 
agaatttaac 
ttggagagct 
cggtctcgcc 
cttgcacaaa 
ccgttgaggg 
ttgagtcctg 
cgaaaactaa 
ttgcttaagc 
gcaagtcaac 
gactagacat 
agaaacataa 
tacacttctc 
cttaaaaata 
taataagttt 
gaattgaaga 
atgcatggct 
atcattgtgc 
gccgtggccg 
ctgacggaca 
ttccattcga 
gctgtttggc 
agaaccacca 
tcgctctatt 
gtgataaact 
gccaggacgg 
ttaaatgcga 
gtggtgcgaa 
tcgctgcaaa 
gcactcacca 
aagaagctca 
aatggtaaag 
acggatcgca 
ccacaaggga 
tacctaagac 
tcccatttca 
tgatggatat 
gcaagcaatt 
tgccggtggc 
tcctggaaaa 
cgcaggtgac 
tgctgagtgt 
gaggtggtgc 
tggtggtgga 



gccacaaaaa 
aaccctcgtc 
aaactttgcc 
ccagactccc 
ggcggcaacg 
atggcatcgg 
atagttagct 
acaccggaaa 
aagctttcac 
tgtcaataaa 
tttcaatgaa 
gcacagtttt 
ctgattgcat 
acaaattaat 
gaatggaaat 
aaagttgtcc 
ccttttttgt 
taaagtcctg 
tggagtatat 
tcgcattttc 
ggcaaaaaat 
ctgcctcgct 
cgcctgggct 
acaaaagctg 
cctagacacc 
ggaaatggct 
ggaagtaaca 
tttgggaaat 
tgccgtgatg 
aaatttgttt 
aggaagttaa 
gataactcgg 
atgtttcgct 
taacccgacg 
acctggcagt 
gtttgccgcg 
ttttcgccca 
gttatatagc 
taataacgat 
tgatcacggc 
ccattcccat 
ctgccctgaa 
gcaccttgct 
atgttactgt 
atgccacatt 
ttctgtcggt 
ccagcaagcc 
cagcggatag 
ccacgagaat 
ttatgggtct 
tgagtaagta 
ctttaagatg 
cttggccctg 
ccgcaccacg 
gcaggacgaa 
gggacgacag 
aaatctgtag 
cctgaaacgc 
gccgttggcc 
caaggtgagc 



cagcgaaatt 
ttgtcgacca 
tggtgtacgg 
agccatccct 
acaacaacgt 
ggctccaggt 
gccgaaatga 
tgcatatcat 
atttataaaa 
ctttctttct 
atcccttcga 
ttttttcccc 
tgccggtgca 
atgaactggc 
tccggctaga 
acaaatttgg 
ttcgagtgaa 
acaccgacag 
taaacatttt 
ctatataaat 
aaacaagggc 
tttttcctgt 
tttcctcttc 
gtaaaatgtt 
cgagagttta 
ttcggatggt 
aattggttaa 
gcctgccact 
ggttataact 
aatttaattt 
gtatatacat 
agctaatccc 
ggtggtctgc 
ggagatgcgc 
tatgctggag 
ggaggagaag 
ggttctgcac 
ggtgggttat 
aaccaccgct 
tataaccgaa 
gacccagtac 
tgccacgccc 
gacatcgaca 
ctataggcta 
tctcatatac 
tcgattgatc 
cgccactccg 
gccccggaaa 
gctgctggcg 
gctgaatgct 
aatatgtact 
taataaccca 
atcaactcca 
ttcacgctgc 
atggcagctg 
cagccgcagg 
cacaggagga 
ggcagaaggc 
ggcaacgatg 
ggtgccacgg 



gaaacgcttt 
actcctgggg 
tcttgggttc 
gagtctaatt 
cgacgacatg 
gggacgtact 
aacccaaata 
tgtaaaaact 
tattgaaaat 
ttctttctgt 
aaggtataag 
agctacctaa 
gaaagttttt 
aagcattttt 
attgcataag 
cttagattct 
tggggaattt 
caccagcagg 
ttccccacca 
tttatgctaa 
gaaatgccga 
gccgaattcc 
cacctttttt 
tatcccagaa 
ttgcctcagt 
tttattgcct 
ggctaactag 
caccgacaac 
tttggatcct 
gaggtggttt 
ctatctatta 
atatttttca 
atcctgggca 
tcccccacga 
tatataccct 
ctcagctaca 
accatttcca 
ccgcaaaaga 
tatgtggtgt 
tatgtcgatc 
gtaatcgatt 
accagtgcac 
accaccgctg 
taccacagcg 
agtgtagtga 
ctggccttac 
ggtgccagta 
aatagcaaaa 
gtgctactcc 
gtgctcggag 
atgcataacg 
atgccttttg 
gcatcaactt 
tctttcgtcc 
ctcgagctga 
tggtgatggc 
gtcgtggtcg 
gctcctccgg 
cggtggaacc 
agaatcagct 



ccgaatgccg 
aatttacaac 
gtcgtcgcag 
tatggccttt 
aatgaagtcc 
ggctcaaagt 
ccgagaaaac 
acatcagttt 
gcatataaaa 
gtaatatggg 
ttcagaatcg 
aataatagag 
tcagttggtt 
cgggcaaaaa 
accacctgca 
ccaataaaac 
cgcacgacag 
acgcacacgt 
gtcagccgca 
gtctaatttg 
aagccaaaac 
ctaaaacttt 
tttagcctca 
gggggcggtg 
gttgcaaacg 
acttttgctg 
agcataataa 
atttcgattt 
tgcagagagt 
ttcaaacaat 
taccatcttc 
tttttttcag 
ccatcgcgaa 
atgccatact 
acaccataca 
gctgggcctg 
tttggctgac 
atcgcgtatg 
gtgttctggt 
agttggatat 
atcgtaatga 
cactgaacga 
caccacccac 
atttggcgtt 
ttaagctgat 
tggaggccaa 
atggaaccaa 
ctctggaaaa 
tctttctcat 
atgtcttcta 
ccacagtaaa 
ccttttgcca 
cattttgtac 
aaaattcctg 
acgctctgcg 
cagcacgacc 
ccgaactttg 
cgaaggagga 
agcgttccag 
gtacaccgcc 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4 920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
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gagcaagctc 
tccttgtaaa 
ctttggtggg 
atacgcaccg 
gctgtgcata 
gttggcttca 
atctatcttc 
attttcccca 
aatatagtaa 
ttcgcataaa 
gagcagaata 
gtatctttaa 
ggaaaaataa 
attattggaa 
gaattcagtt 
cagctgaccg 
cgaaaattgt 
cata 



gtattgtgac 
aagtaaacta 
aaatcgtaaa 
tttgacttca 
aacgagctcg 
caacatagat 
atatttaact 
cttgacccaa 
acttttgcat 
gtatgcagca 
tttgccattt 
taatggcaat 
gtggcagttg 
atcgcaattt 
ccgtttcgtg 
aataatccaa 
aattgcaaat 



gtagtaaccc 
gttgtgtaag 
acccgtacag 
ctctattcca 
tatacgctac 
caaagtcaaa 
aacacaattc 
ttgtgcgcaa 
cttttccact 
cacgtactca 
gttgtacaga 
taagtcctag 
cagtgcggtg 
gggctgtgaa 
ggttactgaa 
tgaaacattg 
cgcttaagcc 



tgatagtagt 
aatcgcagat 
tcataataat 
cattcacagt 
agctggcgtc 
aaatatatag 
agaaactttc 
ctgttcagct 
ttgatcccat 
ctcaaaacta 
ttatatatac 
aaattttgac 
cgttgataaa 
tgaaattata 
tttatgggaa 
cataaacagt 
ccacgacgaa 



acatattata 
aaccacaagt 
aaaatgctgg 
caagtaggag 
acaataatag 
ttttgtaaaa 
gcttgcatgg 
tttcggcatg 
gcgttttcat 
cgaattttcg 
acacacacac 
ctgatattga 
tgaaatactt 
gtt ttcttga 
ataatttaag 
ttgcagtttt 
tgacaaatgg 



tgcctagtta 
tatgtgcata 
ttcatcactc 
ctgtaaaaat 
ttaccattgg 
tgtttcataa 
aaaatgtttt 
aatcaaatca 
gttgatttca 
gcaacgattt 
acttatgtcc 
tgcaggcatt 
tgcgaattct 
acgcaattga 
gcagtcactg 
tccacacttg 
ccagcgaacg 



6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7564 



<210> 107 
<211> 1757 
<212> DNA 
<213> Drosophila 



<400> 107 

ggaaggcggc 

cgggcggcaa 

ttgctgttgc 

gcatggataa 

gcatcctggg 

gctcccccac 

agtatatacc 

agctcagcta 

acaccatttc 

atccgcaaaa 

cttatgtggt 

aatatgtcga 

acgtaatcga 

ccaccagtgc 

caaccaccgc 

tataccacag 

acagtgtagt 

tcctggcctt 

cgggtgccag 

aaaatagcaa 

cggtgctact 

ctgtgctcgg 

tcttggccct 

tccgcaccac 

cgcaggacga 

agggacgaca 

gcctgctgag 

gaggaggtgg 

tggtggtggt 

ctcgtattgt 



actgcacggc 
tcacaaagga 
aacaatggcc 
ttttcataca 
caccatcgcg 
gaatgccata 
ctacaccata 
cagctgggcc 
catttggctg 
gaatcgcgta 
gtgtgttctg 
tcagttggat 
ttatcgtaat 
accactgaac 
tgcaccaccc 
cgatttggcg 
gattaagctg 
actggaggcc 
taatggaacc 
aactctggaa 
cctctttctc 
agatgtcttc 
gatcaactcc 
gttcacgctg 
aatggcagct 
gcagccgcag 
tgtcctgaaa 
tgcgccgttg 
ggacaaggtg 
gacgtag 



tgtccttggc 
cttgcaacag 
agtggcaaca 
agctacaaga 
aataccttga 
ctcacgggtc 
cacgactaca 
tgcttcatca 
acggtgaccc 
tggtgcggta 
gtggtgtcgc 
atgaatggca 
gagttactga 
gaaactgtgt 
acgccatcgc 
ttgcacaatg 
ataccatgca 
aagcggcggc 
aaatcaccgg 
aaggaaaagc 
atcactgaat 
tatctgcagt 
agcatcaact 
ctctttcgtc 
gctcgagctg 
gtgcacagga 
cgcggcagaa 
gccggcaacg 
agcggtgcca 



agcaaatcca 
ttgccgttgc 
atgaaactga 
acatgcatgg 
atatcattgt 
tggccgtggc 
tcctgacgga 
agttccattc 
tggctgtttg 
tgagaaccac 
cgtcgctcta 
aagtgataaa 
gtgccaggac 
ggttaaatgc 
cagtggtgcg 
cctcgctgca 
tagcactcac 
ggaagaagct 
ccaatggtaa 
agacggatcg 
ttccacaagg 
gctacctaag 
tcattttgta 
caaaattcct 
aacgctctgc 
ggagtcgtgg 
ggcgctcctc 
atgcggtgga 
cggagaatca 



gttgcaacat 
ttctgctatt 
gccgctctac 
ctatgtttcg 
gctaacccga 
cgacctggca 
cagtttgccg 
gattttcgcc 
gcgttatata 
cataataacg 
tttgatcacg 
ctccattccc 
ggctgccctg 
gagcaccttg 
aaatgttact 
aaatgccaca 
cattctgtcg 
caccagcaag 
agcagcggat 
caccacgaga 
gattatgggt 
actgagtgac 
ctgctccatg 
ggacaagtgg 
ggtggcaccg 
tcgccgaact 
cggcgaagga 
accagcgttc 
gctgtacacc 



cggtgagtga 
gctgttgctg 
tgcggcagcg 
ctggtggtct 
cgggagatgc 
gttatgctgg 
cgggaggaga 
caggttctgc 
gcggtgggtt 
ataaccaccg 
gctataaccg 
atgacccagt 
aatgccacgc 
ctgacatcga 
gtctataggc 
tttctcatat 
gttcgattga 
cccgccactc 
aggccccgga 
atgctgctgg 
ctgctgaatg 
ctgatggata 
agcaagcaat 
ctgccggtgg 
gtcctggaaa 
ttgctcagtc 
ggaggcgtgg 
caggccatcg 
gccgagcaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1757 



<210> 108 
<211> 540 
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<212> PRT 

<213> Drosophila 



<400> 108 



t w ie c 


A 1 ^ 


OCX 


C 1 \/ 


A c n 


Bon 


VJ X \J 


Thr 


O X u 


Pro 


Leu 


Tvr 


Cvs 


Gly 


Ser 


Gly 


1 
1 








CL 
J 










10 ■ 










15 






Asp 


Asn 


tr lie 


n x o 


Th r 
1 1 1 X 




Twr 

iyr 


T 

Ly s 


Sen 


Met 


His 


Gly 


i yj. 


Val 


OCX 


















9 s 










30 






Leu 


Ua 1 
vax 


\/a 1 
vol 


P \7 C 


Tip 
lie 


Leu 


ox y 


Thr 

X I IX 


lie 


Al P 
t\ x a 


Asn 


Thr 


Leu 


Asn 


X X ti 


T1p 

lie 






J -J 










40 










4 5 








v a J. 


Leu 


T K v 

j. nr 


Arg 


Arg 


p. i ,, 


Mdf 


Arg 


Qor 

oer 




Thr 

1 i IX 


A c n 


Ala 

r\X a 


Tie 

IXC 


Leu 


Thr 
1 III 














j j 










fin 










oly 


Leu 


Ala 


vax 


hi a 


Asp 


Leu 


a i a 

Aid 


vax 


we u 


Leu 


Pill 

oXU 


i yr 


x xe 


Pro 


Tyr 












/ u 










7 ^ 










on 
0 u 


l nr 


T 1 * 

lie 


HIS 


Asp 


Tyr 


x xe 


Leu 


j. nr 


Asp 


oer 


Leu 


r ro 


Arg 




ulu 


Lys 










D 3 




















.7 J 




Leu 


oer 


Tyr 


Ser 


Trp 


hi a 


Cys 


Dh 

rne 


Tin 

x xe 


Lys 


tr I lc 


fll o 


oer 


x xe 


irne 


Ala 

Aia 








1 nn 
1 uu 










i ns 










inn 

X X u 






bin 


v ai 


Leu 


HIS 


TK 

i nr 


Tin 

lie 


oer 


Tl « 

116 


Trp 


Leu 


i nr 


Val 


Th r- 

i nr 


Leu 


Ala 

Aia 


v ai 






1 1 c 

1 1 D 










1 9H 
liU 










XZ D 








Trp 


Arg 


Tyr 


Tin 

lie 




\7a 1 

vax 


uiy 


Tyr 


Pro 


PI n 

oxn 


Lys 


Asn 


Arg 


V cl X 


Trp 


Cys 




1 

1 JU 










1 "5 ^ 
1 jD 










i a n 

X fJ 1J 










oiy 


fjt n t- 

Mec 


Arg 


Th v 

l nr 


Th v 

i nr 


x xe 


x xe 


Th r- 

i nr 


x xe 


Th r 

i nr 


Th r- 
1 fix 


Ala 

ax a 


Tyr 


V cl X 


\/a 1 
Val 




X H D 










i ^n 










1 SS 
1JJ 










X ou 


vai 


Leu 


v ai 


v ai 




Pro 


Ser 


Leu 


Tyr 


Leu 


Tlo 

x x e 


Th r 
I fix 


Ala 
AX a 


Tic 

x xe 


Th r 

i nr 


Pill 
ol u 










ID J 










i 7 n 

X / \J 










17c: 

LID 




Tyr 


vai 


Asp 


bin 


Leu 


Asp 


Mor 


Asn 


oxy 


Lys 


Ua 1 
Val 


Tlo 

lie 


Ann 
Ao 1 1 


OCX 


Tlo 
lie 


Dro 

trro 








1 on 
X o u 










IRS 

10 J 










1 Qf) 

X 3 VJ 






Met 


l nr 


P 1 n 


Tyr 


\/a 1 

vax 


Tin 

x xe 


Asp 


Tyr 


Arg 


Asn 


Pin 
olU 


Leu 


Leu 


oer 


Ala 

Aia 


Arg 






i i?j 










9nn 










one: 








Thr 


Ala 


Ala 


Leu 


Asn 


Axa 


TK v 

i nr 


Pro 


TK v- 

i nr 


Ser 


Aia 


Pro 


Leu 


Asn 


Pi 11 
olU 


TK ■»- 
1 nr 




91 n 

£. 1 U 










Z X J 




















vai 


Trp 


Leu 


Asn 


ax a 


Ser 


i nr 


Leu 


Leu 


Th v 

i nr 


oer 


Th v 

i nr 


Thr 


Thr 


nla 
Aia 


Ala 

Aia 


9 9 










9 ^n 










9 ^ s 










9 4 n 


Pro 


Pro 


tk v- 

i nr 


Pro 


oer 


Pro 


vax 


vax 


Arg 


Asn 


vai 


i nr 


Val 


Tyr 


Arg 


Leu 










t *> J 










9sn 










0 ss 




Tyr 


MIS 


C e\ -r~ 

oer 


Asp 


Leu 


ax a 


Leu 


nis 


Asn 


a l a 
ax a 




Leu 


Gin 


Asn 


Ala 

Aia 


Th y~ 

1 nr 








9 £n 










9 A"^ 










270 






rne 


Leu 


lie 


Tyr 


oer 


vax 


vax 


T 1 o 

x xe 


Lys 


Leu 


Tlo 

x xe 


tr ro 


Cys 


He 


Ala 

Aia 


Leu 






<£ / 3 










9Rn 










285 








i nr 


T 1 a 

i le 


Leu 


oer 


\7a 1 
V a X 


Arg 


Leu 


x xe 


Leu 


Ala 


Leu 


Leu 


Glu 


Ala 


Lys 


Arg 




?Q0 
£ y\j 










295 










300 










Arg 


Arg 


Ly s 


Ly s 


Leu 


Th r 
1 tlx 


Cor 


Ly s 


Pro 


AT a 


Thr 

1 1 IX 


Pro 


Gly Ala 




A Q n 


-J VJ J 




















^1 S 










■J C.\J 


P, 1 \/ 


Thr 


Ly s 


Ot.1 


Pro 
irx u 


Ala 


As n 


ftl V 

ox y 


Lys 


Ala 


Ala 


Asp 


Arg 


Pro 


Arg 


Lys 










325 










330 










335 




Sen 


Cor 


Ly s 


Thr 

1 1 1 X 


Le u 


ftl n 


y s 


ox u 


Lys 


Gin 


Thr 


Ago 


Arg 


Thr 


Thr 

1 I IX 


A r rr 








J *i U 










345 










350 






Met 


Leu 


Leu 


Axa 


vai 


Leu 


Leu 


Leu 


Phe 


Leu 


T1 Q 

x xe 


TK v 

i nr 


Glu 


Phe 


Pro 


Pl i-> 
bin 






355 










360 










365 








Gly 


lie 


Met 


Gly 


Leu 


Leu 


Asn 


Ala 


Val 


Leu 


Gly 


Asp 


Val 


Phe 


Tyr 


Leu 




370 










375 










380 










Gin 


Cys 


Tyr 


Leu 


Arg 


Leu 


Ser 


Asp 


Leu 


Met 


Asp 


He 


Leu 


Ala 


Leu 


He 


385 










390 










395 










400 


Asn 


Ser 


Ser 


He 


Asn 


Phe 


He 


Leu 


Tyr Cys 


Ser 


Met 


Ser 


Lys 


Gin 


Phe 










405 










410 










415 




Arg 


Thr 


Thr 


Phe 


Thr 


Leu 


Leu 


Phe 


Arg 


Pro 


Lys 


Phe 


Leu 


Asp 


Lys 


Trp 








420 










425 










430 






Leu 


Pro 


Val 


Ala 


Gin 


Asp 


Glu 


Met 


Ala 


Ala 


Ala 


Arg 


Ala 


Glu 


Arg 


Ser 






435 










440 










445 
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Ala 


Val 
4 50 


Ala 


Pro 


Val 


Leu 


Glu 

ACQ 
403 


Lys 


Gly 


Arg 


bin 


bin 
4 60 


Pro 


lain 


v a i 


MIS 


Arg 


Arg 


Ser 


Arg 


Gly 


Arg 


Arg 


Thr 


Leu 


Leu 


ber 


Arg 


Leu 


Leu 


Ser 


\I -i Y 

val 


4 65 










470 










475 










480 


Leu 


Lys 


Arg 


Gly 


Arg 
485 


Arg 


Arg 


Ser 


Ser 


Gly 
490 


Glu 


Gly 


Gly 


Gly 


Val 
495 


Gly 


Gly 


Gly 


Gly Ala 


Pro 


Leu 


Ala 


Gly 


Asn 


Asp 


Ala 


Val 


Glu 


Pro 


Ala 


Phe 








500 










505 










510 






Gin 


Ala 


He 


Val 


Val 


Val 


Val 


Asp 


Lys 


Val 


Ser 


Gly Ala 


Thr 


Glu 


Asn 






515 










520 










525 








Gin 


Leu 
530 


Tyr 


Thr 


Ala 


Glu 


Gin 
535 


Ala 


Arg 


He 


Val 


Thr 
540 











<210> 109 
<211> 5251 
<212> DNA 
<213> Drosophila 



<400> 109 

taaccttctt 

ctgtgaattt 

cacccacagt 

ggaatcgctg 

tgagtactcc 

tattctgaaa 

agcagtgact 

gtcgcttcca 

tagcatatgt 

tatttaacca 

taacttcaac 

atacataagc 

taattaacta 

cctaatctct 

cgtttccgaa 

gagtatttcc 

gactttcctg 

ccagaaggga 

ccatgggcgg 

gacgggcggc 

ccgatctcag 

gaagtacaga 

gtcccttagc 

atgccgtgcc 

cgatcacaaa 

agtgagcgct 

catttgtggg 

gatcctgatt 

tgaccaaagc 

gaacagagct 

catcctggct 

caggagcagg 

aacccggagc 

cctccactat 

ccgtttggtt 

agttcttcag 

cgagagttat 

catagaggcg 

tggccaccga 

tcaggagtaa 



ttgaccttcg 
gcccagcaga 
gggtcggttc 
gagaagcgtt 
aggcggcatt 
ttatataaac 
ccaaccatct 
aaaagctctg 
aaaagaaaaa 
acaaaaataa 
tttgaaattc 
tgacatatac 
gtccgtagaa 
atatttcact 
agagagccta 
tcaggggtca 
ctgatcccgc 
cggcgaggag 
tggtgatgga 
gtgaggctgg 
cggactaaaa 
ggactcctga 
gatggagcta 
ctttcgatac 
gataccggat 
attggagttg 
agatgtttga 
gttgcccatt 
cagggagcga 
ggccggcacg 
gaaactgtag 
aggagctggt 
gggggaaggg 
gttccacatt 
gatgttgttg 
ggcctcctgc 
agggattttg 
cggctgcgtg 
gaagcggtgg 
gttcttcagc 



taagtcttcc 
gcttccgaaa 
tggtccatgg 
actgatgcaa 
tctattgtgg 
ggcttgttgg 
gatggggaac 
gaaaagcttt 
cttgtatacc 
aacacaacat 
atgtcgcatg 
acagtaaata 
acgtattatg 
accgaactaa 
agagagctgg 
ccccttaccc 
ttcttttccg 
gtggtcagac 
gtgggcgtgg 
aggaagtgaa 
aagttctctg 
tccctgcaga 
cagttgcagg 
ttaggcttgt 
tgctcctggg 
ccagcaagca 
gtggagagcg 
ggcctggagt 
cgcagttcca 
aatacgtcct 
ttgtggttgt 
gggactgtgg 
aactcgtagc 
tctcgctcga 
aaacagggcg 
tgctgcctca 
ccacagttcc 
gagcgcatat 
attgcctccg 
gcggccctga 



actggcactt 
cgacggcttt 
tggggatggt 
tgcactggat 
ttacaatagt 
ggttttaaaa 
tttgagcaag 
tatcaacagg 
ctaccaatag 
gaatgtacat 
gcttaacttg 
ctctaacaac 
accatacact 
ttgatttaga 
aaagtactta 
ctttcgcatc 
accgcgaagg 
ccactggaac 
aggccgtgga 
acaagttgga 
tgggatggtt 
gattactgct 
atcggctctt 
gcccatttga 
gagcaggact 
gtttggccct 
gtctctccct 
ccgtgtcatc 
cgggatctat 
caaactccag 
cataggaggc 
aactgctatt 
tggtttcacc 
gatctgtggt 
acaccactcc 
accaaatggg 
aaggagacaa 
tggagtccat 
cctgctgatc 
agtccttgag 



gaaaagcttt 
tagtacctcc 
gtggtggttc 
accgttcctg 
aagtaaggag 
atgttgccgg 
tgagctccaa 
cttgccaaaa 
gcatacatcg 
tctggtttaa 
gttttcacag 
attaaacata 
caaaataaat 
tccgatatag 
acgtaaaagc 
cttgtgatcc 
tcaactctcc 
cgattcctcc 
cgacgccgtc 
atgctgcatg 
ggctggctgc 
gggctcgtcg 
cttcagcgaa 
gccgggactc 
cgccgtggat 
cagaagagcc 
ggactgttca 
cgatcgcaat 
gccaggctga 
atcatcatca 
ggacggaact 
tcgttcactg 
cagctcggcg 
caccgaagcc 
gcacttgggc 
tgaatctgca 
cgcgcactca 
ggactgcagg 
aatgtccacg 
gtgataggca 



gggaaaacag 
gccgtaacac 
cgatgatacg 
ttcagatcat 
tagtaaaagt 
ttttagattt 
tagttttggc 
tccgctatac 
catcaggact 
tcgctttatt 
acgttcaggt 
tgtatgtaca 
gcgcatagct 
aatttgacaa 
tttgacttgc 
tcttccagtg 
gcgctggtcg 
tgagcctggc 
gatgaggtgg 
aaactgccca 
ggatgctgtg 
tggacccctc 
ctccgagtga 
gcctcactat 
cctggcaggc 
ggaagtgggc 
tctatgggaa 
ttctccctca 
acgggatttg 
tcttcgtcct 
gagggcttcg 
ctgcgctggg 
ccactggagg 
acggcggcca 
aggagctggg 
gggaatgtta 
cactcaccca 
cgatgctgac 
cccatcatct 
tacaggaccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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gatttaccgc 
catagcagtc 
tcataaagaa 
cggcgcccaa 
gcaccacggc 
gacggacctt 
ataaaaaact 
gatgtgcgtg 
gaagacgagg 
atcggcgtgc 
ggagatgatg 
cttgaagtgg 
gccaccaagc 
agggtgagtg 
ggaaagtaat 
tgcttagaaa 
aaacagttgc 
ctagtaagac 
gcgtccattc 
catggtcgat 
tatcacggca 
caaatatcct 
accattttgt 
tatcgccgtg 
tcgatccacg 
ggagactgtg 
gaagggaata 
gttggtgcgg 
gatgatgctg 
actggggctg 
aggtccctcg 
tgttgcgggt 
gtgccgagct 
gtgaactccc 
ctgcgcattg 
cttttccgat 
ttttctgctg 
cggcacgttt 
tgataaatat 
acaaaggccg 
atttgaaagg 
gaaatatgtt 
gtatatatac 
aatacaaagc 
cgaaccgggc 
cgcagaaaca 
aggggtttac 
aagatagatg 



cgagttcaga 
gggacagaag 
cagcacgata 
gacccgcagc 
cgccgaggag 
taaagttcaa 
ttactggcga 
tagaaggcca 
tagttgtagt 
cagccgaaaa 
actgcaacgg 
agctccccca 
accataaaaa 
cacttcccaa 
cttgccttga 
attatatatt 
tataaacatc 
tattatgtta 
gtggtagttc 
ggccggaatg 
acaaaggcac 
gacaagtagg 
acctactgat 
cgggtgcgga 
gacacggcca 
aagaggcagg 
cccaatgcgc 
cgtcgcaact 
acgatggcca 
tccgagtcct 
aaggagaaat 
attttacggg 
aatgaaaggg 
taattgactt 
cgtatacgcc 
tcgctttttt 
tcgacgtacc 
gttacacaca 
ttgccaaaaa 
tatcacaact 
ggccatcagt 
ggaacttcga 
ccgcaactct 
tcgaaaagca 
gctgcagaac 
gacgtcccat 
gaaaataatt 
atgtattgct 



tgggacagaa 
gccttgatgc 
atggacagat 
atggttcccg 
cgacgactga 
ttgaaaacgg 
aaatctatct 
gcatcagcag 
ccatcacctc 
gcggcaggaa 
aagaggaaca 
gacgtcattt 
cttttatcga 
atgacgcaaa 
gcggccattt 
tatgagatat 
taatatgaaa 
cggattttaa 
aaatgaggcg 
aaacacaatg 
aatactaaac 
tgaaaccaga 
tttatacaat 
cattccttga 
ccagacagaa 
catgaaggtt 
ccactagcaa 
ttctttcccg 
ccaggacctc 
tggcatcctt 
cggtgatcga 
ttccacgggg 
cggaacaggt 
gaggtggccg 
atgcccgccg 
cgctctcgct 
ctattaagct 
tgcagcaact 
atacatgtcc 
atgcaatacc 
tgcaggtctt 
tggcttttac 
gcgagtacac 
aacaactccc 
ataataaaag 
aaataaaaga 
catattcaat 
cctaaaaact 



tgatgcagaa 
agttgatcgt 
tctgggtggc 
tgtgtccacc 
gatcggaggc 
atagttaata 
accactcacc 
agccggagtg 
cacgaagaga 
gcccactatc 
ttggtaatca 
gagcagattg 
attgcgggaa 
taattaccaa 
ataaagcact 
ttttagctag 
attcttagaa 
ggatgtttat 
tcttttgttt 
tgctgtatat 
aatggacaac 
aaactggtcc 
acattgtata 
gtaggccatc 
gatggagatg 
tctgggtagt 
atcggccatg 
gcggaagacg 
gaagacggtg 
gctcgtggtg 
gaagtagcga 
ttccctgccc 
gtgggcttca 
tggcagtagc 
cgctgcggtt 
ctttattttc 
ggaaaaaaaa 
taattgtagc 
atattgataa 
ctagataatc 
ccttaaagcc 
gatgatggtg 
actgccacac 
gactagcaac 
tgtataaaaa 
gcatcctaca 
tcaaaagtat 
c 



gagcgtcagc 
atagagcggt 
cttgacgtcc 
tcttccgggt 
ggggttcatc 
ttccaaactc 
tgtttgatga 
ataatggtgg 
cactcctggt 
gtcccggcca 
gagtgccctg 
caggaatccc 
actaaacaaa 
ttttccggac 
atatagtagg 
aggccaatta 
tcaatttgaa 
tactgtgtac 
aactcaagtc 
tagccatcca 
ttaaagtcga 
tttaaagtgg 
tacccccata 
ggatatagga 
gtacacagca 
cccatggagg 
gccagggata 
atgatcacca 
tagggtatgt 
gccgcgtgca 
aacgcggaca 
cgatcgagta 
attacagtgc 
agcgatagta 
gctcctttcc 
ctttccgttg 
gggaaaactt 
aattttttag 
cgtgtgcggc 
ggccagtact 
agcagctcaa 
tttctggttt 
tatctgattc 
aaaatcaatt 
ggacggaaca 
cagagaaaaa 
tcgctttttt 



ttgggatgca 
atccagcaga 
cgcttccttg 
gtcgtcacct 
gtaacaatct 
tatggaatat 
tgacccggta 
caaaatagag 
tgtgattgac 
cccagcacat 
aactgaagcc 
tgccgtcttc 
actggcaggc 
gtagcatatt 
tgatatatag 
ttctgattcc 
tctagttagg 
tgttctcatt 
gcaactcaaa 
gagtcattag 
aatagtttga 
gttttagcat 
tatacttaca 
tggcccagta 
ccacgagcag 
ccaggatggc 
ctatgtagta 
gcacgttgcc 
tcagctcgga 
gcggtagcag 
tggtgggcgg 
tttaaggcta 
ccgctggcct 
atagcacatg 
cgctttctcg 
atgtatttgt 
aatcatcgtt 
tgtaataaat 
gagcaaaaca 
tgagtatatt 
ggatcacagt 
ggcttatggg 
gcgtgcggca 
gccagacaat 
gcgagaacaa 
agaattaggc 
tgtaattatt 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5251 



<210> 110 
<211> 2325 
<212> DNA 
<213> Drosophila 

<400> 110 

atgtccgcgt ttcgctactt ctcgatcacc gatttctcct tcgagggacc tctgctaccg 60 

ctgcacgcgg ccaccacgag caaggatgcc aaggactcgg acagccccag ttccgagctg 120 

aacataccct acaccgtctt cgaggtcctg gtggccatcg tcagcatcat cggcaacgtg 180 
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ctggtgatca 
gtatccctgg 
gcctccatgg 
gtgctgtgta 
atcctatatc 
tccatgtgct 
gatgtcaatc 
ttcctctatt 
atctaccggg 
ctcagtcgtc 
acgggaacca 
aatctgtcca 
tgcatcaagg 
attctgtccc 
ttcagggccg 
gcggaggcaa 
aatatgcgct 
cagcagcagg 
tgtttcaaca 
cgagaaatgt 
gagttccctt 
gtcccaccag 
cacaactaca 
gtattcgtgc 
ctgcgtcgct 
aggccaatgg 
tccactcaaa 
gctggcaact 
gagcaatccg 
cctaagtatc 
aactgtagct 
agggatcagg 
aactttttta 
ttcctccagc 
cccactccat 
accacctcct 



tcgtcttccg 
ccatggccga 
gactacccag 
ccatctccat 
cgatggccta 
gggtggccgg 
acaaccagga 
ttgccaccat 
tcatcatcaa 
gctcctcggc 
tgctgcgggt 
ttatcgtgct 
ccttctgtcc 
atctgaactc 
cgctgaagaa 
tccaccgctt 
ccacgcagcc 
aggccctgaa 
acatcaacca 
ggaacatagt 
cccccgctcc 
ctcctcctgc 
gtttcagcca 
cggccagctc 
ccctggcttt 
gcaacaatca 
catctcccac 
ccaatagcgc 
gtatctttgt 
gaaagggcac 
ccatcgctaa 
agtcctctgt 
gtccgctgag 
ctcacgccgc 
caccaccgcc 
cgccgtccct 



ccgggaaaga 
tttgctagtg 
aaaccttcat 
cttctgtctg 
ctcaaggaat 
gacgatagtg 
gtgtctcttc 
tatcactccg 
acaggtccgt 
ggccgtggtg 
cttgggcgcc 
gttctttatg 
cgactgctat 
ggcggtaaat 
cttactcctg 
ctcggtggcc 
gcgcctctat 
gaactcccag 
aacggtggcc 
ggaggcctcc 
gggttcccag 
tcctgcgaag 
ggatgaggac 
tgttccaaat 
ggtcatgagg 
ggatcttccc 
aaatggccca 
tcactgcctg 
gatcgatagt 
ggcattcact 
gggacgaggg 
acttccacag 
atcggtgggc 
ccgtcccacc 
catgggccag 
tctggcgacc 



aagttgcgac 
ggcgcattgg 
gcctgcctct 
gtggccgtgt 
gtccgcaccc 
ggcttcctgc 
gtggaggtga 
gctctgctga 
cagattgtta 
caggtgacga 
gcaaggaagc 
atctgctgga 
gtgcatccca 
ccggtcctgt 
aagatgatgg 
agtcagcatc 
gtgggtgagt 
ctcctgccca 
gccgtggctt 
agtggcgccg 
cgcagcagtg 
ccctcagttc 
gaagatgatg 
cccgttcagc 
gagaaattgc 
atagatgaac 
cttccggctc 
ccaggatcca 
gaggcgagtc 
cggagttcgc 
gtccacgacg 
catccgcagc 
agtttcatgc 
tcatcgacgg 
gctcaggagg 
agcgcggaga 



gccgcaccaa 
gtattccctt 
tcacagtctc 
ccgtggatcg 
gcacggcgat 
cgcttttcgg 
tggactacaa 
tgctggcctt 
cgatgaaccc 
cacccggaag 
gggacgtcaa 
taccgctcta 
agctgacgct 
atgcctatca 
gcgtggacat 
gcctgcagtc 
attcacccat 
agtgcggagt 
cggtgaccac 
agctgggtga 
aacgaaatag 
cgtccgcctc 
atgatctgga 
ctggcataga 
gatcggatga 
agtccaggga 
ttctgagggc 
cggcgagtcc 
ccggctcaaa 
tgaagaagag 
agcccagcag 
cagccaacca 
agcattccaa 
cgtcgtccac 
aatcggttcc 
gttga 



ctactacata 
cgccatcctg 
cctgctcgtg 
atactgggcc 
attcatcatc 
ctggcacgcc 
ctacctcgtc 
ctacacgcac 
cgcctccgat 
aggtggacac 
ggccacccag 
tacgatcaac 
cttctgcatc 
cctcaaggac 
tgatcagcag 
catggactcc 
ttggttgagg 
ggtgtcgccc 
agatctcgag 
aaccagctac 
cagttccaca 
ctatgacaac 
gtttgaggac 
tcccgtggaa 
cacggactcc 
gagaccgctc 
caaactgctt 
tgctccccag 
tgggcacaag 
ccgatcctgc 
taatctctgc 
tcccacagag 
cttgtttcac 
ggcctccacg 
agtgggtctg 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2325 



<210> 111 
<211> 774 
<212> PRT 
<213> Drosophila 



<400> 111 



Met 


Ser 


Ala 


Phe 


Arg 


Tyr 


Phe 


Ser 


He 


Thr 


Asp 


Phe 


Ser 


Phe 


Glu 


Gly 


1 








5 










10 










15 




Pro 


Leu 


Leu 


Pro 


Leu 


His 


Ala 


Ala 


Thr 


Thr 


Ser 


Lys 


Asp 


Ala 


Lys 


Asp 








20 










25 










30 






Ser 


Asp 


Ser 


Pro 


Ser 


Ser 


Glu 


Leu 


Asn 


He 


Pro 


Tyr 


Thr 


Val 


Phe 


Glu 






35 










40 










45 








Val 


Leu 


Val 


Ala 


lie 


Val 


Ser 


He 


He 


Gly 


Asn 


Val 


Leu 


Val 


He 


He 




50 










55 










60 










Val 


Phe 


Arg 


Arg 


Glu 


Arg 


Lys 


Leu 


Arg 


Arg 


Arg 


Thr 


Asn 


Tyr 


Tyr 


He 


65 










70 










75 










80 


Val 


Ser 


Leu 


Ala 


Met 


Ala 


Asp 


Leu 


Leu 


val 


Gly 


Ala 


Leu 


Gly 


lie 


Pro 










85 










90 










95 




Phe 


Ala 


lie 


Leu 


Ala 


Ser 


Met 


Gly 


Leu 


Pro 


Arg 


Asn 


Leu 


His 


Ala 


Cys 








100 










105 










110 






Leu 


Phe 


Thr 


Val 


Ser 


Leu 


Leu 


Val 


Val 


Leu 


Cys 


Thr 


He 


Ser 


He 


Phe 






115 










120 










125 








Cys 


Leu 


Val 


Ala 


Val 


Ser 


Val 


Asp 


Arg 


Tyr 


Trp 


Ala 


He 


Leu 


Tyr 


Pro 
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130 



135 



140 



Met Ala Tyr Ser Arg Asn Val Arg Thr Arg Thr Ala lie Phe lie lie 
145 150 155 160 

Ser Met Cys Trp Val Ala Gly Thr lie Val Gly Phe Leu Pro Leu Phe 

165 170 175 

Gly Trp His Ala Asp Val Asn His Asn Gin Glu Cys Leu Phe Val Glu 

180 185 190 

Val Met Asp Tyr Asn Tyr Leu Val Phe Leu Tyr Phe Ala Thr He He 

195 200 205 

Thr Pro Ala Leu Leu Met Leu Ala Phe Tyr Thr His He Tyr Arg Val 

210 215 220 

He He Lys Gin Val Arg Gin He Val Thr Met Asn Pro Ala Ser Asp 
225 230 235 240 

Leu Ser Arg Arg Ser Ser Ala Ala Val Val Gin Val Thr Thr Pro Gly 

245 250 255 

Arg Gly Gly His Thr Gly Thr Met Leu Arg Val Leu Gly Ala Ala Arg 

260 265 270 

Lys Arg Asp Val Lys Ala Thr Gin Asn Leu Ser He He Val Leu Phe 

275 280 285 

Phe Met He Cys Trp He Pro Leu Tyr Thr He Asn Cys He Lys Ala 

290 295 300 

Phe Cys Pro Asp Cys Tyr Val His Pro Lys Leu Thr Leu Phe Cys He 
305 310 315 320 

He Leu Ser His Leu Asn Ser Ala Val Asn Pro Val Leu Tyr Ala Tyr 

325 330 335 

His Leu Lys Asp Phe Arg Ala Ala Leu Lys Asn Leu Leu Leu Lys Met 

340 345 350 

Met Gly Val Asp He Asp Gin Gin Ala Glu Ala He His Arg Phe Ser 

355 360 365 

Val Ala Ser Gin His Arg Leu Gin Ser Met Asp Ser Asn Met Arg Ser 

370 375 380 

Thr Gin Pro Arg Leu Tyr Val Gly Glu Tyr Ser Pro He Trp Leu Arg 
385 390 395 400 

Gin Gin Gin Glu Ala Leu Lys Asn Ser Gin Leu Leu Pro Lys Cys Gly 

405 410 415 

Val Val Ser Pro Cys Phe Asn Asn He Asn Gin Thr Val Ala Ala Val 

420 425 430 

Ala Ser Val Thr Thr Asp Leu Glu Arg Glu Met Trp Asn He Val Glu 

435 440 445 

Ala Ser Ser Gly Ala Glu Leu Gly Glu Thr Ser Tyr Glu Phe Pro Ser 

450 455 460 

Pro Ala Pro Gly Ser Gin Arg Ser Ser Glu Arg Asn Ser Ser Ser Thr 
465 470 475 480 

Val Pro Pro Ala Pro Pro Ala Pro Ala Lys Pro Ser Val Pro Ser Ala 

485 490 495 

Ser Tyr Asp Asn His Asn Tyr Ser Phe Ser Gin Asp Glu Asp Glu Asp 

500 505 510 

Asp Asp Asp Leu Glu Phe Glu Asp Val Phe Val Pro Ala Ser Ser Val 

515 520 525 

Pro Asn Pro Val Gin Pro Gly He Asp Pro Val Glu Leu Arg Arg Ser 

530 535 540 

Leu Ala Leu Val Met Arg Glu Lys Leu Arg Ser Asp Asp Thr Asp Ser 
545 550 555 560 

Arg Pro Met Gly Asn Asn Gin Asp Leu Pro He Asp Glu Gin Ser Arg 

565 570 575 

Glu Arg Pro Leu Ser Thr Gin Thr Ser Pro Thr Asn Gly Pro Leu Pro 

580 585 590 

Ala Leu Leu Arg Ala Lys Leu Leu Ala Gly Asn Ser Asn Ser Ala His 

595 600 605 

Cys Leu Pro Gly Ser Thr Ala Ser Pro Ala Pro Gin Glu Gin Ser Gly 
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610 






He 


Phe 


Val 


He 


625 








Pro 


Lys 


Tyr 


Arg 


Ser 


Arg 


Ser 


Cys 








660 


Asp 


Glu 


Pro 


Ser 






675 




Pro 


Gin 


His 


Pro 




690 






Pro 


Leu 


Arg 


Ser 


705 








Phe 


Leu 


Gin 


Pro 


Thr 


Ala 


Ser 


Thr 








740 


Glu 


Glu 


Ser 


Val 






755 




Ala 


Thr 


Ser 


Ala 




770 







Asp Ser 
630 
Lys Gly 
645 

Asn Cys 

Ser Asn 

Gin Pro 

Val Gly 
710 
His Ala 
725 

Pro Thr 
Pro Val 
Glu Ser 



615 

Glu Ala 

Thr Ala 

Ser Ser 

Leu Cys 
680 
Ala Asn 
695 

Ser Phe 

Ala Arg 

Pro Ser 

Gly Leu 
760 



Ser Pro 

Phe Thr 
650 
He Ala 
665 

Arg Asp 

His Pro 

Met Gin 

Pro Thr 
730 
Pro Pro 
745 

Thr Thr 



620 
Gly Ser 
635 

Arg Ser 

Lys Gly 

Gin Glu 

Thr Glu 
700 
His Ser 
715 

Ser Ser 
Pro Met 
Ser Ser 



Asn Gly 

Ser Leu 

Arg Gly 
670 
Ser Ser 
685 

Asn Phe 

Asn Leu 

Thr Ala 

Gly Gin 
750 
Pro Ser 
765 



His Lys 
640 
Lys Lys 
655 

Val His 

Val Leu 

Phe Ser 

Phe His 
720 
Ser Ser 
735 

Ala Gin 
Leu Leu 



<210> 112 
<211> 4314 
<212> DNA 
<213> Drosophila 



<400> 112 

gagccaattc 

atctgacttg 

ctttaatgcg 

tgaattcgtc 

taatgcgcca 

tttcatttgt 

attttatggc 

cgggcgcgga 

tcatccttta 

ttgagttgaa 

aatctactat 

aagcacatct 

tctacgcaat 

ttaattccta 

gataatatta 

gaccaaagtt 

attcttagac 

tgtccgcagc 

attaactttt 

tagcagcagc 

gcatcaggaa 

gatcatggcg 

acatagagag 

tagggggcac 

cgctgcgttt 

cgcgaattca 

aaatcggtaa 

attaataaat 

actataaccc 

atgttatata 

gtcatcaccg 



ttttttggct 
tttgctcctt 
attgctggtt 
tggggtttcc 
cgttgtccct 
agtcaacaag 
caaattacat 
atatgctaag 
aggtatctct 
tgctacattt 
gtttaagatc 
gtgttacact 
ttaaaaggcg 
tatttaaatg 
ttaagttata 
taattttaag 
tgaataattc 
tgaatgaatg 
aattgccaaa 
aatagcaata 
gccgcacgag 
tactgagcgg 
agagagagag 
ttgcctgcat 
gactggaatt 
gcgaaccctt 
ggtaattagt 
atatataaat 
ccacttccgt 
acagcgggtt 
tgtagacaaa 



ttacgtgacg 
tatggcccgc 
gctcattacg 
atttggctga 
gcatttgcat 
gcgcagacga 
ttttactaga 
ctttatttgc 
agaagtaagt 
tcgaaatctt 
gcaacatttt 
ttggtatgca 
atctattaaa 
gtgggcgttc 
tccgcctcag 
agcgccagta 
tgaatacttg 
aaaaacttaa 
tgcaaatgaa 
gacgacaaat 
agagtgagtt 
cgactggcgg 
agagagagag 
tatcgacggt 
ggttagtgcc 
cgataccttc 
taaatagaaa 
atttataata 
tttactcacc 
gatgcaggtg 
ctcgtgctga 



ttggcagaac 
tgcgctttgg 
gccaatcctt 
ttgcattttc 
gaccttgaac 
cagggccaca 
atgaatccat 
agccatggga 
ttagtaagcg 
attttaaatg 
tttttacact 
tgtgaaacat 
tttcaagggg 
actagtgatt 
gcttgtttaa 
tgctgaatag 
aaagactagt 
tgcttaaggg 
tgcaaaagca 
aactggctta 
cgagtttatc 
cttgttcgtc 
ccagttaata 
ttcggctgct 
ctcctcaggc 
ttgccaaaca 
aattcagaca 
cagataaatt 
ttgaatgcct 
gacaagtagt 
tcccgcagtt 



aaatcgtttt 
actttgacgc 
actagtactt 
ataacaattt 
cagtagccac 
atcgataaaa 
agaaagatgc 
ataatcttaa 
aactttgtcc 
cgattcaata 
ttcttaacta 
ttggttaact 
caagcttttc 
gatttatgct 
gagtaaatcc 
tatgagtcaa 
ttaggagttt 
agagtgtgcg 
gtttaaacaa 
aacagaattc 
agttagttga 
ggattctgga 
tcgaacctat 
ccgccgactc 
ggcgcgattc 
gaacggccta 
ttcgaattat 
tgaaatatag 
ctcggaactt 
agaggacacc 
tggccccccg 



tgttttagcc 
atttgcgtcg 
gcactttctg 
ggaggattac 
ctcattagaa 
acccaaatgt 
ggaaaataag 
tacagcaaat 
gtcatttcca 
caatatacag 
aacactttaa 
aaaacatata 
gtttttaaat 
taccattatc 
cccgagctct 
aagattgaat 
ttccagccgt 
ctcgactgtg 
aacgctttgg 
aaataaatta 
cttggacacc 
tgtggagtac 
ccattgatta 
aatggagaag 
gatgttgttt 
aaggagtggg 
taataaatat 
tattgcgcat 
gtggctcata 
ggagacatag 
tgcaggggcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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tagatggcaa 
acggccacta 
tggcaatgga 
ttttggtaat 
tggctgcaga 
tgagcaaaag 
agatatcata 
aaaaacggag 
tactattaag 
atacccccgt 
atcctgagta 
gcccccgagc 
atagcggtac 
tgctgccgtc 
ccgataagta 
gacagctgga 
tagtgctcga 
accaggacga 
tggcaaatgg 
gccgatgtct 
tacttggacc 
gagatggcca 
atcacaatgc 
accgttacgg 
ttgccagcgg 
ccactaacag 
gtcccctcaa 
ggtctggact 
gcacgcaaac 
gtattccgta 
gttaatttga 
gcgctccaaa 
gaaagccagc 
tggttgtaat 
gatatcttga 
aatttttggg 
catttataaa 
cacgataatg 
gatgatgcaa 
cttacttgtt 
aaatgtaaac 



tcagtcgctg 
aaggggaggg 
aatggggaag 
cgcaattgtc 
cgggttgatt 
ttgaagtaaa 
tttcagaatt 
tcttctactt 
tcatgtttac 
actattgctg 
ctcgccgggt 
cgcccatcaa 
cggatccacc 
tggcaaccga 
ggtacaacac 
atgagtgctt 
ttccaaattg 
tgatgcgaat 
caatataccg 
ccgccaacag 
agatgtagga 
gcgaaaagag 
aggtacttat 
gtatcacgat 
acatggcgcc 
cgctaaattc 
ctatgctaat 
cgcgcttcgc 
gcaccaaatt 
ttcccggctg 
tttcgattcg 
cttatctgat 
gacgaagcgt 
gaccgagcgt 
tgggactagc 
taattaaaaa 
acacttagtt 
taaacaacac 
taataccaaa 
agcttataag 
aaataataat 



ggcgtggaag 
gggggcggta 
acacagcgag 
atcataaatg 
agcccagctt 
gcttctaaaa 
tttaatttgc 
ttgtttttgt 
ttaatggtta 
gtatttttcg 
gccatagtga 
gcccgctgtc 
atagcgatat 
ggcaggaccc 
taggataata 
cactatgacc 
ggcagcctgc 
ggcgcgactg 
ctccaccgtg 
accacgtccg 
cacctcctgc 
gtagtaattc 
gttgcccact 
ggccagcgga 
ggcggcctgg 
cactgccgtt 
tataataatt 
aatgctagat 
accgacgagg 
tttttctagg 
aattctgatt 
aagccgcggc 
cgctgttttt 
ccagcgaacg 
cgatgttaga 
ggctaattaa 
gggaattttg 
atcttggaat 
tgaccgaagg 
taagaatatt 
aattacaatt 



ggggcccagc 
aaagtcagtg 
agtcacagat 
gagagtgcca 
agggcatatg 
tggttcttca 
aagacttttg 
caaaatttct 
gcggtttaga 
ccagtgtata 
ttgagccgac 
cctgctccac 
aggatcgtat 
tccaccaaag 
gacatcggag 
cgcactatgc 
ggaatggccg 
agtttactca 
aaggccgtaa 
atgcagatgt 
ggaacgcccg 
gtggccgtgt 
acaccggtta 
tcgcgtggcg 
aatttttccc 
taactgctgc 
tattggaggc 
actcgattct 
tcgagatccg 
gggcccgact 
cccgagagca 
gacatttcat 
ccctcacatg 
tgccggctca 
gactaagcac 
caaaaacttg 
actttgtgat 
taaaagccaa 
ccaccgtgcg 
gttctttgtt 
caaatctaat 



agaggaagaa 

acgtggggat 

aaagagtcaa 

aaatgacaaa 

gtcaaacagt 

ttgcatataa 

tgagaagact 

tactaattta 

gatctgtcat 

tcgtgctata 

ctctcaccga 

ttcctccgcc 

cactgggcac 

tggatcgata 

ccaggaagaa 

cgcattgctc 

taactatggc 

tggcctggcc 

tcgttagcac 

actccccaaa 

acaacaggag 

gcattgaacg 

tgaaaatgag 

gtccaagatc 

cttttgtttt. 

ggctccggct 

gaccgggcgg 

aagattcaga 

tataccgtat 

tctcttcaat 

gcctgcaacc 

tcagatttgt 

aacgcggaag 

attaaatctt 

aatggctccg 

taataatgtt 

gcacataatc 

aataagatgc 

tatgagtaat 

tatcttacta 

ttagattaat 



gcacaccacc 
gaaaatggaa 
cgattatatt 
cgcagtaaat 
tggcatacac 
agaaagattt 
aaaccttatt 
atcatctgaa 
ttaatttgtg 
attacctagc 
gctaagttga 
gttgaaactc 
actcttcagc 
caggtgcaca 
tatgaacgtg 
cactcccgaa 
cataatccaa 
cagaaacgga 
cgtggcattc 
cacgtacggg 
caggaaatcc 
gttcttctta 
ggagtagacc 
atggctcata 
cacaaaattt 
gcgactcgat 
aggagggaca 
tactcgactg 
tcagtattca 
aagttatcga 
gcactccagc 
atctgtatct 
gtgagcgaca 
gaaaagcggc 
aatacaacgc 
tgctttggcc 
ctaaataaac 
aataatatta 
gtccagccag 
aagtatattg 
taaa 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4314 



<210> 113 
<211> 1287 
<212> DNA 
<213> Drosophila 



<400> 113 

atgtccgctg 

atacccgtaa 

acctgcattg 

tcgctggcca 

atctggtcca 

gcggagacat 

attgccattt 

atcatcgtcc 

ggaatcgagc 

tcattccagc 

tacctactta 

gccagacggc 



gcaatatgag 
cggtggtcta 
tgattaagaa 
tctcggattt 
agtacccgta 
cggcgaatgc 
gccatccgtt 
tggtttggat 
actattcggg 
tgtccacgtt 
tcggtgtgca 
agcagctgaa 



ccatgatctt 
ctccctcatt 
gaaccgttca 
cctgctcctg 
cgtgtttggg 
cacggtgcta 
tctgggccag 
tatggccata 
agtggagcaa 
catattcttc 
cctgtatcga 
gagtgtgccc 



ggaccgccac 
ttcataaccg 
atgcacacgg 
ttgtcgggcg 
gagtacatct 
acgattacgg 
gccatgagta 
gttacggcca 
tgcggcatag 
ctggctccga 
tccactttgg 
agtgatacga 



gcgatccgct 
gtgtagtggg 
ccacgaatta 
ttccgcagga 
gcatcggacg 
ccttcacggt 
aactcagtcg 
ttccgcaggc 
tgcgggtcat 
tgtctattat 
tggagggtcc 
tcctatatcg 



ggccatcgtg 
caacataagt 
ctacctcttt 
ggtgtcctac 
tggtctgttg 
ggagcggtat 
cgccattcgc 
tgcccaattt 
agtgaagcac 
cctagtgttg 
tgcctcggtt 
ctatggtgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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tccggtaccg ctatgagttt caacggcgga ggaagtggag cagggacagc gggcttgatg 780 

ggcggctcgg gggctcaact Cagctcggtg agaggtcggc tcaatcacta tggcacccgg 840 

cgagtactca ggatgctagt ggccgtggtg gtgtgcttct tcctctgctg ggcccccttc 900 

cacgcccagc gactgattgc catctacgcc cctgcacggg gggccaaact gcgggatcag 960 

cacgagtttg tctacacggt gatgacctat gtctccggtg tcctctacta cttgtccacc 1020 

tgcatcaacc cgctgttata taacattatg agccacaagt tccgagaggc attcaaggcc 1080 

gttctgtttg gcaagaaggt atcgaagggt tcgctgaatt cgcgaaacaa catcgaatcg 1140 

cgccgcctga ggagggcact aaccaattcc agtcaaacgc agcgcttctc cattgagtcg 1200 

gcggagcagc cgaaaccgtc gataatgcag gcaagtgccc cctataatca atggatagct 1260 

gcggacaacg gctggaaaaa ctcctaa 1287 

<210> 114 
<211> 428 
<212> PRT 
<213> Drosophila 



<400> 114 



Met 


Ser 


Ala 


Gly 


Asn 


Met 


Ser 


His 


Asp 


Leu 


Gly 


Pro 


Pro 


Arg 


Asp 


Pro 


1 








5 










10 










15 




Leu 


Ala 


He 


Val 


He 


Pro 


Val 


Thr 


Val 


Val 


Tyr 


Ser 


Leu 


He 


Phe 


He 








20 










25 










30 






Thr 


Gly 


Val 


Val 


Gly 


Asn 


He 


Ser 


Thr 


Cys 


He 


Val 


He 


Lys 


Lys 


Asn 






35 










40 










45 








Arg 


Ser 


Met 


His 


Thr 


Ala 


Thr 


Asn 


Tyr 


Tyr 


Leu 


Phe 


Ser 


Leu 


Ala 


He 




50 










55 










60 










Ser 


Asp 


Phe 


Leu 


Leu 


Leu 


Leu 


Ser 


Gly 


Val 


Pro 


Gin 


Glu 


Val 


Ser 


Tyr 


65 










70 










75 










80 


lie 


Trp 


Ser 


Lys 


Tyr 


Pro 


Tyr 


Val 


Phe 


Gly 


Glu 


Tyr 


He 


Cys 


He 


Gly 










85 










90 










95 




Arg 


Gly 


Leu 


Leu 


Ala 


Glu 


Thr 


Ser 


Ala 


Asn 


Ala 


Thr 


Val 


Leu 


Thr 


He 




100 










105 










110 






Thr 


Ala 


Phe 


Thr 


Val 


Glu 


Arg 


Tyr 


He 


Ala 


He 


Cys 


His 


Pro 


Phe 


Leu 






115 










120 










125 








Gly 


Gin 


Ala 


Met 


Ser 


Lys 


Leu 


Ser 


Arg 


Ala 


He 


Arg 


He 


He 


Val 


Leu 




130 










. 135 










140 










Val 


Trp 


He 


Met 


Ala 


He 


Val 


Thr 


Ala 


He 


Pro 


Gin 


Ala 


Ala 


Gin 


Phe 


145 










150 










155 










160 


Gly 


He 


Glu 


His 


Tyr 


Ser 


Gly 


Val 


Glu 


Gin 


Cys 


Gly 


He 


Val 


Arg 


Val 










165 










170 










175 




He 


Val 


Lys 


His 


Ser 


Phe 


Gin 


Leu 


Ser 


Thr 


Phe 


He 


Phe 


Phe 


Leu 


Ala 








180 










185 










190 






Pro 


Met 


Ser 


He 


He 


Leu 


Val 


Leu 


Tyr 


Leu 


Leu 


He 


Gly 


Val 


His 


Leu 






195 










200 










205 








Tyr 


Arg 


Ser 


Thr 


Leu 


Val 


Glu 


Gly 


Pro 


Ala 


Ser 


Val 


Ala 


Arg 


Arg 


Gin 




210 










215 










220 










Gin 


Leu 


Lys 


Ser 


Val 


Pro 


Ser 


Asp 


Thr 


He 


Leu 


Tyr 


Arg 


Tyr 


Gly 


Gly 


225 










230 










235 










240 


Ser 


Gly 


Thr 


Ala 


Met 


Ser 


Phe 


Asn 


Gly 


Gly 


Gly 


Ser 


Gly 


Ala 


Gly 


Thr 










245 










250 










255 




Ala 


Gly 


Leu 


Met 


Gly 


Gly 


Ser 


Gly 


Ala 


Gin 


Leu 


Ser 


Ser 


Val 


Arg 


Gly 








260 










265 










270 






Arg 


Leu 


Asn 


His 


Tyr 


Gly 


Thr 


Arg 


Arg 


Val 


Leu 


Arg 


Met 


Leu 


Val 


Ala 






275 










280 










285 








Val 


Val 


Val 


Cys 


Phe 


Phe 


Leu 


Cys 


Trp 


Ala 


Pro 


Phe 


His 


Ala 


Gin 


Arg 




290 










295 










300 










Leu 


He 


Ala 


lie 


Tyr 


Ala 


Pro 


Ala 


Arg 


Gly 


Ala 


Lys 


Leu 


Arg 


Asp 


Gin 


305 










310 










315 










320 


His 


Glu 


Phe 


Val 


Tyr 


Thr 


Val 


Met 


Thr 


Tyr 


Val 


Ser 


Gly 


Val 


Leu 


Tyr 










325 










330 










335 
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i yr 


LeU 


Ser 


Thr 


Cy s 


He 


Asn 


Pro 


Leu 


Leu 


T vr 
j 


Asn 


He 


Met 


Ser 


His 






340 










345 










350 






J_iy s 


Phe 


Arg 


Glu 


Ala 


Phe 


Lys 


Ala 


Val 


Leu 


Phe 


Glv 


Lys 


Lys 


Val 


Ser 






3S5 










360 










365 








Lys 


Gly 


Ser 


Leu 


Asn 


Ser 


Arg 


Asn 


Asn 


He 


Glu 


Ser 


Arg 


Arg 


Leu 


Arg 




370 










375 










380 










Arg 


Ala 


Leu 


Thr 


Asn 


Ser 


Ser 


Gin 


Thr 


Gin 


Arg 


Phe 


Ser 


He 


Glu 


Ser 


385 










390 










395 










400 


Ala 


Glu 


Gin 


Pro 


Lys 


Pro 


Ser 


He 


Met 


Gin 


Ala 


Ser 


Ala 


Pro 


Tyr 


Asn 










405 










410 










415 




Gin 


Trp 


He 


Ala 


Ala 


Asp 


Asn 


Gly 


Trp 


Lys 


Asn 


Ser 











420 425 



<210> 115 
<211> 5927 
<212> DNA 
<213> Drosophila 



<400> 115 

ttaagccaca 

tgagtacaat 

tgtctgtttg 

ggcgtggctg 

tttctttttt 

gtgagttgaa 

gccgtctgtt 

acaggggaat 

tgtctcagtg 

catttgtaat 

gaatggaatg 

tctacacata 

tgtgttaatg 

aagatggacg 

tatatcgtaa 

gcccacgcgg 

attgaaattt 

attaagctcg 

ctcatttaaa 

tagaaaaaaa 

ataaatcaat 

caaacgaaaa 

gtcacttgaa 

aaggccttcc 

ctgtaggcca 

atgacatcca 

gattatttaa 

cagagcgagg 

cgctgcgact 

cgcatgtaga 

acctggtagg 

atgttcttct 

tatgaagaaa 

cagttgcatg 

aaactgtcat 

tgtcgctgac 

aatgggtatg 

tctgttggct 

atcgtaaaaa 

cgcgctttca 



aaatgattga 
gtactatgta 
tccctaaagt 
tccagtgatt 
ttgcgtttat 
ttggaatgag 
ttcacttctt 
gctataaaaa 
cgttgctaat 
ttaaagtcca 
cctcgtaagc 
caataatgga 
cccaacagct 
gtccgatgat 
atatagattt 
cgtatgcgcg 
aatgggaaaa 
taaattattt 
aactcattag 
aaatatatat 
ataaagcccc 
caatgctctg 
ggacccttct 
ggaaattctc 
gagtctgggc 
gtgacttgag 
aatatatgtg 
cgaaggcgat 
ccttggacat 
gaccgctgat 
tgcgcggacc 
ttgcatcgta 
cgtgtgcggt 
gcagccaagc 
acgaatagtt 
atttggctga 
ggcaatcggg 
agacgagact 
tattcgcagc 
atcgatgaat 



aaggaaatgt 
cttccctcgg 
aattaaaagt 
tgccacccgc 
gcaaatggcc 
ccgcaattct 
ttgcgggcgg 
gcccgaccaa 
ttggccatat 
attattgtgc 
cacatcgctg 
aaatatttgt 
gcaaatctcg 
gatccgcttg 
tgatgctgat 
tttttcgcac 
cacgcgctgt 
aattgcattc 
gaatagcaat 
atatcaaata 
acttaaaatt 
aaaatatttc 
cgtccagccc 
ggagaggaag 
ggtgacctgg 
cagcaggatg 
ggtttcatat 
gaccaccacg 
gcggacgccg 
gatcatcagg 
aaggaagtcg 
atccacctgc 
gtgggaatcg 
aacgtcaaac 
cggacgtcga 
caaattgttc 
ctttcgaggc 
ggtttgctga 
cttttgttta 
gcgtgtggaa 



tgtggggtaa 
cagctcctgc 
aagtaaagca 
aacgattttt 
gctgaccaaa 
cggagtccaa 
gacgaaacat 
acaaaaaatg 
ctttaaatag 
catttgaggg 
cacttcatta 
tccacaataa 
gtctgcggcg 
gctgaaatta 
gccgatgctg 
ttaactcatt 
tttttttttt 
ttgatttttg 
ggaaacgtgg 
tatgaaaaag 
acgacacaaa 
atcgatatat 
ccgagtgcac 
gcgtagagca 
atgactagct 
agctgttcaa 
gtattattgc 
acgacgagtc 
gttccctggc 
ggcagcaggt 
ttcgtcgtga 
ggatatcgat 
aatattgcgc 
ggccaacagc 
caactaactc 
agccggagtt 
ttgttgtttg 
cgcttaggag 
catttttgtt 
aattgcattt 



aagtaaaatt 
tgtgttcctt 
aagtcacccc 
ttttccagta 
agaattaatg 
aaactgcggg 
caacgaatgt 
taaagcccgt 
ccagccgaac 
gaaaactgtg 
aattgacgag 
aaggggttga 
gccttaaaaa 
ttctgtaaat 
cgcgagtaaa 
aaaacgtttg 
ttagaaattc 
atttgaaatc 
ttgagcggat 
gttttgaagt 
agtttccatc 
tgaccgagca 
caaccccact 
gcggattgat 
tggtgagggt 
tcaaacgaaa 
gtctcacctg 
gggtgacccg 
gccagagccg 
aggagctgat 
aggtgcacat 
gagaatgtgg 
atacgtcatg 
cagacatcgt 
gcaatctcac 
ttggatttgg 
ggggatggag 
ggtcgtaaac 
tacaatctcc 
aaaaaatctt 



ctcagatcga 
cagccgtttg 
ctcgtcggtg 
ctttcctttt 
cgtcccgaaa 
cagcctgtca 
caatgaattt 
aattccccat 
ggaaaggaat 
gcatgccagg 
acaaaagatc 
ggaggcactt 
tagagaaatg 
tctacctatt 
accaaattcg 
ccgctggccc 
gcctcaatag 
aaaacattcc 
gtttatattg 
ttgtcagtca 
aaacgaaatg 
ctcagatgtc 
caccttgtag 
acacgagctg 
gttcgtctcg 
gaatatgttt 
gacaggcagc 
cttgcgaccg 
catgatcatg 
gaagaaggtg 
gccgtaggtg 
ttgtgggaac 
ttggcggtga 
aaataaaact 
tttgtttctc 
ataaaagcgc 
aatctattgg 
tgcccggtga 
gccccttcac 
cccattaatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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ccggcatgaa ttgcttacaa aaatgtttgt ttatggcccg cccttgtttt cgttacactg 2460 
agcgagcgat gttggcccac aaattgagtt cgaactctga atctgcgcag cgtcaagtgg 2520 
ccaaggccac agaagtagac aaatcatggc cagaccctcc gctgatttat gggttcattt 2580 
cgctggcatg ttctccgcaa aacccaacga cttcatagac ttgacaagcc attaatgccc 2640 
tttcctcgtt cgcgagaaat cagccaacag ctgtcaatca atgccatgac ctggagatcg 2700 
catatctagt tacgcagtcg gacatatgtg catttatgat tattatttga tatcgatttc 2760 
taataaaaag ataatggcac acgctctcac aggctcattt gatatgcata taccaagcat 2820 
gtagctgggc aatgtaatat ggaaataatt aatagcaaac tgttaattaa catgttgatc 2880 
aaacatgact taattatgcc tttttatcgc ctgttccttt gtatttgcac acgcagacac 2940 
cagataacaa agcaaagcac acatatgagc attcaatgtg ggctctattt ttccagctta 3000 
atccttagac aaagcaaagc aaacacataa acaaacaata atgccaccca ttggagcact 3060 
ctccccaatg cgaacaaaaa ccgaacgaga caaaaataac aatataacga agcgtgaaat 3120 
gccatttaat aaaatattag gttacaataa caacaccggg agcaacagtg ggccattgcc 3180 
atctcacaga taaaaaaagt tcatcatccg tgtgaagatg agattcgttt aattaaccgc 3240 
gactgtcgct tttggccaag gaaaagggga aacacaacgg cacaaacaaa aagttggtca 3300 
tgtgtgatgt gggaatcgct ttattggcac tgggaaatca gtggaaaaag cccgcgggcc 3360 
atttatgagt atcaacagaa attggaacca atgtccgacc ataattacgt gtagttgata 3420 
ctcgcaatga aaaaatatga tagatattca gtattaaata gcactcaaca attgtttgaa 3480 
ctgtaccgga aaattgaata ttagttaata atcccagtgt tttcttgcga ataaaaactt 3540 
aaggttaaag tctgcaaata caaatacatt gtatatcctc aaaatcgccg gtaaattccc 3600 
cttttcattt ttgatatcac ccgctgatct cgagcaaaac aattccgatc ttcgtcgatt 3660 
attccgcaat taaattgttt ctcgcacttt tcgcatcgat cataaaagcc taatgtccaa 3720 
gatgagcatt tcaaggtcgc cacccgcgaa gacatttatc atacgcagtg tgcgcccaaa 3780 
gttcgcaagt gcaaagtttc cataatggcc gcttgttacg acttacgact gcccatgatt 3840 
tacgcgcttt tggatcaaca gttgaatttg aaattttgca agccacacgg agaggaggtg 3900 
ggcagatatg ggcagatacc gcccagttgg caagtgtttc tcgctggcta tctggtctat 3960 
aattaaaaat ttatgaattc aaaatagcaa attagtaaca aaaacaaatc gggcgccgcg 4020 
catcgcgaaa ttttcaatat aaacacaaaa cagaaaataa gataaaatgg ggatacttta 4080 
ataggggcat atagtaaaaa caaatgagtt gtatcaggat cagtcgattg gacgatcgct 4140 
ggtttgtgcg gggtcttcgc tttctggatt acgatctggc agaaggcaga ccatctatct 4200 
aggtatctat agctacatta gaaccaggca gtcggcacca ttgacgtgca gcgagaggaa 4260 
tctaacgagg tgacgatgat ggaaagcaat aaatttgcct ggcccgtgtt cttcacagca 4320 
aattcaggca cagcaggccc acttatagca ttctacccac tcctcccact cttccactcc 4380 
cacttaccac cacgtcgtgg gtgaaggcca ctggcaccga- aacgaccagc accacgatcc 4440 
acagagtcac gatggcaatc agggtaatgt tctccgtcct catcatccgc gagcgaatgg 4500 
gatgaaccac cgccaggaac cgatcgatgg acatcagcac cagcgtgtaa atggaggcga 4560 
aggcggtcac cacaatcagg tactggacac tgcggcacca gaaccttcca tatggccagt 4 620 
agtacaccat gtaatcggtg gccgtgaagg gaatgcagag gattacgaac atcagatcag 4 680 
cggcagccag attgacaatc atcaggttgg tggtggagcg catgttgttg ttgaagacca 4740 
ccaccaggat gaccagcagg ttgccgaaga atcccgtgat ggcgatgacg ccgaagaaga 4800 
atcccactat ccacggcaaa gtgccattga tggccagcca ctcttcgtcg gtgaagaatg 4860 
aggtgatatt ctcctcattt ctggttgcat ttaggctaat attagccagc attgtggtgt 4920 
tctccatgtt ttctcagctt tagtgcacaa ctttccggtt cactttgtac tggctattgg 4980 
cgatttcgct tgattgtcac gggacatttt caattttggg ctctttttcg ctcacagttc 5040 
atcgaacgca aataacaaat ttccaattag gagagtttca caaagcagcc gccacaaaaa 5100 
ttcaattatc atgcggaata ctacgaccgt tgagcttcgg agacctcaaa aacgtttcgc 5160 
atacgccgtg tggtatcgaa ctcgaccgga attgttcgca acgtggcggg ggcaaaagca 5220 
atttggtcct tccacgataa cggtaattgt aggtgggtag cacaggtaag acaacaataa 5280 
aggacagcac acaaggatca cggtgaggcg aaaggcaaga gagtgagttt aaaggtgagg 5340 
gcacgtgcaa acgacaccga cacagccaaa ctgtgccgag tcacaccctg acgacaagga 5400 
caccggggtg tccttaggtt cgaggaccac gttgctgcac ctacaccata aattcaatat 5460 
gatgtcttcc tttcgctgcg tttctgttta catttctcgt ttgctttttt attgacaaac 5520 
gaacggcata aattttgggg ctataaggct atggccaaaa aaccacgttc ctattgttgc 5580 
taccgactgc tgcgccgcgg tacgctatat attttcgatt ttatactgct gcttttggcc 5640 
aattcgatag tccacactgc tcgcgtccgt atccagcgcc caactgattg ctggctgagc 5700 
ttgttgagga atacttcgat gcctcggcgg cggcaaaacg ccggcagcga cgccagccat 5760 
cccagcaaga ataatggcaa caatagcaac aaccagcgca gctggcatct gtgcgtgtga 5820 
tggttttggg tggccgccgg agccacctct tcctcctcat cctcctcatc cgatcccaat 5880 
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ccccatcagc tgggctggaa tgtttgctct tgttgctccc accgcta 



5927 



<210> 116 
<211> 990 
<212> DNA 
<213> Drosophila 



<400> 116 

atggagaaca 

atcacctcat 

gtgggattct 

ctggtggtgg 

gctgccgctg 

gtgtactact 

accgccttcg 

gttcatccca 

actctgtgga 

gtggattacg 

ttccttggtc 

atgatcatca 

gtccgcatgt 

gtggtcatcg 

ctggatgtca 

ctggcctaca 

cggaaggcct 



ccacaatgct 
tcttcaccga 
tcttcggcgt 
tcttcaacaa 
atctgatgtt 
ggccatatgg 
cctccattta 
ttcgctcgcg 
tcgtggtgct 
atgcaaagaa 
cgcgcaccta 
gcggtctcta 
ccaaggagtc 
ccttcgcctc 
tcgagacgaa 
gcagctcgtg 
tctacaaggc 



ggctaatatt 
cgaagagtgg 
catcgccatc 
caacatgcgc 
cgtaatcctc 
aaggttctgg 
cacgctggtg 
gatgatgagg 
ggtcgtttcg 
gaacatcacc 
ccaggtcacc 
catgcgcatg 
gcagcgcggt 
gctctggctg 
caccctcacc 
tatcaatccg 
gaatttctaa 



agcctaaatg 
ctggccatca 
acgggattct 
tccaccacca 
tgcattccct 
tgccgcagtg 
ctgatgtcca 
acggagaaca 
gtgccagtgg 
tacggcatgt 
ttcttcatca 
atcatgcggc 
cgcaagcggg 
cctgtccagc 
aagctagtca 
ctgctctacg 



caaccagaaa 
atggcacttt 
tcggcaacct 
acctgatgat 
tcacggccac 
tccagtacct 
tcgatcggtt 
ttaccctgat 
ccttcaccca 
gcaccttcac 
gctcctacct 
tctggcgcca 
tcacccgact 
tcatcctgct 
tccaggtcac 
ccttcctctc 



tgaggagaat 60 
gccgtggata 120 
gctggtcatc 180 
tgtcaatctg 240 
cgattacatg 300 
gattgtggtg 360 
cctggcggtg 420 
tgccatcgtg 480 
cgacgtggtg 540 
gacgaacgac 600 
gctgcccctg 660 
gggaaccggc 720 
cgtcgtcgtg 780 
gctcaagtca 840 
cgcccagact 900 
cgagaatttc 960 
990 



<210> 117 
<211> 329 
<212> PRT 
<213> Drosophila 



<400> 117 
Met Glu Asn Thr 
1 

Asn Glu Glu Asn 
20 

lie Asn Gly Thr 
35 

Ala lie Thr Gly 
50 

Phe Asn Asn Asn 
65 

Ala Ala Ala Asp 

Thr Asp Tyr Met 
100 

Ser Val Gin Tyr 
115 

Leu Val Leu Met 
130 

Arg Ser Arg Met 
145 

Thr Leu Trp lie 



Thr Met 
5 

He Thr 

Leu Pro 

Phe Phe 

Met Arg 

70 
Leu Met 
85 

Val Tyr 

Leu He 

Ser He 

Met Arg 
150 
Val Val 
165 

Val Asp 



His Asp Val Val 
180 

Met Cys Thr Phe Thr Thr 

195 



Leu 


Ala 


Asn 


He 
10 


Ser 


Leu 


Asn 


Ala 


Thr 
15 


Arg 


Ser 


Phe 


Phe 
25 


Thr 


Asp 


Glu 


Glu 


Trp 
30 


Leu 


Ala 


Trp 


He 
40 


Val 


Gly 


Phe 


Phe 


Phe 
45 


Gly 


Val 


He 


Gly 


Asn 


Leu 


Leu 


Val 


lie 


Leu 


Val 


Val 


Val 


55 










60 










Ser 


Thr 


Thr 


Asn 


Leu 
75 


Met 


He 


Val 


Asn 


Leu 
80 


Phe 


Val 


He 


Leu 
90 


Cys 


He 


Pro 


Phe 


Thr 
95 


Ala 


Tyr 


Trp 


Pro 
105 


Tyr 


Gly 


Arg 


Phe 


Trp 
110 


Cys 


Arg 


Val 


Val 
120 


Thr 


Ala 


Phe 


Ala 


Ser 
125 


He 


Tyr 


Thr 


Asp 


Arg 


Phe 


Leu 


Ala 


Val 


Val 


His 


Pro 


He 


135 










140 










Thr 


Glu 


Asn 


He 


Thr 
155 


Leu 


He 


Ala 


He 


Val 
160 


Leu 


Val 


Val 


Ser 
170 


Val 


Pro 


Val 


Ala 


Phe 
175 


Thr 


Tyr 


Asp 


Ala 
185 


Lys 


Lys 


Asn 


He 


Thr 
190 


Tyr 


Gly 


Asn 


Asp 
200 


Phe 


Leu 


Gly 


Pro 


Arg 

205 


Thr 


Tyr 


Gin 
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Val Thr Phe Phe 
210 

Gly Leu Tyr Met 
225 

Val Arg Met Ser 

Leu Val Val Val 
260 

Gin Leu lie Leu 
275 

Leu Thr Lys Leu 
290 

Ser Ser Cys lie 
305 

Arg Lys Ala Phe 



lie Ser Ser Tyr 
215 

Arg Met He Met 
230 

Lys Glu Ser Gin 
245 

Val Val He Ala 

Leu Leu Lys Ser 
280 

Val He Gin Val 
295 

Asn Pro Leu Leu 
310 

Tyr Lys Ala Asn 
325 



Leu Leu Pro Leu 
220 

Arg Leu Trp Arg 
235 

Arg Gly Arg Lys 
250 

Phe Ala Ser Leu 
265 

Leu Asp Val He 

Thr Ala Gin Thr 
300 

Tyr Ala Phe Leu 
315 

Phe 



Met He He Ser 

Gin Gly Thr Gly 
240 

Arg Val Thr Arg 
255 

Trp Leu Pro Val 
270 

Glu Thr Asn Thr 
285 

Leu Ala Tyr Ser 

Ser Glu Asn Phe 
320 



<210> 118 
<211> 7383 
<212> DNA 
<213> Drosophila 



<400> 118 

atcttttaag 

ttgatgtttc 

gaagcggaaa 

taattcattc 

gcttaatgac 

attaataagc 

gctgtgtgct 

aatttaattt 

ttcagctaat 

gaatcgatta 

ttattataat 

ctgcgaaaca 

aggcaaatgg 

ggttttacta 

ataccctaat 

tgacagccaa 

cattggaaca 

acttgcccgc 

tgagtatgca 

tggagctctc 

tggggggcgt 

tcggatgcag 

gtgccgaaac 

cattgacatt 

catgcacccg 

caaagagatt 

cgcgggtgtg 

aacaggccgc 

gacgacggtt 

tattcactta 

atcatttgca 

ggcagaaaat 

tgaaagtgct 

gtaaaataga 

cgagtttatt 

caaacagccg 



taaaaagcgc 
aaatttatag 
ttgcgaaccg 
ggctgcatgt 
tcgaaatgtt 
ctttctgt'tt 
tgtgtaggat 
aatgctactt 
aatacatttg 
tttacaacgt 
actgtacacg 
attatcagca 
tcatgttcca 
aagccttaca 
aatagtcatt 
tttactggaa 
cggctataac 
atccttgtcc 
ggtgccatcg 
cctgcttagc 
ggttgatggc 
aggtttgaag 
acccggagtg 
gttgttctgc 
cggcatatgc 
actccggatt 
catcgaaaag 
aaatcgcttg 
taggttaatc 
aatacaaatt 
aaaatacatt 
ttttcgtgaa 
ctaggcaaac 
atatctgttg 
cggagatctg 
acgggttttt 



tcaagtcaaa 
caaaatgcca 
aatttattcg 
aattaagaat 
tattcgtttt 
tttaatgaca 
t taatacata 
aattgatttc 
atttggtgag 
ttggacttcg 
gctcacgacc 
atgggacttg 
atgacacatt 
tacaattaca 
cactaaaaca 
tacacatcat 
ttagatcgct 
ttcgcctgcg 
cattcggttt 
agggccaact 
agctccatga 
ttgtttcgcc 
gccgcaaagg 
ccactacttc 
aagcccggct 
cgcatcgagg 
gtcctttcgt 
ttgaattccc 
catcccaata 
aaagtgcatt 
ttttcgctat 
aaagcgaaaa 
ctccatcttc 
acaaagtttt 
gctctcaatc 
tcccgctttt 



acttttactt 
taatttcagt 
tacaatttat 
aataaaaata 
cgtttggtaa 
aatattgtac 
atcggattgt 
aggtgctaat 
ttttggtttg 
tgctttaagg 
agcattcaca 
taatcacaat 
tagttgcggg 
ataaaattta 
ttaacacaca 
atgtgaatgc 
ctagtttaaa 
agccgggcag 
ggctgccaaa 
ggactccggc 
gtgccacgtc 
gccacggcgg 
taaccacgct 
cgttgtaaat 
tcccattgtt 
agttggcgta 
gggcgtccat 
gcttaaattt 
acatatcact 
gcataaatca 
caaactttct 
ctgccccggc 
gctgacaaat 
tcacgcacag 
tatcttttac 
atcggtttgc 



tcatttaaaa 
tcgtttgaaa 
gggccacaca 
ctggtatatt 
ttaaatacaa 
attacatttg 
tgttaaaata 
ttaatttgat 
agtttgataa 
atcgcaattt 
ctcattcaat 
tcattaagac 
atatctaaac 
aacatattag 
tacgttaaat 
cgactggctg 
ttaaagtggt 
gtggactcgc 
ttcctgttcg 
cccggatgcc 
gtcctcgcat 
cattgcaacg 
ttgatgctga 
gccgctgtta 
gacaccacca 
ggttgacctt 
gctcgtcttg 
ttcctgtgca 
aaagtaagcg 
taaacctctt 
ggttaattat 
tacggcatct 
cccatttgtc 
ccagcagggg 
agacaatttg 
cactttggca 



gtaaattttc 
caaacgaaat 
tgaacaataa 
ttgtttgttt 
taataattac 
cttacatcca 
ctatttaaat 
tcattttttg 
ggtcattaac 
gaagatacca 
atatatcctt 
gaaatcacag 
actgtaaagg 
tgtagtagct 
gaatgagcta 
cggaatgtgt 
tgccacaagg 
gacacctcgc 
caaatgcagc 
aactcctcgc 
tcgattacgt 
ccaacacctg 
tgatgttggc 
gcaccggatc 
cgcgccggac 
atggagctgg 
aacttgcaga 
gagaaaaata 
aaagtacttt 
agcaaagttt 
tatgtgaatt 
ttctggctaa 
ttcgctgaca 
aaaaagttta 
agtgtcaaag 
ttctgaggat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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ttttgtggat 
tttgcttatt 
gtcagctatt 
gttatttttg 
ggaaaacatc 
tcttttttaa 
tttgcgtcta 
cgtgcaaagg 
tttttttact 
gacatccaat 
caaacttaag 
tttccccggc 
aacgtgtccc 
atgcaaattt 
tagatgccat 
aaccagacga 
ttatagagct 
ttgatgacct 
catcaacgtg 
tgtgcgttac 
ctgatgacac 
aattgatcat 
ttgttctctg 
tgacgtattg 
aagggcaccg 
aacttggtgg 
cgaatcgttt 
tgatggccct 
tcagggcctg 
gcagggcagc 
agattatcag 
ttacaaatca 
gtgtaaggaa 
caacccaaaa 
catgtcgggg 
aaagcccagc 
tgtgattact 
caaattgcaa 
aaatctattt 
tcaagttcaa 
aaaatcaatt 
ttaataatta 
agctaaaatg 
actttattca 
aagcaaactg 
caattcaatt 
agcttttagc 
gaaaatcaca 
atgcgcgtag 
aaattgatgc 
ggctggctac 
gtgatttatg 
tgcatgtatg 
taagcttaac 
gcaaaagcca 
accacccaga 
atggacaaca 
aaaaatagtt 
aactaaatga 
atgcgctcct 



ttcgcttcag 
ggctacagtg 
ttaggtgcga 
attaaaaaaa 
aacaaagcaa 
gcttctttga 
tgtttgtgtt 
ctttttttct 
ttaacttcat 
tgtacgcaag 
ttccccaaaa 
aaatttacca 
tgagcacgaa 
aacactccgt 
aaataaaggg 
tgctaatgaa 
gcagtggcag 
gcaagcgtcg 
caaaactcac 
caggagcacg 
agaatggaac 
cggataggtt 
ttcaaaggat 
acttaccgcg 
caaatagcag 
gacgtgccct 
cgaatgtggg 
atgccgatcg 
gacgaagggg 
ctggaactac 
aaacaacttg 
caaataatcc 
ataggatagc 
ttgcgactcc 
gtgttgtatt 
ggtgttgttt 
gcggatattt 
tttctcgccg 
gattgcggca 
ggttcaacga 
tactattttt 
tttcgactat 
caattgcggc 
aatgtccatt 
cgttcgcgtt 
actcggtatc 
actctatttt 
tttaaatttg 
gccatggcaa 
cggattatgc 
ctttctatgg 
cgttaaagtc 
catttcatat 
tgttgcaact 
aattagcgat 
tgaccaaagt 
gggccacaat 
gtgcattaga 
gatataacaa 
actaagttac 



ttttgcggtt 
gagcgctgct 
ggcgaaaaga 
aagtgacgcg 
tcgattcgct 
tttttgagac 
tcatttgatg 
tttaggtcta 
tattaaaagc 
gcaagttttc 
ctaacccaag 
cacatttata 
caaacatcca 
aaacaaatca 
attgtagcag 
gtggtagtcg 
ccagcagagt 
cagaaaaagg 
cttcttcttg 
tgtgccccac 
cagatactgc 
cttgttcatg 
gaaattcagt 
aaatctttcg 
ggccagcatc 
gcatgcgtaa 
agcccactct 
attgcaatgg 
cagaagctgc 
acggaatgaa 
actatacaac 
ctttaaaatg 
tacaagtgag 
attcaagtca 
ttcggccagt 
tggacactga 
gaagaccgcg 
gcttattgat 
ttaatataaa 
agtgtacaga 
atttgaaaca 
tcagttcgat 
attctgcgaa 
gctcattgat 
ttgcattcga 
acatacaaat 
gggtacttta 
atgtaggtaa 
ttttctaatg 
tgaaattggt 
tgaaaataag 
gatatccatt 
ccgcgaaaaa 
cacctgaaat 
atacatattt 
gttgccaatg 
ctcagcggga 
tttatatcac 
tctgttttac 
tgaaatttta 



tcacaatttc 
ttaattaact 
aattggaaat 
tatgcttata 
tacaatgaaa 
catgaaaact 
agataataga 
aaagtacgaa 
aagacgttaa 
ccaagatccc 
gcgaccttga 
gacttatgtc 
aaggatgtca 
tcatcatcat 
ctattgctca 
ttgatttccg 
ccaaagataa 
aaattaaacc 
tttttcaaca 
aatcgtaccg 
acaaaaacca 
cagaatggcc 
taaggtcgct 
gtcaactcct 
caaattccac 
gccgattttt 
cccccccagc 
cggtcagcgt 
acatgaacca 
aatattgcaa 
tatgcctaac 
ttcactatat 
tcgtccgcct 
cacggcaaac 
ttcaacattt 
tttgcgcagg 
ctttctccgt 
gacacttact 
ttaagtgccc 
ttgagcttta 
tagaaagatg 
ggttacattc 
acgagctgaa 
tgaccatttt 
cgacaccagg 
tccaaagaaa 
tagaatacaa 
tattccatta 
ccaaacatga 
gtccctgcgc 
cgaaaactaa 
tactagcgca 
aaaaatctcg 
ggtatgcaga 
gtcacggtcc 
acggccacga 
gccgcgtata 
tcggaaaaag 
acaaaaattt 
gttatttatt 



tgcggcccag 
ttgacaaagt 
tgtttcaaag 
aatcagcgag 
ttaagttaga 
tccgtaaaag 
aaaagctaat 
aataatatag 
gtatgtagat 
ctgtcaataa 
gtccctcatc 
tttttagtgc 
atcaaatccc 
tggagcccac 
tggccagcca 
gtatcgtgac 
tgaccacgat 
cagtcccgat 
gcgttatgtc 
ccatctggat 
gggtgtagcg 
gcgtcacatt 
ggcacttaat 
ccacacgaaa 
caattatgaa 
aatgattatg 
gactcaccta 
gaataccgag 
cggcaggttc 
ttattagttt 
ctttagcctg 
atatattcat 
gaggtttgat 
accattaatg 
cgtcgacaaa 
agtgagtgaa 
tctccacaca 
ggcggctgtg 
aaaatgtggc 
ttagtatgaa 
taaacaattt 
aatattttac 
tacaaagtaa 
ccgactaaac 
gaatcaattg 
gcccagaaaa 
atcccattaa 
ccacaacttt 
gcgctcgttg 
actgtgggtt 
tgctaatgtt 
acagtgcgta 
gtaattaccc 
agaggccaat 
gcatttgcct 
tactgattcc 
gacgctctgt 
tgcttgcaac 
aagaatgtat 
acaagaccat 



acggtaggtg 
atttttttgt 
taaggcagaa 
aaggaaaatg 
catatttcat 
tttcatttgc 
tttggaagca 
aaaaaattac 
attttgacga 
ctgactcacc 
aatccccaca 
gtgtgcaaag 
agccagacaa 
acttacattg 
atcgcagcaa 
atacagaata 
aatcagcatt 
ttgcatacca 
ccgtgaatcc 
gtagacaaag 
aaaggattgc 
gtaggtctca 
ccatgcaagt 
ggcaatggca 
cttcgatacg 
attgacgaat 
agtggattaa 
acatttacac 
caactctgca 
aaagtattgc 
agcatgatgt 
tttaattaaa 
atgatttaat 
aacagtggaa 
taaacgctcg 
aaagtgtgtg 
aagtcggccg 
acaggaccaa 
tggctttggg 
ttctgaggta 
gtactaccaa 
gtctaaaaca 
cgattcaatt 
agtattttca 
aagataatgg 
ctgttcacat 
cacataaacc 
tttcgacaag 
cccggaaatt 
tcaaaaactg 
gttggcttgt 
tgggcaattt 
gcattataat 
aatcacatcg 
ggtcgtggcc 
cccatagaaa 
gaatgaattg 
attattataa 
ccaatacata 
gcttttatga 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
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gtgaataaat tttgtgtgtg gtgcttagct tcttccaatc cattagtttc agtgttgaac 5820 

ttgtgaaaaa tcgcgcttgc tgacagaagc agattaattc aagtgtcagt tgtcagtcaa 5880 

ctgatggcag gacagacgac attcacgttc ctgctgactt gtaaacgtca aggaaattga 5940 

accgtatttt atagccaaag agggaatttt ataaatgcca ctttggacag tatatacctg 6000 

tatatatata tacctatatc caatttaata taaaatattg atagttaaac ccaactttga 6060 

tatgaatttc attagcatat ccgaacatca cctgtttcct catatcaaag tcaatgagat 6120 

ggttgagccc aagtttggtc tttaaattcg gcacccccca cactcatgaa tgattcaaga 6180 

tcaaagaact aacggcgaac tgtcagaaga aaaaaacact tgtcaatgtt tttgattgat 6240 

gatacgtcaa gtcaatttaa tccagttgga tgagtcgcgt ccccagttgc ttgaagggat 6300 

tagagagtat acttaccaaa ttctgcttcc tcctcggctc cgggcaggaa ttccaggcgg 6360 

gactcctgct cgattaagtc cattgccata acactttttc gttttttagt tttagttgtg 6420 

tatcctaatg aggatatgat atccttcaaa ccttgtattt tagttcgaat gctgctgctc 6480 

tatgtagtgc aatacatctt caggggcttc tgagcttcat tgcaatctaa actggaaaga 6540 

tataaattac agcatttgtt catttgtttt ctgaaaattc gcaaaaaagt tatctgccac 6600 

aagtccagtg gtcagggaac aacaaaagtg cagacaacaa gagaaacaaa tgatctgaaa 6660 

aaaaatcatt gaaaagctat acaagcaata tcagtcttgc cctcaaggaa ctgaccaaat 6720 

tcagtgggcg gacatgagtt atggctcagg tggaatcacc tgtcagcttt cacaagtgaa 6780 

aagtttgttc tgtataattg aaagtgcacc caaattcagt ttggcgactg agggcactgc 6840 

actagaattt cgcttgtatt tcgtttctta tgaaaacagt tcatgttttc atgtttttat 6900 

tgaagagtca tgccaacgct caaatgttgc cttagcctta ttgaggtggc tcctaaaaaa 6960 

taaacaagaa agtcagcaag ttcattgaat gtttgagaat agaagcctca ctcaggttaa 7020 

ttggccattt atcgaactgt taacattgtt gggacttgtt aatttatctt cttcagtgta 7080 

aacactgtca agacttgtta atttattggg taactaaatg tttattctat aggcagacgg 7140 

aattgttata gattgactta gggatgcttg taggaaatag gctgggaaat tggcgaatgt 7200 

cgtgaaaaaa agatgtattt agttttgaat gactgtacta tttcccccag aaagaaacag 7260 

gttcacaggc tttaagttta tatggaaatg caaaaatttg tttcgagcat taaattactc 7320 

attttattat gctggtcaca agctaaaaga actaaaagga gccaccctaa cgcatgaccc 7380 

caa 7383 



<210> 119 

<211> 1623 

<212> DNA 

<213> Drosophila 



<400> 119 

atggacttaa 

gaatttgagc 

gggggaatca 

accaggcaaa 

attattggcc 

ctgccgtggt 

gtattcacgc 

gcacgtccca 

ctatttgcgg 

gagaacaatg 

gatcaattgc 

gtcatcagct 

aacgcacagg 

ctgattatcg 

ctgtatgtca 

tgcgattggc 

gaaaaattta 

agcatggacg 

acctacgcca 

gtcaacaatg 

ggcatttaca 

caaagcgtgg 

atgccgccgt 



tcgagcagga 
gtctatacgc 
gtatcgtggc 
tgcggaccgt 
tcttctgcat 
tcatgtgcag 
tgaccgccat 
ccaagttcgt 
tgccctttgc 
agacctacaa 
aatcctttcg 
ttgtctacat 
attcacggga 
tggtcattat 
cgataccgga 
tggccatgag 
agcgggaatt 
cccacgaaag 
actcctcgat 
ggaagccggg 
acggaagtag 
ttacctttgc 
ggcggcgaaa 



gtcccgcctg 
ggctcccgct 
cgtcattggc 
gacaaatatg 
accatttcag 
cttctgcccc 
tgcaatcgat 
atcgaagttc 
cattgccttt 
tgtgacgcgg 
ctacaccctg 
ccagatggcg 
cataacgctg 
ctttggactc 
aatcaacgac 
caatagctgc 
caacaagcga 
gaccttttcg 
gcgaatccgg 
cttgcatatg 
tgggcagaac 
ggccactccg 
caacttcaaa 



gaattcctgc 
gagattgtgg 
aacactttgg 
tatatcgcta 
ttccaggctg 
ttcgtccagg 
cggcataggg 
ataattggtg 
cgtgtggagg 
ccattctgca 
gtttttgtgc 
gtacgattgt 
ttgaaaaaca 
tgctggctgc 
taccacttca 
tacaatccct 
tttgcggcct 
atgcacaccc 
agtaatctct 
ccgcgggtgc 
aacaatgtca 
ggtgtttcgg 
cctctgcatc 



ccggagccga 
ccctgttgtc 
tcatctgggt 
atttggcttt 
ccctgctgca 
ccctgagtgt 
ccatcattaa 
gaatttggat 
agttgaccga 
tgaacaagaa 
agtatctggt 
ggggcacacg 
agaagaaggt 
cactgcagct 
ttagcatcgt 
ttatttatgg 
gtttctgcaa 
gcgccagctc 
ttggtccggc 
atggatccgg 
atggccaaca 
caccaggtgt 
cgaacgtaat 



ggaggaagca 
cattttctat 
ggtggccacg 
tgccgatgtg 
gagttggaac 
aaatgtctcg 
tccacttagg 
gctggccctg 
aagatttcgc 
cctatccgat 
tccattctgt 
tgctcctggt 
catcaaaatg 
ctataatatt 
ctggtt.ttgc 
catctacaat 
gttcaagacg 
cataaggtca 
gcgtggtggt 
tgctaacagc 
tcatcagcat 
tggcgttgca 
cgaatgcgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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gacgacgtgg cactcatgga gctgccatca accacgcccc ccagcgagga gttggcatcc 1440 

ggggccggag tccagttggc cctgctaagc agggagagct ccagctgcat ttgcgaacag 1500 

gaatttggca gccaaaccga atgcgatggc acctgcatac tcagcgaggt gtcgcgagtc 1560 

cacctgcccg gctcgcaggc gaaggacaag gatgcgggca agtccttgtg gcaaccactt 1620 

taa 1623 

<210> 120 
<211> 540 
<212> PRT 
<213> Drosophila 

<400> 120 

Met Asp Leu He Glu Gin Glu Ser Arg Leu Glu Phe Leu Pro Gly Ala 

15 10 15 

Glu Glu Glu Ala Glu Phe Glu Arg Leu Tyr Ala Ala Pro Ala Glu He 

20 25 30 

Val Ala Leu Leu Ser He Phe Tyr Gly Gly He Ser He Val Ala Val 

35 40 45 

He Gly Asn Thr Leu Val He Trp Val Val Ala Thr Thr Arg Gin Met 

50 55 60 

Arg Thr Val Thr Asn Met Tyr He Ala Asn Leu Ala Phe Ala Asp Val 
65 70 75 80 

He He Gly Leu Phe Cys He Pro Phe Gin Phe Gin Ala Ala Leu Leu 

85 90 95 

Gin Ser Trp Asn Leu Pro Trp Phe Met Cys Ser Phe Cys Pro Phe Val 

100 105 110 

Gin Ala Leu Ser Val Asn Val Ser Val Phe Thr Leu Thr Ala He Ala 

115 120 125 

He Asp Arg His Arg Ala He He Asn Pro Leu Arg Ala Arg Pro Thr 

130 135 140 

Lys Phe Val Ser Lys Phe He He Gly Gly He Trp Met Leu Ala Leu 
145 150 155 160 

Leu Phe Ala Val Pro Phe Ala He Ala Phe Arg Val Glu Glu Leu Thr 

165 170 175 

Glu Arg Phe Arg Glu Asn Asn Glu Thr Tyr Asn Val Thr Arg Pro Phe 

180 185 190 

Cys Met Asn Lys Asn Leu Ser Asp Asp Gin Leu Gin Ser Phe Arg Tyr 

195 200 205 

Thr Leu Val Phe Val Gin Tyr Leu Val Pro Phe Cys Val lie Ser Phe 

210 215 220 

Val Tyr He Gin Met Ala Val Arg Leu Trp Gly Thr Arg Ala Pro Gly 
225 230 235 240 

Asn Ala Gin Asp Ser Arg Asp He Thr Leu Leu Lys Asn Lys Lys Lys 

245 250 255 

Val He Lys Met Leu He He Val Val He He Phe Gly Leu Cys Trp 

260 265 270 

Leu Pro Leu Gin Leu Tyr Asn He Leu Tyr Val Thr He Pro Glu He 

275 280 285 

Asn Asp Tyr His Phe He Ser He Val Trp Phe Cys Cys Asp Trp Leu 

290 295 300 

Ala Met Ser Asn Ser Cys Tyr Asn Pro Phe He Tyr Gly He Tyr Asn 
305 310 315 320 

Glu Lys Phe Lys Arg Glu Phe Asn Lys Arg Phe Ala Ala Cys Phe Cys 

325 330 335 

Lys Phe Lys Thr Ser Met Asp Ala His Glu Arg Thr Phe Ser Met His 

340 345 350 

Thr Arg Ala Ser Ser He Arg Ser Thr Tyr Ala Asn Ser Ser Met Arg 

355 360 365 

He Arg Ser Asn Leu Phe Gly Pro Ala Arg Gly Gly Val Asn Asn Gly 
370 375 380 
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Lys 


Pro 


Gly 


Leu 


His 


Met 


Pro 


Arg 


Val 


His 


Gly 


Ser 


Gly 


Ala 


Asn 


Ser 


385 










390 




















4UU 


Gly 


He 


Tyr 


Asn 


Gly 


Ser 


Ser 


Gly 


Gin 


Asn 


Asn 


Asn 


Val 


Asn 


Gly 


Gin 










405 










410 










415 




His 


His 


Gin 


His 


Gin 


Ser 


Val 


Val 


Thr 


Phe 


Ala 


Ala 


Thr 


Pro 


Gly 


Val 








420 










4 25 










4 30 






Ser 


Ala 


Pro 


Gly 


Val 


Gly 


Val 


Ala 


Met 


Pro 


Pro 


Trp 


Arg 


Arg 


Asn 


Asn 






435 










440 










4 4 5 








Phe 


Lys 


Pro 


Leu 


His 


Pro 


Asn 


Val 


He 


Glu 


Cys 


Glu 


Asp 


Asp 


Val 


Ala 




450 










455 










4 60 










Leu 


Met 


Glu 


Leu 


Pro 


Ser 


Thr 


Thr 


Pro 


Pro 


Ser 


Glu 


Glu 


Leu 


Ala 


Ser 


4 65 










470 










475 










4 Ol) 


Gly 


Ala 


Gly 


Val 


Gin 


Leu 


Ala 


Leu 


Leu 


Ser 


Arg 


Glu 


Ser 


Ser 


Ser 


Cys 










485 










490 










495 




He 


Cys 


Glu 


Gin 


Glu 


Phe 


Gly 


Ser 


Gin 


Thr 


Glu 


Cys 


Asp 


Gly 


Thr 


Cys 








500 










505 










510 






He 


Leu 


Ser 


Glu 


Val 


Ser 


Arg 


Val 


His 


Leu 


Pro 


Gly 


Ser 


Gin 


Ala 


Lys 






515 










520 










525 








Asp 


Lys 


Asp 


Ala 


Gly 


Lys 


Ser 


Leu 


Trp 


Gin 


Pro 


Leu 












530 










535 










540 











<210> 121 
<211> 4391 
<212> DNA 
<213> Drosophila 

<400> 121 

ttccgccgaa aaatcgattg attttcaatt 60 
ttgaaagtta tttgctttaa ctcatgctga 120 
aaccttttat tcgaatttta tccttagttg 180 
tgatatcagt cgatatctct ttaattggcc 240 
tagttacgaa agtaatcggc ttaagagccc 300 
aggacttgca atgcactact gtctaatcca 360 
caagtatttt aaaactgaat tgtgagccgt 420 
ttgggcttaa agcagctgta gtggcaaata 480 
tttcgttcca catgacatga catcatggca 540 
gaagcctcgg gtttgcttaa atttttatgg 600 
ttgccagcta ctggatgttc agtagcaggg 660 
aaaattgaga atgaatggaa aatgggttac 720 
tttaattgaa ggcaaaaagg gatatatatt 780 
catgcgaaaa taaaaaattt agtattaatt 840 
tttatattag aatttttaaa ttttaattta 900 
ataaaatata acgttattaa tacattggtc 960 
tttagtatat ctactgatgt gatctgctgt 1020 
ggagctcagt ggtgcgcttt tggacatcca 1080 
cccacagccg acggttcgtg gagtccaaga 1140 
gcatagtggc cgtgatgcag aagaggtatg 1200 
atattccaaa gatccaggta agacccagca 1260 
gcttaaccac cattttcttt tcgtttttat 1320 
tgtagaaaac atacacgaat atcaccaagt 1380 
ccaccccaaa aatgagacca tagcccgatg 1440 
gggcagccga ggggacgtac gaatcgggat 1500 
agggcagtag ccaggccaca atggcggcct 1560 
cgataactgt gacgtaacgc tggaactgca 1620 
gaaccagaat ggagtactgc atggcagcgc 1680 
tcacaacaag cgcttcggat acatcatctg 1740 
gacacatagc caggcaaagg tgcagtaaga 1800 
tgaagagtgc ggcagtgagg aatattccca 1860 
ctatcgagat aatgtccagc actttctcat 1920 
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agtggaaatt tcgaaattga aatggaccct 
tcccctctgc gctttcttgt tttgctaagt 
aagaaaatat cagttgacta gccgcttttt 
ttgcacaaaa gagtctatgt gcttagttga 
aataactgct gaatatgtgg cagcacaata 
acactgcgat gtgcatttta gtgcattaaa 
gattacgacg acagccaatg gctttaagcg 
gattccagct ggagttaaat aggaaaactg 
ccaatcgagc atggcttcaa atcacgttca 
atggtatcaa agtgcacact tgtagacaca 
cctcaacact taccaggtgc acagcttttc 
aaatcgacag ctggaagctc gagtatcaag 
cactctcttt gttgcagaca aatttctcct 
tttaaatagt tggcttctga cagagactta 
aaatcgactc gctgggaagt attgtcttat 
gggacctata cataatggca atcaattagc 
ctggaccttt gttcgtcggg gaatacgtat 
tgtagttggt ggatgaggtg gtcatcgact 
gctccatctt ggtggggcaa atcagaccca 
ggacaaagta aatgaacatc acaaatccct 
aaaaagccac acctgcctgc atgaaggcga 
aaaagaagag cattatggag agtcggatct 
ggatgctctg gctcaaagag tgcgagatgc 
tgcacacagt aatcaaggtc acagggagca 
gatagcagat gccagtgtcc gtggaaagtt 
ctataagggc caccagcagg gtggggacca 
tcaggatata gcgaggtggt ctctcgattc 
ggaaggcgat gattagcatc caactgaaga 
ccagagccac acagcgaact gtgtttccat 
tattgagaaa cacgaagagc atcatctgga 
ccttcgtgga ggcctgactg cgccagcttt 
gtatacccaa aagggaaaga ctgcagccca 
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taatcggagt 
ggaatgcaaa 
ccgaactgct 
atccactccc 
gaaagggcag 
cgagtattcg 
agattacttt 
cgtgggcata 
gattctccgg 
cttcggagaa 
ctggcgcaga 
cgctgatatt 
ctccagtagt 
tgtccttggg 
gggaattggt 
tctccttgca 
gggcgccgaa 
gtcggaacat 
gcctcctgcc 
ccctgaatcg 
ggttgtcctt 
cgtgcagggt 
gatcgtggtt 
tacagtaaac 
agtactgaaa 
tcggcgaaac 
tgaatcaggt 
tagtactttt 
gattagattt 
taatcgaccc 
gcatttgata 
ggaacccttt 
tagcgatttc 
ttacgcgctt 
atttatacat 
ttttatttgc 
tatggcttct 
tcactcaagc 
ccttggttgc 
cttcgattat 
atttttacag 
ttgacacgtt 



aattagaatc 
ctgtgtgagg 
ttcagtgctt 
aatgctaaag 
tggcagagag 
aaaacaaatt 
gctgcgcggc 
gactttaaag 
caaaaatgca 
tcgaatgaag 
gaatatagct 
gccggtaatc 
ggccgtttct 
aaccagttgc 
agcgttgagc 
gacactgacc 
catttgtccg 
gttcctcatt 
ctcgagaacg 
cctcaaacac 
cactctgcac 
ggcacgttcg 
acgcgttatg 
cagatgtgtt 
cgttccgttt 
caaggccaag 
gccacagctg 
cacgtccgac 
ctgtagaccc 
ccgttttttg 
caatttacat 
gaggcaggga 
acgccttttt 
gaattgccag 
caatgataca 
atccgttcaa 
aataaatgtg 
aaaacagatc 
ataaagtgcg 
aatccacctg 
ccgacttcgc 
t 



tcctcgccaa 
tggttagtgt 
actccctcag 
gcctttgaat 
agataattgc 
ttcatgcgtt 
ttacgagtgc 
atgagatagg 
gctgtctcca 
cggtaccaga 
aatccagtta 
agagcttgga 
gcttcatcac 
tccggaagag 
tgagctgata 
aagtccacac 
gtcatgtgaa 
atgccagtca 
tcgtcatgca 
accacgggat 
tccctggtga 
ccgattctag 
tagtcgtgct 
tgcataagtt 
ctcaagcaca 
accacaaaga 
tttgcagctc 
gttcttgaaa 
cttcttcatg 
tatatctaca 
tgtgattaac 
agtaacacca 
accggtcttg 
ataagaagtt 
aaaataatgt 
gaaagagatt 
aaatgtgaaa 
ctttgtcctt 
attttcattt 
tactcacttc 
agcacgtaga 



ggtcgttggc 
ggcactcgat 
ttgaccaggt 
ccggattgcg 
ctatttaatg 
tcttacttct 
gttgcaatga 
ttgctcccct 
gattagattc 
agccactggc 
tgcgatcgca 
caccgatatc 
tggtcggctt 
catccatgta 
atctaagaac 
tgaccgtggc 
catcggcggg 
tctcacgacg 
ggctattcaa 
cccagtgttc 
ccggcattcc 
cccgtttcca 
gaaactcctc 
cgtccgatga 
tttgattttc 
gcccagcaca 
tccgaaaatg 
tatcgcggca 
gtgtttgcta 
cactactcac 
acgaccacgg 
aacttgtgac 
gtacttaaaa 
cttaccagga 
taaaaaaaat 
ttttttagat 
caccttcaca 
gaaaaaagtt 
tgacaggcta 
attgtaagcc 
ctgcgtgtgc 



tcgatagctt 
gatggcatcc 
ttcatagttc 
aagattttca 
gaagggtatt 
cagtcctggt 
ggtttccacg 
gcttttcact 
ctccttgacc 
ggaagtcttt 
ttcggggttg 
tgagaagttg 
ggccaccacc 
gctttcgaac 
cttactgtcg 
atccttgtcc 
tagaagattc 
cccctgggtg 
gtcaccgctg 
ggtggactcc 
gtttcgcaga 
cttgttctgg 
cggctcgcag 
attacgttcc 
ctttgggcag 
ccagttgaaa 
tttacaaatt 
cactgagctg 
tcagttgcca 
aaaaagacct 
caattgaagg 
gtcgtcttta 
tgctctggct 
aacacggctg 
ttaaaaactt 
ttgtttgatc 
taaaaaagta 
atgaaatgca 
cgcgttttca 
agtttttggg 
tagcggaaat 



ccacccacca 
ttcaagaggt 
caatatccgc 
ttacgcagaa 
caaattagtt 
attgaaatgg 
aacagagcat 
tgccgccaga 
ttagccaagt 
cgttgatccc 
gcgaagaaca 
tacgtttcca 
ttgccgcatc 
tggctgagca 
caggacatta 
tgctgcatca 
ccacctctct 
tcattggtgc 
atcgtgtgct 
cagttggctc 
aggcaaacat 
tgaggtgttt 
aacatggacg 
tctccatcac 
gcggaataaa 
ttcatggcga 
ctttggcaga 
tctgtcgatc 
ccacgagcgc 
caagtctgcg 
cgaatccgac 
tttgtgcaaa 
gattcatttt 
tacatgggct 
taaaaactag 
aattcattgc 
tttacgcaat 
aagcacttca 
aagactcagc 
ccaaaaagta 
tgttagcttg 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4391 



<210> 122 
<211> 2325 
<212> DNA 
<213> Drosophila 



<400> 122 

atgtgcttga 

gaacttatgc 

cagcacgact 

gctagaatcg 

gtcaccaggg 

gatcccgtgg 

ctgcatgacg 

atgactggca 

gttcacatga 

agtgtggact 

ttatcagctc 



gaaacggaac 
aaacacatct 
acataacgcg 
gcgaacgtgc 
agtgcagagt 
tgtgtttgag 
acgttctcga 
taatgaggaa 
ccggacaaat 
tggtcagtgt 
agctcaacgc 



gtttcagtac 
ggtttactgt 
taaccacgat 
caccctgcac 
gaaggacaac 
gcgattcagg 
gggcaggagg 
catgttccga 
gttcggcgcc 
ctgcaaggag 
taccaattcc 



tgtgatggag 
acgtccatgt 
caaacacctc 
gatgtttgcc 
cgagccaact 
gagcacacga 
cgcaccaatg 
cagagaggtg 
ctgatgcagc 
ataatgtcct 
ctgctcagcc 



aggaacgtaa 
tctgcgagcc 
accagaacaa 
ttctgcgaaa 
gggagtccac 
tcagcggtga 
acacccaggg 
ggaatcttct 
aggacaagga 
gcgacagtaa 
agttcgaaag 



ttcatcggac 
ggaggagttt 
gtggaaacgg 
cggaatgccg 
cgaacactgg 
cttgaatagc 
gcgtcgtgag 
acccgccgat 
tgccacggtc 
ggttcttaga 
ctacatggat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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gctcttccgg agcaactggt tcccaaggac agatgcggca aggtggtggc caagccgacc 720 

agtgatgaag cagaaacggc cactactgga gtggaaacgt acaacttctc agatatcggt 780 

gtccaagctc tgattaccgg caatatcagc gtgttcttcg ccaaccccga atgcgategc 840 

ataactggat tagctatatt ctctgcgcca ggggatcaac gaaagacttc cgccagtggc 900 

ttctggtacc gcttcattcg attctccgaa gacttggcta aggtcaagga ggaatctaat 960 

ctggagacag ctgcattttt gccggagaat ctctggcggc aagtgaaaag caggggagca 1020 

acctatctca tctttaaagt ctatgcccac gatgctptgt tcgtggaaac ctcattgcaa 1080 

cgcactcgta agccgcgcag caaagtaatc tccatttcaa taccaggact gagaagcaat 1140 

tatctctctc tgccactgcc ctttcttctg cgtaatgaaa atcttcgcaa tccggattca 1200 

aaggccttta gcattgggag tggatgcgga tattggaact atgaaacctg gtcaactgag 1260 

ggagtaagca ctgaaagcag ttcggacctc ttgaaggatg ccatcatcga gtgccacact 1320 

aaccacctca cacagtttgc attcctggtg ggtggaagct atcgagccaa cgaccttggc 1380 

gaggagattc taattactcc gattaatgag aaagtgctgg acattatctc gatagtgggc 1440 

tgcagtcttt cccttttggg tatactggga atattcctca ctgccgcact cttcaaaagc 1500 

tggcgcagtc aggcctccac gaaggtctta ctgcaccttt gcctggctat gtgtctccag 1560 

atgatgctct tcgtgtttct caatacagat gatgtatccg aagcgcttgt tgtgaatgga 1620 

aacacagttc gctgtgtggc tctgggcgct gccatgcagt actccattct ggttctcttc 1680 

agttggatgc taatcatcgc cttcctgcag ttccagcgtt acgtcacagt tatcggaatc 1740 

gagagaccac ctcgctatat cctgaaggcc gccattgtgg cctggctact gcccttggtc 1800 

cccaccctgc tggtggccct tatagatccc gattcgtacg tcccctcggc tgcccaactt 1860 

tccacggaca ctggcatctg ctatccatcg ggctatggtc tcatttttgg ggtggtgctc 1920 

cctgtgacct tgattactgt gtgcaacttg gtgatattcg tgtatgtttt ctacagcatc 1980 

tcgcactctt tgagccagag catccataaa aacgaaaaga aaatggtggt taagcagatc 2040 

cgactctcca taatgctctt ctttttgctg ggtcttacct ggatctttgg aatattcgcc 2100 

ttcatgcagg caggtgtggc tttttcatac ctcttctgca tcacggccac tatgcaggga 2160 

tttgtgatgt tcatttactt tgtcctcttg gactccacga accgtcggct gtgggtgggt 2220 

ctgatttgcc ccaccaagat ggagctggat gtccaaaagc gcaccactga gctccagtcg 2280 

atgaccacct catccaccaa ctacaacagc agatcacatc agtag 2325 

<210> 123 
<211> 774 
<212> PRT 
<213> Drosophila 

<400> 123 



Met 


Cys 


Leu 


Arg 


Asn 


Gly 


Thr 


Phe 


Gin 


Tyr 


Cys 


Asp 


Gly Glu 


Glu 


Arg 


1 








5 










10 










15 




Asn 


Ser 


Ser 


Asp 


Glu 


Leu 


Met 


Gin 


Thr 


His 


Leu 


Val 


Tyr 


Cys 


Thr 


Ser 








20 










25 










30 






Met 


Phe 


Cys 


Glu 


Pro 


Glu 


Glu 


Phe 


Gin 


His 


Asp 


Tyr 


He 


Thr 


Arg 


Asn 






35 










40 










45 








His 


Asp 


Gin 


Thr 


Pro 


His 


Gin 


Asn 


Lys 


Trp 


Lys 


Arg 


Ala 


Arg 


He 


Gly 




50 










55 










60 










Glu 


Arg 


Ala 


Thr 


Leu 


His 


Asp 


Val 


Cys 


Leu 


Leu 


Arg 


Asn 


Gly 


Met 


Pro 


65 










70 










75 










80 


Val 


Thr 


Arg 


Glu 


Cys 


Arg 


Val 


Lys 


Asp 


Asn 


Arg 


Ala 


Asn 


Trp 


Glu 


Ser 










85 










90 










95 




Thr 


Glu 


His 


Trp 


Asp 


Pro 


Val 


Val 


Cys 


Leu 


Arg 


Arg 


Phe 


Arg 


Glu 


His 








100 










105 










110 






Thr 


lie 


Ser 


Gly 


Asp 


Leu 


Asn 


Ser 


Leu 


His 


Asp 


Asp 


Val 


Leu 


Glu 


Gly 






115 










120 










125 








Arg 


Arg 


Arg 


Thr 


Asn 


Asp 


Thr 


Gin 


Gly 


Arg 


Arg 


Glu 


Met 


Thr 


Gly 


He 




130 










135 










140 










Met 


Arg 


Asn 


Met 


Phe 


Arg 


Gin 


Arg 


Gly 


Gly 


Asn 


Leu 


Leu 


Pro 


Ala 


Asp 


145 










150 










155 










160 


Val 


His 


Met 


Thr 


Gly 


Gin 


Met 


Phe 


Gly 


Ala 


Leu 


Met 


Gin 


Gin 


Asp 


Lys 










165 










170 










175 




Asp 


Ala 


Thr 


Val 


Ser 


Val 


Asp 


Leu 


Val 


Ser 


Val 


Cys 


Lys 


Glu 


He 


Met 








180 










185 










190 






Ser 


Cys 


Asp 


Ser 


Lys 


Val 


Leu 


Arg 


Leu 


Ser 


Ala 


Gin 


Leu 


Asn 


Ala 


Thr 
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195 










200 










205 








As n 


Ser 


Leu 


Leu 


Ser 


Gin 


Phe 


Glu 


Ser 


i yx 


Met 


Asp 


Ala 


Leu 


Dv-A 

r x kj 






210 










215 










220 












Leu 


Val 


Pro 


T 

j_jy s 


Ben 


flrn 
r\L y 


Ly o 


Pi v 


ys 


Val 


Val 


Ala 


i-»y s 


r x u 


Thr 

1 I IX 


225 










230 










235 










24 0 


Se 3T 


rto 


VJJ.U 


Ala 


OlU 


Thr 


/Aid 


Thr 


Thr 


Gly 


Val 


Glu 


Thr 


1 y r 


no i 1 


php 

r i ic 










24 5 










250 










255 




Oc 1 


nop 


Tin 
lit? 


r 1 ,/ 
uiy 


vdl 


CZ 1 r-i 
oil 1 


Ala 
Al d 


Leu 


lie 


Th r 

1 I IX 


ftl V 


As n 


T 1 p 

X 1 c 


Ocl 


Va 1 
Vdl 


Pho 

rne 








£. DU 




















£. / U 






ri ic 


Ala 


Sen 
Ao 1 1 


P rn 

t 1 \J 


ulU 


uy i> 


A cn 
A5p 


A vn 

mx y 


T 1 p 

lie 


Th r 

1 X IX 


uiy 


Leu 


A 1 a 
rti d 


Tl a 

lie 


Pho 

rne 


oer 






275 




















285 








nl a 


Dv-n 


bi y 


Asp 


p i « 

Lai n 


Arg 


Lys 


Th r* 

i nr 


C V 

oer 


Ala 
Aid 


Ocl 


oiy 


rile 


Trp 


Tyr 


Arg 




? Qn 

£ yu 










OQC. 

£y*j 










*^nn 










Oho 


T1a 
11c 


Arg 


rile 


Ser 




Asp 


Leu 


Ala 
Mid 


Lys 


Va 1 
vdl 


Lys 


CZ 1 n 


i^i ii 

OlU 




Asn 












jIU 










jIj 










ion 


Leu 




i nr 


Ala 


Ala 
nl d 


Ph c» 

rne 


Leu 


Pro 


r*i n 

olU 


As n 


Leu 


i rp 


Arg 


pi n 

bin 


\7a "1 
vdl 


Lys 




















JJU 










ooc 

J J J 




Ser 


Arg 


biy 


Aid 


i nr 


Tyr 


Leu 


lie 


rne 


Lys 


17-. 1 

Vdl 


Tyr 


Ala 

Aia 


HIS 


Asp 


TV 1 — 

Aia 








~\ a n 










J4 J 










JJU 






Leu 


rne 


Val 

va l 


blU 


Th >- 

i nr 


oer 


Leu 


Pi T-l 

bin 


Arg 


i nr 


Arg 


Lys 


Pro 


Arg 


oer 


Lys 
















J DU 










jOj 








vai 


lie 


oer 


lie 


Car 

oer 


Tift 

lie 


Pro 


bi y 


Leu 


Arg 


oer 


Asn 


Tyr 


Leu 


ber 


Leu 














J / D 










■son 
J o u 










D v /-\ 


Leu 


Pro 


r ne 


Leu 


Leu 


Arg 


Asn 


r*i ii 

olU 


Asn 


Leu 


Arg 


Asn 


Pro 


Asp 


Cav. 

oer 


385 










390 










395 










d nn 

H U U 


Lys 


ril d 


tile 


c o -r 

Ocl 


T 1 p 
lie 


uiy 


O ex 


f^l \7 

oi y 


^ys 


ft:i w 
oi y 


iyr 


T 1 y- v-s 

i rp 


Asn 


Tyr 


bl u 


i nr 










d n ^ 

M U J 










a l n 

*a 1 U 










1 1 j 




Trp 


oer 


i nr 


Pin 


ui y 


Vdl 


Ser 


rp U y- 

i nr 


bill 


Ser 


OCX 


oer 


Asp 


Leu 


Leu 


Lys 




























^ JU 






Asp 


Ala 


Tl A 

lie 


i ie 




Cys 


Ml S 


i nr 


Asn 


HIS 


Leu 


i nr 


bin 


rne 


Ii 1 a 

Aia 


nU 

rne 






/IOC 

ft JO 










4 4 u 










A A ^ 
fl *■! D 








Leu 


val 


biy 


P 1 W 


Ser 


Tyr 


Arg 


Aia 


Asn 


Asp 


Leu 


biy 


blU 


biU 


Tin 

lie 


Leu 














^ 3D 










/i cn 
^ bU 










lie 


i nr 


Pro 


lie 


Asn 


Glu 


Lys 


Va 1 
Vdl 


Leu 


Asp 


ne 


lie 


Ser 


T 1 si 

lie 


i/-i 
vai 


pi ., 
biy 


A £ ^ 
3 0 D 










470 










A 1 












Cys 


oer 


Leu 


oer 


Leu 


Leu 


biy 


Tl Q 

l le 


Leu 


biy 


Tl A 

lie 


rne 


Leu 


l nr 


Til - 

Aia 


Ala 










485 










/i on 














Leu 


rne 


Lys 


Ser 


Trp Arg 


oer 


bin 


Aia 


oer 


i nr 


Lys 


vai 


Leu 


Leu 


HIS 








jUU 




















jlu 






Leu 


Cys 


Leu 




Met 


Cys 


Leu 


bin 


Met, 


Met 


Leu 


rne 


if— i 
vai 


rne 


Leu 


Asn 






oi j 




























Th t- 
1 i 1 x 


Asp 


Asp 


Vdl 


Ser 


Glu 


Ala 
Al d 


Leu 


Va 1 
vdl 


\/a 1 
Vdl 


Asn 


biy 


Asn 


Th y- 

i nr 


Va 1 
Vdl 


Arg 




J jU 










^ "3 S 
J J J 










c^n 
Jl u 












val 


A 1 a 
tti d 


leu 


Gly Ala 


Ala 


Met 


r;i n 


i yr 


Ser 


T 1 p 
lie 


Leu 


Va 1 
vdl 


Le u 


Pho 
rile 


54 5 










550 










555 










J DU 


Ser 


i rp 


Met 


Leu 


He 


He 


Ala 


Phe 


Leu 


Gin 


r iits 


n 1 n 

O 1 1 1 


nl y 


Twr 1 

i yr 


Va 1 
vdl 


Th t- 

1 I1X 










565 










570 










575 




Val 


T 1 p 

X 1 e 


Gly 


He 


Glu 


Arg 


Pro 


Pro 


Arg 




He 


Leu 


T 

i»y s 


Al a 
rti d 


Al a 
r\i d 


Tip 
lie 








580 










585 










^ Qn 

d y u 






Ua 1 
Vdl 


Al a 


Trn 

i rp 


T .on 


Leu 


Pro 


lieu 


Val 


n u 


Thr 

X i IX 


Leu 


Leu 


Va 1 
vdl 


Al a 

nld 


Leu 


Tlo 
lie 
















DU u 










DUD 








Asp 


Pro 


Asp 


oer 


Tyr 


Val 


Pro 


oer 


Ala 

Aia 


Ala 

Aia 


nl r-k 

bin 


Leu 


oer 


fh v 

i nr 


Asp 


i nr 




610 










615 










620 










Gly 


He 


Cys 


Tyr 


Pro 


Ser 


Gly 


Tyr 


Gly 


Leu 


He 


Phe 


Gly 


Val 


Val 


Leu 


625 










630 










635 










640 


Pro 


Val 


Thr 


Leu 


He 


Thr 


Val 


Cys 


Asn 


Leu 


Val 


He 


Phe 


Val 


Tyr 


Val 










64 5 










650 










655 




Phe 


Tyr 


Ser 


He 


Ser 


His 


Ser 


Leu 


Ser 


Gin 


Ser 


He 


His 


Lys 


Asn 


Glu 








660 










665 










670 






Lys 


Lys 


Met 


Val 


Val 


Lys 


Gin 


He 


Arg 


Leu 


Ser 


He 


Met 


Leu 


Phe 


Phe 
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675 

Leu Leu Gly Leu 
690 

Gly Val Ala Phe 
705 

Phe Val Met Phe 

Leu Trp Val Gly 
740 

Lys Arg Thr Thr 
755 

Asn Ser Arg Ser 
770 



680 

Thr Trp lie Phe 
695 

Ser Tyr Leu Phe 
710 

He Tyr Phe Val 
725 

Leu He Cys Pro 

Glu Leu Gin Ser 
760 

His Gin 



Gly He Phe Ala 

700 

Cys He Thr Ala 
715 

Leu Leu Asp Ser 
730 

Thr Lys Met Glu 
745 

Met Thr Thr Ser 



685 

Phe Met Gin Ala 

Thr Met Gin Gly 
720 

Thr Asn Arg Arg 
735 

Leu Asp Val Gin 
750 

Ser Thr Asn Tyr 
765 



<210> 124 
<211> 7986 
<212> DNA 
<213> Drosophila 



<400> 124 

gtgttaatcc 

ggcgcatatc 

gatcttgtat 

acataaagtg 

actctcagta 

gcagacactt 

gtcgcctcaa 

aattcctctg 

caatacgaga 

tgaattgtaa 

gttttggtgt 

cgccccattt 

tatgtggcca 

ttgtatgcac 

tgacgccagc 

attaagtgac 

cgaataaggt 

ggatgccatg 

ctcgaactcc 

tgtgatggcc 

ggtgagcatg 

caccgtggtg 

cgtaaccaac 

gatgcctcca 

tgcccatcga 

gaatcggggc 

tgcccaaatc 

aaattataaa 

agtggtgtgc 

ctatattttt 

taatttagta 

atttcataaa 

caaagtttaa 

aattgaagga 

aagcacggcg 

gacatttgga 

atcagtctgg 

atgaccacag 

ggtgcaaaaa 



ccttggctta 
tcagcgatgt 
aaatacgagt 
agatactcgt 
gccaagtact 
taggagtggg 
cagcacagcg 
ggaggtggag 
atgggcgaat 
atggccatat 
cggcagtcta 
tgtgtatttc 
cagtccggcc 
ttctggacgc 
gcgtttgggg 
ctacgtttcg 
ccaacctaag 
ctgccatact 
tccgatggta 
accggagccg 
gcgtggccca 
ggcaacacac 
tgtttcgtga 
tcggtggtgc 
tggccaaaga 
aagtgtttac 
gatgggggag 
atttctccgt 
gagtgtttta 
agcagaattt 
attttaaata 
tgtataaaaa 
tattccttgt 
tctgccagtg 
acaaatattt 
tctcgctgga 
acaggtgagt 
tccatagccc 
cagtgcaaat 



tcggtttaat 
tgggtaggta 
ggatagttag 
tttcgagaga 
cagtactcaa 
atagccggag 
atttcgattc 
catgtggctg 
ataagtttgc 
taattgggaa 
tgcatgttcg 
gcgtgctgtc 
gtaattggaa 
tccggttgcc 
acgacgttac 
cccacacaga 
tgagtgagct 
ccaaattgtt 
ctttgctgct 
gagccggaaa 
agtcgctggc 
tcgtcatcct 
tgaacttggc 
tctacataac 
tttatggctc 
tattcgaagc 
cacatttggt 
tggcaaaata 
tgggtttcgc 
tgaataaagc 
aattatgtat 
gcaatcaaac 
cataacttga 
taatgcctcc 
gttctttgca 
catcctgctc 
tggccaacta 
gttgtccacg 
aatttatttg 



ctatttataa 
accaaaagta 
ttagatatca 
tgcacttctg 
gctgtttgtc 
cagaccggct 
cgattccaaa 
atttatttat 
tcgaacggct 
tcatttaaaa 
gtggcataat 
caaggccaat 
attgtgtaag 
gggtctagca 
ctagcccagt 
atcatcaatt 
atccgccggc 
gtcaacagcc 
catccagacc 
gttgggtgtg 
agtggccgtc 
cgccgttctg 
catcaccgac 
aggtgagaat 
tgtgggggct 
cctccggtgg 
cgatttcgat 
tacatatttt 
tggaaaacct 
ataacaacat 
ctttaataat 
atgtgatctt 
acaatacaaa 
ttaatcccgg 
ggcacttggc 
tgcagcggct 
atttcgtaat 
gtcctcggtc 
ttgacgtttt 



aatcaattca 
aatacactct 
ccattttgca 
agtacccgta 
ttacgggtgt 
gaagtgtcat 
tccgattctt 
ttcgagcgat 
ggtccgttcc 
tttccaatga 
ttatgggcgc 
tacgcttact 
gtcgcaggca 
tttttctagc 
tttccctttg 
tgttcatgtc 
atggcggcgt 
ctcactgtgg 
agtgcggcac 
ccctcgatga 
ttcctggtca 
accacacgcc 
tggctggtcg 
ggcagcggag 
ttacacttcc 
aagttcgggt 
ttaaaataat 
tgattgcaaa 
ttgataattt 
tagtggttaa 
taacatctct 
ttcgaagcca 
gaagaattta 
ccaggtgcac 
gctttggctg 
ccatcctcag 
ttgccgttga 
cttagataaa 
tattgccgca 



aacataaagc 
aattcctatt 
ccgtgcataa 
ttcaaatcgt 
ggacgtgcga 
ccgatcgaag 
tggccatagg 
ataaacagag 
gagctgaaat 
ttttttctgt 
taattactat 
tattgcaatt 
atttcattat 
atttttctaa 
cagccatttg 
gccgacggtg 
tggaacatcc 
ccagtggtaa 
caggtggcga 
actcgtacct 
tcatcctggt 
gcctgcgcac 
gcacctgcgt 
ggcctgtcca 
ggggaaggtg 
agtaaacatt 
cccacagcga 
atatcttgtg 
tttctggaat 
aaattttgta 
tatttaacgc 
aatgaatata 
tagtggttta 
aaaccgattg 
gatactgtgc 
cctgtgcgcc 
tgtcatagca 
gtcatggcaa 
tgaccgttgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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cctggccatt 
tgggcttgga 
accgtttgct 
gtggctcccg 
aggacaaaca 
ctgcggcttt 
catttccgtt 
gttttatatt 
gatccaagcg 
catgtccacc 
ttaacccatg 
aattgaacgc 
aaggaagaaa 
tatggccaag 
aattttgtaa 
ttattgagca 
gtggcttttt 
gtgcggtggc 
tatattatat 
ttgctgttgc 
ttgtttgccg 
cgcacacttg 
actcgcactc 
ttagttttgg 
gtctgaggcc 
tgggggaaaa 
taagcggact 
ttactcacaa 
ctatttaaat 
tttttaattt 
attaatgtta 
taccaacctt 
tagcagtaat 
agttgaatta 
ttgaaagaga 
gcatatttgt 
aatcgtgtgg 
tctgaaataa 
aaggatatat 
aagaatagtg 
gtggatgtat 
acctttagtt 
tcttccgcat 
gcgatgtgca 
tttgaattcg 
cccagccgca 
catccaccac 
aatttgccat 
tggactgccg 
tcattcccct 
ggcgacagcg 
catatatgta 
tcacacctaa 
agatccagga 
gggtatccca 
ctgagctttg 
attcttggat 
gtgtcaataa 
atagtagttg 
caagtgaagt 



cccagtgcat 
ttcccattca 
tttgttggaa 
agttgagctg 
ccgacgccca 
gcgaacaata 
gcgtttattg 
tcgcaggtat 
actggccctc 
ctatttgggc 
atggatcgcc 
attaaattgg 
aataatatat 
ataaatttat 
tattattttc 
cccatttttt 
gtggctgaaa 
gcagcataga 
ttgcgccttt 
tgctggtcgt 
cggcgcattt 
tgtgttgcat 
actcacacac 
ccccctttgg 
gaattacagc 
tcgcaattat 
agttttatga 
tctgttattt 
ttagatttaa 
aactaatgca 
aacaagaagt 
aacaaagttg 
taccgtgagt 
caacaccgga 
aacattatag 
aaaaattggc 
gcgtgagtct 
gcttgttccc 
atacttcatg 
ggtacaataa 
gtgacaattt 
gacacatttg 
attgatttaa 
taagtaaaag 
aatttcgccc 
atcacaaccg 
atctcacatc 
tcacatccac 
ctacaaccaa 
caccgtgatg 
cattgtgcgc 
tgtgtccacc 
cacatctaac 
gtgcgctcaa 
cttgagtacc 
ataacagaag 
gtaaactgaa 
atgaaatgaa 
ataaatatat 
aaattgtaga 



atcctttgag 
cccattccct 
gtttcaatta 
atccagctga 
ctggcgttgt 
tgtccgcttg 
gcccggcttt 
ctcgccgtca 
ctcatgattc 
tggtaagtta 
tattcatctg 
gttgggagta 
ttaatatcat 
ttgtagtgct 
actctttccc 
ttttaccaaa 
gtcctgctcg 
taatgtgcta 
ggctgcgcct 
gcagccatta 
gctttgttct 
acttccaggc 
accacaaagc 
ctgaaagcga 
cacagctgca 
aagcgccagc 
aaaaagtttg 
gttattgcct 
cttttggcta 
tagctaatgt 
ttagtaggaa 
catttgtaca 
ttttggataa 
aaattccatt 
gaaactgtgt 
catccaaaaa 
tttaggcgat 
gagatttgga 
cttctgtctg 
cccagtatga 
gcccattttt 
gtgtacgcca 
agtcagctcc 
gcttacaggt 
ggcttaactc 
caagagccac 
cttccaaccc 
attcgcaggt 
aacaagggct 
ctctacgttt 
gatgcggtaa 
actttggtgt 
cagctaacca 
cacttcaagt 
agccacttat 
tccaactaat 
tgaattatct 
aatgcaataa 
tcttttagtt 
aagtaggtga 



tgtgcgacgc 
atttccccat 
cattttaatg 
gccatggagc 
gtgtgccttt 
gcttttgttt 
tataatttcc 
cacaaccact 
tcgtcgtgtg 
gcgccagctg 
ttaagaaaac 
caggcaacat 
ataatattat 
ctattaagtt 
gtgtaattcg 
tttgtccttc 
cttttttttt 
caaaagggtc 
aaaaaacatt 
cattgaagtt 
cacctacgca 
gcccaaatta 
cagcatgtcc 
agagcatggc 
ggacattgga 
agatggcgcc 
caaatgcagc 
aatgaatact 
attgatttaa 
attaaattgc 
aatcggctat 
attatactgt 
cacatacttg 
gacagtagtg 
atccagtggg 
ataaatcaaa 
tggggggctt 
ctccggggga 
ctacatgata 
actggaagaa 
tttactcaat 
ccacattact 
gatttgtact 
gggactactc 
agccagttgt 
caaaccagag 
gcaaaccgca 
acgaggtcgg 
acgtggtctt 
acgtgaagat 
gctacatctc 
cccctgaaat 
ggggactaag 
gcagatcttc 
cgcacagtac 
atatctaatc 
tttgtttgtt 
gaacactttt 
tagtactaat 
cgtgaaatgg 



cacaatggca 
ttcctccatt 
gcatttccaa 
catggacctg 
tggcctggcc 
ttttccagca 
tctttttcac 
gacgtactcc 
gataactgcc 
cacggagaac 
cattcatgtt 
gtgataatga 
taataatatt 
tatagatata 
aaatattata 
cggaggcgtg 
tcttttttga 
tcgaaataca 
acgtaattga 
ttaatgagtt 
cacacactca 
accagcactc 
ttaatgccat 
ttaaagggca 
aaatttggaa 
acttacatat 
ggttgcaaca 
aaactgattg 
gcattgttta 
cctcaatttc 
ttccattcgg 
caggttattt 
gggtaacaag 
acaatgaggc 
cgtgccccaa 
acaactggca 
tagagtgggc 
caaatggaca 
ttcagtgaac 
gctgaatctt 
tccccacagc 
ctgttgtctc 
gcctgaatga 
tgactggctt 
cattccccaa 
caggtaccag 
tgcaattcga 
tcggcaccag 
ctcggcgatg 
tggttatgtc 
cagctccatc 
gtgtccggaa 
ttgtcaggca 
attaagttgg 
tgcaggtgat 
acaaattaaa 
agcatgaaaa 
tgaatattta 
tgaacgtttg 
aacgaagtaa 



aggattgggt 
tagccatccc 
gttttgtgtt 
aacctcaacc 
agaaagccgt 
tttctctggc 
cttcgttttt 
aagaagcgcc 
ctgtcaatca 
aaaatcaacc 
ataatcattt 
tcagtatcca 
atattataaa 
tatttttttg 
tgtaatattt 
acaatacttt 
gctaatggct 
cacgtttatt 
cgacgggttg 
gttaatgact 
gcgcgaaagc 
actcacacac 
tggccatggc 
cattaaaacg 
aatttggcgc 
tttcattctt 
actgtttgtt 
tgtttgcttg 
tttagttcat 
ggattgaata 
gtcttgtaaa 
tggtatcggg 
cagctagttt 
aataaacttt 
actttggttg 
atcattttat 
gtggtaatgt 
taaccgaatc 
ctagtattct 
gccacctaca 
acacgggtgt 
ggaatcgttt 
ccaacacact 
aggtttagga 
gtcccagttc 
gtacctccca 
actggaattc 
gcggagttcg 
ggatcgttct 
ctcacgtcac 
actatataag 
atttatcatg 
ctggggaaaa 
gatacccact 
attttaagtg 
tttgctgtga 
atgaacttca 
aaaagagtgc 
ctgccaagag 
ttacagcaat 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800. 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
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ccaaatcccg 
gtgctctgct 
cagactacga 
caaccaagtg 
cacccagcgc 
agtgctccaa 
tagacaaaac 
gtctctcgtc 
actccacgcc 
tgggtggcaa 
ccaacgagaa 
accaccatca 
atcaccatcg 
ccctgactat 
tgctgatacc 
ggattggctg 
ttcgcactgc 
accacctggc 
cggatccgga 
ctccagtcgc 
aacttttcgc 
aactaccaac 
aattgactct 
caatagataa 
aaaatcaaat 
aaatatatag 
attttggcaa 
aactaagtat 
atgccaaacg 
tgccttgtta 
agcatcttaa 
ccggaagttg 
tgtgagcgtt 
aattatagta 
ggatga 



aatctctatg 
tacatattcc 
tgtggatggg 
gctgccgaac 
cgccacgaat 
gtgctcctct 
gctgacgttc 
gctcattcac 
ctcgggcatg 
tgtgagcaac 
actggaggcg 
cccgcatgca 
catgccgatg 
tgtgatgggc 
cttcctgccg 
ggtcaactgc 
cttctggcgg 
catgttccgt 
accggtttcc 
acaatcgacg 
cgtgctggtt 
tcgcatttcg 
agactacagt 
gctgtttgag 
attagcttaa 
aaactagttt 
acgaagccaa 
tcatttgcca 
cagctagtaa 
aacccaaaaa 
aggtcacttt 
agattcttgc 
gattgtgcga 
tttgcgagtc 



ccctgacatg 
tcttctatta 
gacaacttca 
aggaagtccc 
gcagcagcga 
gccactggaa 
aaacatcagc 
tgctcggcaa 
cactacgttg 
aacaccacca 
gaggccaagc 
caccatcatg 
cgagtttcga 
ggcctaattg 
cgacccgccg 
gccatcaacc 
ctcacctgtc 
ggctagtttc 
gaatcactga 
ctcacttggg 
caccaacttc 
atctttttga 
acagatagta 
aaatgcacta 
caaaactgct 
aaaatttatc 
tggagggacc 
ggcactataa 
aactaagata 
atcgactgcc 
acggtcgcaa 
ctagaccaga 
ggcgagcggc 
tttgttgcct 



caatctaaat 
tgtccttcca 
tctcggagtc 
gctggaggtt 
agggacacca 
ctgctggagc 
ccacctttta 
ttagctgcaa 
gcacggaggc 
ccaccaccac 
aaaagcaaca 
gctcccatgg 
ccacgaaacg 
cttgctggct 
tcctcgagga 
cgttcatcta 
gccccatttg 
cggatgcaga 
gcacttgagt 
tggtggggga 
tatgtaatta 
agtacaagtt 
aataaaaaag 
tctcttatat 
gaatttaatt 
acacatgtcc 
ttttggccaa 
gctgtttgcc 
cttatggtca 
ttttgaagct 
tttttttttt 
tccaaatcct 
acgaagcgaa 
gacggcaact 



tattcctcac 
acagtactcg 
ggagcactat 
caactccctg 
gtcgtccgtc 
tggagctgga 
cgagctggtc 
gtatggagga 
ctccttctcg 
caacaccgtc 
gcagcaacga 
ccagcagagc 
tgatagcaag 
gcccttcttc 
cctcatgttc 
cgccttctac 
caagcagaag 
tgcagttgcg 
gtgtgtcata 
aatatctcag 
gcataaaaaa 
atgccaatgc 
aatttaagga 
taaatcaatt 
caagtttatt 
ctccaaaaat 
ctcgtcgcca 
gcactctatc 
atattgattg 
tcaaaagccg 
ttaagactca 
atatatagta 
acgtaacgaa 
gttggccatc 



gtttttgtca 
gagcgcacgg 
cattgcacgc 
cacgacccac 
aagtgctcca 
gctggtgtgg 
gaggtgtcgc 
ccctgcatgc 
gacacctgcc 
gcactggaaa 
tcctccttcg 
catcacaatc 
accgccaaga 
gtctattatc 
ggcttcacct 
aatccggact 
cgtccgccca 
gatgcggatg 
gtgccgccat 
gcatatcctt 
taagtggaga 
aaggacatca 
gtattaacta 
gaagataatg 
tcatgaaaac 
tgtaactaat 
ctagctcacc 
tgcatcgttt 
gagcaaagat 
catttcgagc 
ctggacttgg 
ttttcctgtg 
acaaaagaca 
gggatgagtg 



6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
7986 



<210> 125 
<211> 1704 
<212> DNA 
<213> Drosophila 



<400> 125 

atggcggcgt 

ctcactgtgg 

agtgcggcac 

ccctcgatga 

ttcctggtca 

accacacgcc 

tggctggtcg 

cgctttggct 

tccatcctca 

acgtactcca 

ataactgccc 

caggcggagt 

atgggatcgt 

gtcctcacgt 

tacgatgtgg 

aagtggctgc 



tggaacatcc 
ccagtggtaa 
caggtggcga 
actcgtacct 
tcatcctggt 
gcctgcgcac 
gcacctgcgt 
ggatactgtg 
gcctgtgcgc 
agaagcgccg 
tgtcaatcac 
tcgtggactg 
tcttcattcc 
cacggcgaca 
atggggacaa 
cgaacaggaa 



ggatgccatg 
ctcgaactcc 
tgtgatggcc 
ggtgagcatg 
caccgtggtg 
cgtaaccaac 
gatgcctcca 
cgacatttgg 
catcagtctg 
atccaagcga 
atgtccaccc 
ccgctacaac 
cctcaccgtg 
gcgcattgtg 
cttcatctcg 
gtcccgctgg 



ctgccatact 
tccgatggta 
accggagccg 
gcgtggccca 
ggcaacacac 
tgtttcgtga 
tcggtggtgc 
atctcgctgg 
gacaggtatc 
ctggccctcc 
tatttgggct 
caaaacaagg 
atgctctacg 
cgcgatgcgt 
gagtcggagc 
aggttcaact 



ccaaattgtt 
ctttgctgct 
gagccggaaa 
agtcgctggc 
tcgtcatcct 
tgaacttggc 
tctacataac 
acatcctgct 
tcgccgtcac 
tcatgattct 
ggtacgaggt 
gctacgtggt 
tttacgtgaa 
actcggagcg 
actatcattg 
ccctgcacga 



gtcaacagcc 
catccagacc 
gttgggtgtg 
agtggccgtc 
cgccgttctg 
catcaccgac 
aggcacttgg 
ctgcagcggc 
acaaccactg 
cgtcgtgtgg 
cggtcggcac 
cttctcggcg 
gattggttat 
cacggcagac 
cacgccaacc 
cccaccaccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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agcgccgcca cgaatgcagc agcgaaggga caccagtcgt ccgtcaagtg ctccaagtgc 1020 

tccaagtgct cctctgccac tggaactgct ggagctggag ctggagctgg tgtggtagac 1080 

aaaacgctga cgttcaaaca tcagcccacc ttttacgagc tggtcgaggt gtcgcgtctc 1140 

tcgtcgctca ttcactgctc ggcaattagc tgcaagtatg gaggaccctg catgcactcc 1200 

acgccctcgg gcatgcacta cgttggcacg gaggcctcct tctcggacac ctgcctgggt 1260 

ggcaataaac tggaggcgga ggccaagcaa aagcaacagc agcaacgatc ctccttcgac 1320 

caccatcacc cgcatgcaca ccatcatggc tcccatggcc agcagagcca tcacaatcat 1380 

caccatcgca tgccgatgcg agtttcgacc acgaaacgtg atagcaagac cgccaagacc 1440 

ctgactattg tgatgggcgg cctaattgct tgctggctgc ccttcttcgt ctattatctg 1500 

ctgataccct tcctgccgcg acccgccgtc ctcgaggacc tcatgttcgg cttcacctgg 1560 

attggctggg tcaactgcgc catcaacccg ttcatctacg ccttctacaa tccggacttt 1620 

cgcactgcct tctggcggct cacctgtcgc cccatttgca agcagaagcg tccgcccaac 1680 

cacctggcca tgttccgtgg ctag 1704 



<210> 126 
<211> 567 
<212> PRT 
<213> Drosophila 



<400> 126 



Met 


Ala 


Ala 


Leu 


Glu 


His 


Pro 


Asp 


Ala 


Met 


Leu 


Pro 


Tyr 


Ser 


Lys 


Leu 


1 
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10 
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Leu 


Ser 
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Ala 


Leu 


Thr 


Val 
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Ser 


Gly 


Asn 


Ser 


Asn 


Ser 


Ser 


Asp 








20 










25 
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Gly 


Thr 


Leu 


Leu 


Leu 


He 


Gin 


Thr 


Ser 


Ala 


Ala 


Pro 


Gly 


Gly Asp 


Val 






35 










40 










45 








Met 


Ala 


Thr 


Gly 


Ala 


Gly 


Ala 


Gly 


Lys 


Leu 


Gly 


Val 


Pro 


Ser 


Met 


Asn 




50 










55 
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Ser 


Tyr 


Leu 


Val 


Ser 


Met 


Ala 


Trp 


Pro 


Lys 


Ser 


Leu 


Ala 


Val 


Ala 


Val 


65 










70 










75 










80 


Phe 


Leu 


Val 


He 


He 


Leu 


Val 


Thr 


Val 


Val 


Gly 


Asn 


Thr 


Leu 


Val 


He 










85 










90 










95 




Leu 


Ala 


Val 


Leu 


Thr 


Thr 


Arg 


Arg 


Leu 


Arg 


Thr 


Val 


Thr 


Asn 


Cys 


Phe 








100 










105 










110 






Val 


Met 


Asn 


Leu 


Ala 


He 


Thr 


Asp 


Trp 


Leu 


Val 


Gly 


Thr 


Cys 


Val 


Met 
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Pro 


Pro 


Ser 


Val 


Val 


Leu 


Tyr 


He 


Thr 


Gly 


Thr 


Trp 


Arg 


Phe 


Gly 


Trp 




130 
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Asp 


He 


Trp 


He 


Ser 


Leu 


Asp 


lie 


Leu 


Leu 


Cys 


Ser 
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Ser 
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Ser 


Leu 
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He 
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Leu 
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Leu 
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Val 
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Thr 


Tyr 


Ser 
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Arg 


Ser 


Lys 


Arg 


Leu 
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He 


Leu 
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Val 
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He 
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Leu 
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He 
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Cys 
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Pro 


Pro 


Tyr 


Leu 


Gly 


Trp 


Tyr 


Glu 


Val 


Gly 


Arg 
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Gin 


Ala 


Glu 


Phe 
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Arg 


Tyr 
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Gin 


Asn 


Lys 


Gly 


Tyr 


Val 


Val 
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Ser 
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225 
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Met 


Gly 


Ser 


Phe 


Phe 


He 


Pro 


Leu 


Thr 


Val 


Met 


Leu 


Tyr 


Val 


Tyr 


Val 










245 










250 










255 




Lys 


He 


Gly 


Tyr 


Val 


Leu 


Thr 


Ser 


Arg 


Arg 


Gin 


Arg 


He 


Val 


Arg 


Asp 








260 










265 
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Tyr 


Ser 


Glu 


Arg 


Thr 
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Asp 


Tyr 


Asp 


Val 
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Gly 


Asp 


Asn 


Phe 
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Lys 
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Leu 


Pro 
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Pro 


Pro 
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His 
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His 


His 


His 
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Lys 
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Lys 


Thr 
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Thr 
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Leu 


He 
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Phe 
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Tyr 
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Leu 
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He 


Pro 


Phe 


Leu 


Pro 
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Pro 
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Val 


Leu 


Glu 
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He 
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He 
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Pro 
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Phe 
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Trp 


Arg 


Leu 


Thr 


Cys 


Arg 


Pro 


He 


Cys 


Lys 


Gin 


Lys 


Arg 


Pro 


Pro 


Asn 
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565 



<210> 127 
<211> 3681 
<212> DNA 
<213> Drosophila 



<400> 127 

tccatgcatt 

gtgtcctggg 

taattatgca 

tcttggtcct 

tgaccccaga 

cgtaccaacc 

actgctggca 

cttttacaca 

ggtgccattc 

tgatatcagg 

aagtgtcaag 

agtacaatta 

accaatgttt 

cgggaccatt 

ataataaata 

aattgcatgc 

atgctcatgc 

tataattctt 

cttcgacacc 

gctaatcatt 



tataattcaa 
tccttaagtc 
aatttttaat 
tggatctcta 
ggccccaaag 
caaccatttt 
ttcggtcttt 
catatgtgtg 
agttgaagcc 
cacattaaat 
taaatgcaaa 
aaatataatt 
tttgaacata 
ttgtttccag 
tcaggggact 
cgataaaata 
ggttcagtcg 
gtattcgtaa 
gtggatatat 
cctgctcatc 



cattcgctcg 
ctttctacct 
gactatgaaa 
tacccctcat 
caatccattt 
ccatttaatc 
agatttatac 
tctgtaaatg 
ctttaggtcg 
gtagtaaagg 
ttaacattaa 
agctatatat 
acagatatct 
atttcaaatg 
tggccaaatt 
ataagtgata 
agatccaaag 
tttcaagcca 
ctcatatccc 
tcacgggtct 



cggccttatg 
cgaaagtgtt 
atggtttccg 
acccctctaa 
gcatggctac 
cctacgatgt 
agattttgtc 
actgaaaggg 
tgattcgcat 
aacgtaagta 
tttaagactt 
ttgtaacttt 
accatatgaa 
aaatgaaacg 
ttaatacaat 
ccgaaagaga 
tttgcgcaga 
atcggaggca 
aaagctaaat 
atatacattt 



aaaagctcat 
aatgagccgg 
cggagcaaca 
aaatgccagc 
caatagctcg 
gagagctctg 
aaacaaaaca 
agagcgtcga 
atggtcgagg 
tgataaaggc 
gaattcgtct 
taagaacgga 
gtgaaataaa 
catatttttg 
ctcatttaga 
gcttttgatt 
atgcttctga 
gttattcctg 
gactcctatg 
aggcaactgg 



ttggttacga 
caggctgccg 
agtgctcgcg 
taattccatt 
tcgcattcgt 
tactgagctt 
tgaattcgca 
aagccgattt 
ccatttaagc 
agcatcacaa 
gtactgaaac 
tacttctggg 
tagaattttt 
ccagtttttg 
aaaataaaac 
agagtgaaat 
atattttagc 
gatgtgatta 
cgtacgagga 
cggatggatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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acaggagcca 
attctgctgc 
gaatctgaag 
atattacctt 
tgtttcggta 
gtatctcagt 
ctggacattg 
atggatactg 
cctactatat 
tccatggttg 
ttgcttggaa 
gcaggaggag 
tattaaagtt 
gctttggctt 
ttaatcaatc 
acacctgcta 
gacaaactga 
tagggtagag 
actggtctgc 
tcacgtttat 
cccaacagca 
tatttctaat 
tgggcgtgac 
gacaagtcct 
tatttattct 
aatagatttt 
aacctgcatc 
ttttttttta 
tcttattatt 
ttttttttgt 
aagttccgta 
acgcgccagg 
aaatgcagat 
gccgccggca 
aaattcattc 
cagtctgccg 
gagtcaaagt 
gtggttgggt 
aagtctggga 
gtaatcataa 
aaagcggaaa 
tgtgtgaagg 



gtgaagagtc 
ccccgcaaca 
cggatcaatc 
ctcaccgaca 
caggaggatc 
ataatttgat 
agcaagcagt 
gaacatgtat 
catctaatct 
gttgtattct 
aatgtttcat 
atttgaactt 
tgacttaaaa 
ctcacttttg 
gggatgaacg 
tcatgacggc 
attggattcc 
tgtttaagcc 
tatgatatac 
cctgacagta 
gcgaaagtgt 
tccattctct 
tgggatttct 
tcgggtgcca 
taagcgcagc 
aaagttacta 
ttgatcaacc 
taagtttagt 
gaaaaattct 
tagaacaaaa 
ggaaaatatg 
accccaaatc 
ttttaagcct 
gacacacaga 
aatactttat 
aaaccttttt 
tgctgccgct 
ggcgtgtgct 
ggctggctct 
cgcagtcatc 
atgggggctc 
agctttcatt 



acttaagagc 
aaatgatgcc 
cctacttgaa 
tgatcgtctt 
agtacaagct 
atctatagaa 
ggtactgcct 
atataccgaa 
catctcatct 
cacgattgca 
ttgctatgta 
gtggaataat 
tatgcgttta 
gctctctgtt 
tagctatcac 
aattacatat 
cggtgttggc 
agtttaagac 
ctatatggac 
aatcacataa 
atacagaata 
tactatagtt 
gaggtaataa 
aattttttcc 
acatttcaaa 
tatcaatatc 
tactagttta 
atagagcagt 
ttattattga 
tgtcaattgt 
cacttttatg 
cttccacctt 
tgcggcagct 
tactcgaaca 
ctcacatgca 
cgcacttaag 
gcggggcacc 
tttctcctcg 
ctctcatctc 
agctcgtaat 
agggaccaag 
c 



atgtatctgc 
gaatagtcgc 
gatcacactg 
aagatatgag 
atatgaggtt 
tggcagcttt 
ccatccccgt 
gtcaatgcca 
aattctctga 
gtgtatctct 
ttttgtaagt 
atttgctccc 
ttctaaacag 
atgagtcatc 
tttctcatct 
ctggtggatt 
ttatatcgat 
cagtaattga 
cgatgctgat 
tgaagattaa 
atgagtaagc 
ttctattata 
cttacttcgt 
atttgggttc 
tgataatgga 
attttagaat 
aataaatgca 
ccctctatgg 
tatttaatca 
catcaaaaat 
gaaactcgct 
ccccacaatc 
tttgcgcaaa 
ttcggtggtc 
ttgcacttca 
tggttaaatt 
tcaagtcaca 
gaaaatgcgt 
atcacgttca 
attacggccg 
gaaaaggtat 



aaactaaaac 
tgtagcgatg 
gaggatggaa 
tttcggtatt 
aagatttcaa 
atgattaaac 
ttggaagatc 
gccgtacctc 
tttttccagt 
acataaagaa 
ttgtgcagta 
tagcaggtta 
gttacactaa 
aaatatggaa 
acaacatcta 
gggcctggga 
gctggataaa 
ttttctttga 
cttgagttta 
gagtagcgtt 
tttaagtatt 
cctacgactg 
caagcggcac 
cggtatagtc 
gaggtaagag 
cagcggtccg 
aattatagat 
tcaataaatg 
ataatatatt 
atgtgtttgt 
ttttcaatta 
ctcggctggt 
cagattgtaa 
agtggccaga 
cttttgagcc 
ggctgtgata 
tgtggtgatg 
gaaaagtcca 
ttgagtgcct 
cctgacatat 
gcgtactatt 



cttgtatccg 
gactcacaga 
ctataggaaa 
gcgagaaggt 
tctttttgaa 
ccgatgtaaa 
cgaattccat 
agggcaagca 
cggtactatt 
actgcgaaat 
ccttatctgg 
gttgaatcca 
ctatttcttt 
aaacttgagg 
cggttggggc 
ggatagacct 
cagtgggttt 
acagcctatg 
ttcaatgtgg 
aagagttcta 
tataacttac 
tcggtgatga 
aaattttggc 
gtatttgtgc 
ctttttgctt 
aggagacaac 
aagtgagctc 
cttataatct 
tttatttttt 
gaataaaatc 
gtccttgcat 
gtcagaatgc 
acaaataaat 
ttgcagcctg 
acgttaagtt 
aattgttgcc 
gtgcgagtgg 
caagcacaaa 
gtcgctgaca 
gacatctggg 
aatatgccag 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3681 



<210> 128 

<211> 1503 

<212> DNA 

<213> Drosophila 



<400> 128 

atgcttctga 

gttattcctg 

gactcctatg 

aggcaactgg 

aaactaaaac 

tgtagcgatg 

gaggatggaa 

tttcggtatt 

aagatttcaa 

atgattaaac 

ttggaagatc 



atattttagc 
gatgtgatta 
cgtacgagga 
cggatggatc 
cttgtatccg 
gactcacaga 
ctataggaaa 
gcgagaaggt 
tctttttgaa 
ccgatgtaaa 
cgaattccat 



tataattctt 
cttcgacacc 
gctaatcatt 
acaggagcca 
attctgctgc 
gaatctgaag 
atattacctt 
tgtttcggta 
gtatctcagt 
ctggacattg 
atggatactg 



gtatt 
gtgga 
cctgc 
gtgaa 
ccccg 
cggat 
ctcac 
cagga 
ataat 
agcaa 
gaaca 



cgtaa 
tatat 
tcatc 
gagtc 
caaca 
caatc 
cgaca 
ggatc 
ttgat 
gcagt 
ttcca 



tttcaagcca 
ctcatatccc 
tcacgggtct 
acttaagagc 
aaatgatgcc 
cctacttgaa 
tgatcgtctt 
agtacaagct 
atctatagaa 
ggtactgcct 
tggttggttg 



atcggaggca 
aaagctaaat 
atatacattt 
atgtatctgc 
gaatagtcgc 
gatcacactg 
aagatatgag 
atatgaggtt 
tggcagcttt 
ccatccccgt 
tattctcacg 
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attgcagtgt atctctacat aaagaaactg cgaaatttgc ttggaaaatg tttcatttgc 720 
tatgtatttt gtaagtttgt gcagtacctt atctgggcag gaggagattt gaacttgtgg 780 
aataatattt gctccctagc aggttagttg aatccatatt aaagtttgac ttaaaatatg 840 
cgtttattct aaacaggtta cactaactat ttctttgctt tggcttctca cttttggctc 900 
tctgttatga gtcatcaaat atggaaaaac ttgaggttaa tcaatcggga tgaacgtagc 960 
tatcactttc tcatctacaa catctacggt tggggcacac ctgctatcat gacggcaatt 1020 
acatatctgg tggattgggc ctgggaggat agacctgaca aactgaattg gattcccggt 1080 
gttggcttat atcgatgctg gatatggtct gctatgatat acctatatgg accgatgctg 1140 
atcttgagtt tattcaatgt ggtcacgttt atcctgacag taaatcacat aatgaagatt 1200 
aagagtagcg ttaagagttc tacccaacag cagcgaaagt gtatacagaa taatgagtaa 1260 
gctttaagta tttataactt actatttcta attccattct cttactatag ttttctatta 1320 
tacctacgac tgtcggtgat gatgggcgtg actgggattt ctgaggtaat aacttacttc 1380 
gtcaagcggc acaaattttg gcgacaagtc cttcgggtgc caaatttttt ccatttgggt 1440 
tccggtatag tcgtatttgt gctatttatt cttaagcgca gcacatttca aatgataatg 1500 
gag 1503 



<210> 129 
<211> 172 
<212> PRT 
<213> Drosophila 



<400> 129 



Met 


Leu 


Leu 


Asn 


He 


Leu 


Ala 


He 


He 


Leu 


Val 


Phe 


Val 


He 


Ser 


Ser 


1 








5 










10 










15 




Gin 


Ser 


Glu 


Ala 


Val 


He 


Pro 


Gly 


Cys 


Asp 


Tyr 


Phe 


Asp 


Thr 


Val 


Asp 








20 










25 










30 






He 


Ser 


His 


He 


Pro 


Lys 


Leu 


Asn 


Asp 


Ser 


Tyr 


Ala 


Tyr 


Glu 


Glu 


Leu 






35 










40 










45 








He 


He 


Pro 


Ala 


His 


Leu 


Thr 


Gly 


Leu 


Tyr 


Thr 


Phe 


Arg 


Gin 


Leu 


Ala 




50 










55 










60 










Asp 


Gly 


Ser 


Gin 


Glu 


Pro 


Val 


Lys 


Ser 


His 


Leu 


Arg 


Ala 


Cys 


He 


Cys 


65 










70 










75 










80 


Lys 


Leu 


Lys 


Pro 


Cys 


He 


Arg 


Phe 


Cys 


Cys 


Pro 


Arg 


Asn 


Lys 


Met 


Met 










85 










90 










95 




Pro 


Asn 


Ser 


Arg 


Cys 


Ser 


Asp 


Gly 


Leu 


Thr 


Glu 


Asn 


Leu 


Lys 


Arg 


He 








100 










105 










110 






Asn 


Pro 


Tyr 


Leu 


Lys 


He 


Thr 


Leu 


Glu 


Asp 


Gly 


Thr 


He 


Gly 


Lys 


Tyr 






115 










120 










125 








Tyr 


Leu 


Leu 


Thr 


Asp 


Met 


He 


Val 


Leu 


Arg 


Tyr 


Glu 


Phe 


Arg 


Tyr 


Cys 




130 










135 










140 










Glu 


Lys 


Val 


Val 


Ser 


Val 


Gin 


Glu 


Asp 


Gin 


Tyr 


Lys 


Leu 


Tyr 


Glu 


Val 


145 










150 










155 










160 


Lys 


He 


Ser 


He 


Phe 


Leu 


Lys 


Tyr 


Leu 


Ser 


He 


He 











165 170 



<210> 130 
<211> 6552 
<212> DNA 
<213> Drosophila 



<400> 130 

agtatatacg 

cctctaagcg 

tcatttgatt 

aattctgtcg 

catgtggatg 

gagggcgccg 

ctaactctag 

atttctttac 



agtatatcct 
ggactttcat 
cattgtgtct 
ccgatggttg 
ggcggtcagc 
acctcgaggc 
gtaccgacac 
actcgcttac 



gtacgactat 
cgaggtcggt 
gttcgtgtgt 
tggttttgtg 
caactaactt 
ccgctagcta 
ctcgtgtcgg 
ctaatcgtac 



atatcaaacc 
gcccctcatt 
gttggcgtgt 
tccatggccc 
ggaacattcg 
ctgtatccaa 
cgtggaacaa 
tgtgggaatc 



gaaagtcgaa 
atcgtattca 
tactctgtta 
atcaatgggg 
tatgtgatgt 
ttgaactgtg 
tccggcctct 
ggaaggaagg 



ctgaaagtgt 60 
tttgtggggc 120 
agttggatgc 180 
atgctccatc 240 
gtctgtggtg 300 
atttagctaa 360 
aaggatatga 420 
ttctgtgtgc 480 
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cgtgggtttt 
gggagcagcc 
gttttgctta 
ctacttgggg 
caccgttcat 
tatttacata 
gaaacgaaac 
tttgccatcc 
ttgctataac 
tcttgaaggc 
tgttgatgta 
tcttgaactt 
acaggaagac 
tatggcagag 
acccaaaaca 
gcgtttcttt 
gataataggc 
aagagaggta 
agcccaaata 
tgcgagttct 
ttgcgtgcca 
gcgacgctgc 
actactagtc 
ttttgggggt 
tgatgaaaac 
ccaccaaata 
attatgaaaa 
attggatttc 
ttgacaacaa 
tggaagcaaa 
gaaaaaagcc 
gtgtgcaatt 
cattgcgcaa 
gtacaaaaac 
ttttaaactg 
aatgatttag 
aaatattagc 
aacttggatg 
tggggatgct 
tcgacaccga 
cgccgctgac 
ggcggcgagc 
acgcctgcaa 
ggatcggcga 
cgttatggca 
tatccatcaa 
tccggcgttg 
ggcaagattg 
tgccattggc 
tcagcccatc 
cctcctgtca 
ctaattggct 
gattcgaatg 
cggcaggacc 
ccggacgaag 
attgctttgc 
ggcttatcta 
ttgcaagttg 
tttgcagggc 
tctgcgaaat 



ggttttgggt 
cactttgaag 
cgtctacttt 
tttgcttaat 
gctccacata 
agtagctagt 
tggagcccaa 
tctcccgggc 
tattgctggt 
cttgcgaaac 
gcccagccag 
ggcgcacatg 
acctgtttga 
cgcaaggacg 
atggccaatg 
ttcgaggcct 
catggtgtgg 
gagaaggcga 
actaataact 
tcttatcctt 
aagtcatggc 
gaacggaagt 
actatttttg 
ttcgaggata 
gtgagctctt 
atcaattgat 
gtagtggcaa 
aatctgtttg 
taacccgaca 
gatgttgaca 
atgcgtgtta 
taagcaattt 
aaagagcact 
ttgaatggga 
ttgcgaaaag 
taccaatgtg 
aatttctcga 
aggcattatg 
atttctaagg 
ctgacctgag 
tgctgacgct 
cgctgctccg 
ggacatcgtc 
cgaccagctg 
atgtggagcg 
tcaatcaatc 
atgaaggacg 
agttttaata 
tgtgatttct 
ggttcggttc 
ttaatggttt 
gcgcttaacg 
ccgtggggcg 
tgattgaaag 
cctgccatta 
tggtctgctg 
ttacgtgttt 
caagcggaca 
catcggagat 
tcgccgaaag 



gggaggaaaa 
gacactactc 
ggaatcataa 
gtaattgtta 
gcatatatgt 
agaaccttaa 
ttgagatctc 
tttaatccat 
gacgtgctcg 
tcgggattga 
gtggtcatca 
gcgtccatga 
gggtttgaag 
tggtccaaaa 
ttttggtggc 
tgtgcacctt 
tatatattgc 
gaaggaaatg 
gtaattaggg 
gcgcgaggac 
ctcctgcact 
cggtcggatt 
ggcttgccgg 
aagtggcagg 
ttttgctaag 
gatttcataa 
gtggcagaac 
aaaaatgtaa 
tatttaatta 
tttttgacgc 
tacgcattta 
taatatgcta 
gcctatagct 
atgctttatt 
tttcatttta 
gtacagatat 
atttccgata 
tttttatccg 
tggcaggctt 
ggcgggttcg 
ggatcgctgg 
cggagcggct 
aacgtttgag 
cggctgtgtg 
atgtgacggt 
aatcggcgat 
aaaagatgga 
aagtgacgca 
tttggcgggt 
ggttcggttc 
catttcatgt 
ctcgcactta 
gggacatccg 
tgcagcacgc 
gattgacagt 
gtctggaagg 
ggggcacgtg 
aggatatgga 
agcccccagc 
tttccgcttt 



gcttgctgct 
tattctggta 
tcgcgtgctt 
aaaattgtac 
atgtatgtag 
agctattggg 
tgctgcaacg 
tccacaaggg 
ccgttgataa 
atattgtgta 
tgtaggcggt 
tgttgcagct 
gttatatata 
cggctggggg 
cttgcgctcc 
gtatatcgtg 
attttccaca 
gggtggcatt 
gcaatggacg 
gttttcgatg 
ccgtaccgca 
cggaatgaga 
attgcctaca 
tttctcgcta 
caacttaaat 
gcctgctgat 
taatactgca 
tgtgttaaaa 
aactttgcca 
ggtcaaactc 
ttgttttgtt 
cgtaatctag 
gaaaaaagta 
ttgtcgggct 
cctatggtac 
tggacggtat 
taattgccgg 
tgcacgaact 
acggatctga 
ccgcgcttcg 
agggtggcgc 
gatgtggcag 
gctgcataac 
gtgatttggg 
ggggttatcc 
agacgacgag 
aaatcggggt 
ttaattgaat 
aaagtttcgc 
ggatcggatc 
ccttattgga 
atggcggttt 
attgcaagga 
ttttatgtcg 
tttccatggg 
aagtgcagtc 
ccagggctgt 
cacggaaacg 
caaacattcg 
tcataccccg 



cgtgatatgt 
aagcgaatcg 
gctgtgtgtt 
ctagcctaca 
tttagtggta 
aaacgcaacg 
atggcctcca 
acccatcatc 
tcaccccatg 
gataaccggg 
gagcccaggt 
gaagaacggc 
tgatggcata 
aggtggctca 
tttttggccg 
aatctgtgcg 
ggcaaaaaag 
agcatcgact 
gcgttactca 
tagtcgatag 
caaaggacaa 
acttaactgc 
ttgccaacat 
aacactacgt 
caagcgattg 
atgtaaacat 
aatcactatt 
aaaaaaaaaa 
tttgtttacc 
tttgtctgtt 
tataaataat 
tgtaagtaga 
ttatattatt 
gcaaaaacag 
aatcaaaaag 
tttgtttttt 
aactttgatt 
tctactacgc 
atgccaaata 
gcgagtcgtc 
cactgggcag 
cagaggcgga 
cagaatcatg 
ccacaacgct 
aattaatggt 
tgggcgaaga 
gcaacgtcaa 
tattgagaat 
ggtcttgttt 
ccatcccttt 
ttaagcaggc 
cggccaggaa 
tgatgggcaa 
gcaattgagt 
catgcaccgt 
gagaccactc 
cagggttgtc 
aaactgctga 
ttctccaggc 
ttcacgtaag 



gataggttgt 
tatccttggc 
gattatatcg 
gcttgcaatc 
tgtacagatc 
aacgacaaac 
tttggcacac 
caattgccta 
tgcatgatct 
ttcacaaagc 
cggcagtctt 
agccagcaga 
tggattagaa 
acttgaactt 
acgatttttc 
tcacgcgaag 
gcaaatgatg 
ggccaacaag 
ccttgacgcc 
cgtcgagtcg 
tggcttcatg 
ggcattacgt 
gcggcttcta 
aataggtctg 
tgcaagaatg 
gccgacaata 
taaatgtgca 
catgtaatag 
aattaatatt 
tttattgaac 
gacaattgat 
taaaaaaaaa 
caaaaactat 
ttcattaaac 
aatgttgttc 
ttttttttct 
ttaaaacaat 
ccccaactac 
cctggagtta 
gccggtgggg 
cggactgtcc 
ggcgggcgcc 
attaccattc 
gctgcgagtg 
gccgcgacca 
agcagcagaa 
ttaaaggcaa 
tatttttctc 
gccatccatt 
gtgcatccag 
catcaggatg 
ttgggagaaa 
gggatccttt 
gccgctgtgt 
ctgggggtcc 
agcagaaaca 
aggttgcaag 
gctccgaact 
gatggagttt 
atgtaagtgg 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 
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gtttgcaaat 
agtaacatta 
tatttccttt 
acagcacatc 
gacaacggtg 
gatgtccggc 
ggcggcctcg 
caggcgccgg 
gggcttgcgg 
tacgagatgt 
gacgacggac 
gcagtagaga 
cagtcagcag 
caagccagca 
aaatccgatt 
aagagcgaca 
gcttggagtg 
tcgctactca 
aagctgctca 
tcgcggttgg 
gcccacacca 
attggctgtg 
tatgggtctc 
ttgaatatcg 
tcaggcaggg 
caggtaagaa 
acttaatcga 
acaatatttt 
tcgacgtctc 
ccaacaaaaa 
aaaaggccga 
cccaacctgt 
caaccgattc 
tagaggtagc 
tcgaatacac 
atatttgaat 
taactaaaat 
cttgtttttc 
gcggggctca 
aaagtaaaag 
ggcagtcgca 
cgagcataac 



gtacatatat 
gccgccaaaa 
ttgtatattc 
aaagccccta 
gccagcatga 
gagatggcgt 
tcgggcagaa 
gtctgggcaa 
gccgcccgct 
aaaatacaga 
cacggactac 
gcggattagg 
caaatgtgca 
ttgcagatta 
ttaattgcgg 
gaaagaccaa 
ttcaaaggaa 
ccttgaaaat 
gcgaggagta 
gcgtgttgtt 
gcagtatcgt 
tcacagcgat 
cgaaataggg 
atgaagtaga 
cccaggctcc 
cgaggatgtg 
ctcgggaatg 
cccatttccc 
cggcagcatc 
cgtcaggaat 
gtgggccacc 
cgtggttgca 
ggtattagaa 
tgtcacatag 
tcggcggagc 
aatactacta 
gcatgtagac 
ttcttcaatc 
acgcgtggga 
tgaaagtgaa 
gacttggaat 
aa 



gtccaatccg 
gaaggtctta 
caaatctgat 
aatggtatcg 
actcggtgga 
tctcgtcctc 
tcctactggc 
tgttctcgat 
gcttccgcgc 
aaacagtata 
gggctgtggg 
cactgccatg 
caaacagaaa 
tggatctgcg 
tcttttaata 
agcgcggaaa 
ttaacatgtt 
gttccagtag 
gagtataaag 
gaggcccaga 
aaggcaaacg 
gtatctgtgg 
gccgctgtga 
gcatatcaca 
atttacctgc 
tatgtcgatg 
gagctctatt 
gtgctgctgc 
tatgttatgg 
ttcccaagcc 
agttcggttg 
ctccccatag 
tggcatcgat 
gtatttgttg 
cagtttccca 
ctacatatat 
ggtatcgaat 
ggttttgcgc 
agagtgtaaa 
agtcaagtga 
ttcttagaac 



actccattgg 
cacagccaaa 
ttaaagctat 
taaaatgagt 
cttgtcgttc 
ctcctcgttg 
gtggcgactg 
gacgctgccg 
tcgactcctc 
ccaaacatgg 
acacgggatg 
ctgcgcttgc 
cagaaacagg 
ccagcaatat 
atagtgcttg 
attcgctcta 
tggaatatat 
agaaacacca 
tcggcgttgt 
actatcggcg 
cggcggctat 
gatatatccg 
actcacctgt 
tccatggcta 
aaaacaaaaa 
cggatgagga 
ggaatgccca 
cacaaattag 
cccaacccga 
aaacaatttg 
gaacttaatg 
cccgtgtacg 
agctttttgg 
actgatctgc 
gaatcagtag 
tgtacttata 
agtcgaagtt 
ctaaattata 
atattgatac 
aagcagcata 
tcggatgaag 



caacgtgttt 
aaaaaacttt 
catgttctac 
actcacttgc 
acgatgacgt 
gagccgctgg 
ctgtccaggg 
cccgtgagtt 
agcgcctgtg 
gacacaggat 
caggatgcag 
atcctacgga 
aagagaaaca 
tcattccact 
aaattttttt 
ggaattaatg 
cgtatatata 
tgatgatgca 
agaaggcgca 
agccgatggc 
ttttgtgctt 
tggaatgtgt 
ctatggagat 
tatagaaatc 
gaaggagggg 
gcacacttca 
tcctttgtcg 
ttttagactc 
atgcggactc 
tagcacacaa 
tggatatgtt 
aggtatacat 
atgtgaatac 
gggtaagagg 
tttaacattt 
tgccaccaat 
aatcgaccct 
caaacacatt 
acacgcaacc 
gaaatgtgct 
actccttagc 



atattcgtga 
gtgaaacgta 
gcatttctac 
ccgtctcctc 
ggcagtcgtc 
ccgtgttggt 
ccgtctccgc 
ccgataggtg 
ggtgggtggg 
acagactacg 
gatgcaggat 
tattcgcccc 
gaagcatccc 
cgactcggag 
gcgcaatgga 
tttgtgggaa 
catatatcgc 
gggtatgtag 
tacatcgggt 
agccgatatg 
ggcgtacttt 
gcttgggtca 
ggcaactaag 
acaaacgacg 
gggggggcgt 
tttgttcggc 
aatattaata 
tatctcttca 
cataaaaacc 
agcacataaa 
tgggtgaaca 
aatgtagcgc 
tattagtaag 
tatcgcatag 
taagaagtca 
gtgtgtgtgc 
aacgtactat 
gaaacttgaa 
acaccacacg 
aaaatagctg 
cggagctaag 



4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4 620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6552 



<210> 131 
<211> 1392 
<212> DNA 
<213> Drosophila 



<400> 131 

ttgcaggtaa 

gatgtgatat 

atcgctgtga 

acgatactgc 

aacaacacgc 

tactcctcgc 

attttcaagg 

tcggaactca 

acggccctgg 



atggagcctg 
gctctacttc 
cacagccaat 
tggtgtgggc 
ccaaccgcga 
tgagcagctt 
cgctgaggag 
cgggcggcag 
acagcagtcg 



ggccctgcct 
atcgatattc 
aaagtacgcc 
catatcggct 
acccgatgta 
ctacataccc 
tcgagcgcgg 
cgtcatcgag 
ccacgccagt 



gacgtcgttt 
aacttagttg 
aagcacaaaa 
gccatcggct 
tgcgccttct 
tgcatcatca 
aagcagcggg 
aacattgccc 
aggattctgc 



gtgatttcta 
ccatctccat 
atagccgccg 
cgccgatagt 
acaacgccga 
tggtgtttct 
cggcccgcaa 
agacccggcg 
ccgacgaggc 



tatagccatg 60 
agacagatac 120 
cgtttgcctt 180 
tctgggcctc 240 
ctttatactc 300 
ctactggaac 360 
gccccaccta 420 
cctggcggag 480 
cgccaccaac 540 
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acggccagcg 
tgccacgtca 
acgggcaaca 
aatggtaatc 
gcgcccgcct 
gacagtccgc 
cccaccggcg 
ttgtcctttg 
ctatcgacta 
ttcacgatat 
aaggagcgca 
ctgccgttct 
cgacctgggc 
aacccggtta 
cacatggggt 



gctccaacga 
tcgtgaacga 
gcgttgtggc 
atgattctgg 
ccgcctctgc 
tgcccagtgg 
acgactcgcc 
tgcggtacgg 
catcgaaaac 
acaaggtgca 
aggccaccaa 
tcagctgcaa 
tcaccgccta 
tctacacaat 
ga 



ggaggaggac 
caagtccacc 
ccaaatcacc 
ttatgcagcc 
tgccacatca 
cgccaccctc 
gaagcgcggc 
agtgcaggag 
gtcctcgcgc 
caaggcctcg 
aacattggcc 
catcatggac 
catgatgacc 
attcaatccc 



gagaacgcca 
gagttcatgc 
acacagccgc 
tcaaacgttg 
gccgctccgc 
cagcgatcca 
gaacccgccc 
gccatgactt 
aaggataaga 
aaaaagaaac 
at.tgttttgg 
gccatgtgcg 
acctggctgg 
gagtttcgca 



tctcgccgga 
tggccaccgt 
agctggtcgt 
acgatgtcct 
ggagcagcgg 
gcgtcagcag 
tcagcgtcgc 
tggcacgcaa 
agaactcgca 
gcgaaaaatc 
gtgtcttcct 
ccaagttcaa 
gctacatcaa 
aggccttcaa 



catcgacgac 
tgtcgaggag 
cgccgatccg 
tgcaggcgtg 
ctcgccgccg 
tcagcggcgc 
catgaagcca 
cgactcgacg 
ggcgtcaaga 
gtcggccaaa 
gttctgctgg 
gaaagactgc 
cagctttgtg 
gaagatcatg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1392 



<210> 132 
<211> 444 
<212> PRT 
<213> Drosophila 



<400> 132 

Met Asp Val He Cys 

1 5 
Ser He Asp Arg Tyr 
20 

His Lys Asn Ser Arg 
35 

He Ser Ala Ala He 
50 

Pro Asn Arg Glu Pro 
65 

Leu Tyr Ser Ser Leu 
85 

Phe Leu Tyr Trp Asn 
100 

Gin Arg Ala Ala Arg 
115 

Val He Glu Asn He 
130 

Asp Ser Ser Arg His 
145 

Asn Thr Ala Ser Gly 
165 

Pro Asp He Asp Asp 
180 

Phe Met Leu Ala Thr 
195 

Gin He Thr Thr Gin 
210 

His Asp Ser Gly Tyr 
225 

Val Ala Pro Ala Ser 
245 

Ser Gly Ser Pro Pro 
260 

Arg Ser Ser Val Ser 
275 

Lys Arg Gly Glu Pro 
290 



Ser 


Thr 


Ser 


Ser 


He 
10 


Phe 


Asn 


Leu 


Val 


Ala 
15 


He 


He 


Ala 


Val 


Thr 
25 


Gin 


Pro 


He 


Lys 


Tyr 
30 


Ala 


Lys 


Arg 


Val 


Cys 
40 


Leu 


Thr 


He 


Leu 


Leu 
45 


Val 


Trp 


Ala 


Gly 


Ser 
55 


Pro 


He 


Val 


Leu 


Gly 
60 


Leu 


Asn 


Asn 


Thr 


Asp 


Val 


Cys 


Ala 


Phe 


Tyr 


Asn 


Ala 


Asp 


Phe 


He 


70 










75 










80 


Ser 


Ser 


Phe 


Tyr 


He 
90 


Pro 


Cys 


He 


He 


Met 
95 


Val 


He 


Phe 


Lys 


Ala 

105 


Leu 


Arg 


Ser 


Arg 


Ala 
110 


Arg 


Lys 


Lys 


Pro 


His 
120 


Leu 


Ser 


Glu 


Leu 


Thr 
125 


Gly 


Gly 


Ser 


Ala 


Gin 
135 


Thr 


Arg 


Arg 


Leu 


Ala 
140 


Glu 


Thr 


Ala 


Leu 


Ala 


Ser 


Arg 


He 


Leu 


Pro 


Asp 


Glu 


Ala 


Ala 


Thr 


150 










155 










160 


Ser 


Asn 


Glu 


Glu 


Glu 

170 


Asp 


Glu 


Asn 


Ala 


He 
175 


Ser 


Cys 


His 


Val 


He 
185 


Val 


Asn 


Asp 


Lys 


Ser 
190 


Thr 


Glu 


Val 


Val 


Glu 
200 


Glu 


Thr 


Gly 


Asn 


Ser 
205 


Val 


Val 


Ala 


Pro 


Gin 
215 


Leu 


Val 


Val 


Ala 


Asp 
220 


Pro 


Asn 


Gly 


Asn 


Ala 


Ala 


Ser 


Asn 


Val 


Asp 


Asp 


Val 


Leu 


Ala 


Gly 


230 










235 










240 


Ala 


Ser 


Ala 


Ala 


Thr 
250 


Ser 


Ala 


Ala 


Pro 


Arg 
255 


Ser 


Asp 


Ser 


Pro 


Leu 
265 


Pro 


Ser 


Gly 


Ala 


Thr 
270 


Leu 


Gin 


Ser 


Gin 


Arg 
280 


Arg 


Pro 


Thr 


Gly 


Asp 
285 


Asp 


Ser 


Pro 


Ala 


Leu 
295 


Ser 


Val 


Ala 


Met 


Lys 
300 


Pro 


Leu 


Ser 


Phe 
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Val 


Arg 


Tyr 


biy 


val 


bin 


blU 


Mia 


Met 


i nr 


Leu 


Mi. a 


Arg 


Asn 


Asp 




305 










Tin 




















~k ?n 

J £- \J 


Thr 


Leu 


[-> 

ber 


i nr 


Thr 


C V 

ber 


Lys 


l nr 


Ser 


Ser 


Arg 


Lys 


Asp 


Lys 


Lys 


Q. C TT1 

noil 










o o c 




















JJJ 




Ser 


Gin 


Ala 


Ser 


Arg 


Phe 


Thr 


He 


Tyr 


Lys 


val 


His 


Lys 


Ala 


C n v 

ber 


Lys 








34 0 










"3 A c. 
i<i O 
















Lys 


Lys 


Arg 


Glu 


Lys 


Ser 


Ser 


Ala 


Lys 


Lys 


Glu 


Arg 


Lys 


Ala 


Thr 


Lys 






355 










360 










o r c 
36b 








Thr 


Leu 


Ala 


He 


Val 


Leu 


Gly 


Val 


Phe 


Leu 


Phe 


Cys 


Trp 


Leu 


Pro 


rhe 




J / 0 










J / D 










ion 










Phe 


Ser 


Cys 


Asn 


He 


Met 


Asp 


Ala 


Met 


Cys 


Ala 


Lys 


Phe 


Lys 


Lys 


Asp 


385 










390 










395 










400 


Cys 


Arg 


Pro 


Gly 


Leu 


Thr 


Ala 


Tyr 


Met 


Met 


Thr 


Thr 


Trp 


Leu 


Gly 


Tyr 








405 










410 










415 




He 


Asn 


Ser 


Phe 


Val 


Asn 


Pro 


Val 


He 


Tyr 


Thr 


He 


Phe 


Asn 


Pro 


Glu 








420 










425 










430 






Phe 


Arg 


Lys 


Ala 


Phe 


Lys 


Lys 


He 


Met 


His 


Met 


Gly 











435 440 



<210> 133 
<211> 2970 
<212> DNA 
<213> Drosophila 



<400> 133 

tccatgccag 

gagagttcgg 

agcaatggtc 

cgcatcatgc 

ggcaatggct 

ctgattagct 

agacggtccc 

ttcacagact 

agcgatccac 

caggaaacac 

gagtggcggc 

tatgtcattg 

gacatcgtgc 

gagaaggtgc 

gaggtaagac 

tgtttaagaa 

ttctaccaag 

tgatctcctg 

cctcgcactg 

agttcttcca 

tcgtcttgta 

agaagtacgg 

tcttccagtc 

ccaccgactg 

cgaggaacat 

caggcgcgag 

ctggcagcag 

ctcgacgggt 

atttatcgcg 

taccgatcca 

aggaagaagc 

accagatgat 

aacagattgg 

atattgccgc 



gcctggatgc 
tttgcgtttt 
agcgctacac 
agttcctgtc 
gctatcgtgc 
gggtggacgg 
aagttgctgg 
tgttctacaa 
gacgccaatg 
ccaatgatct 
aatcggtgcg 
gcctgggcga 
atatcgacta 
ccttccgtct 
tattgctcat 
acggtgctcc 
ttaaagcgtt 
agatgaggat 
cctcgccgtt 
cgagtagata 
gatcatcgag 
tagccaacag 
tggatggagc 
tcgcaatcgc 
acagatccag 
aacttcgtcg 
agccaccaat 
catgtatcta 
ttcgcttaat 
ccgcaatcga 
gcagcaggca 
agggcagtgc 
agatgaccga 
ccaggataag 



acatgggccg 
gcccaccaag 
tttcctgttc 
catctcgaat 
ccaccactat 
ggtcacacca 
gaatgcggga 
caaactctcg 
gccaatatcc 
gcttgcccgc 
tcgctttgtc 
tcgtcatttg 
caacgtgtgc 
tacccagaat 
cctataccaa 
taaagataat 
gggtttccca 
gtggcctcca 
cggcagcata 
atcggattga 
ctgctctggc 
agcgtgaagc 
agcagatttc 
agcgcctgct 
atgatcacaa 
aattcgcacg 
gcacccaagc 
cggtttcaaa 
accttacctt 
gatcagagtc 
gggacctctc 
cagtcccaca 
tcgcaggtgg 
gacaaacaag 



gatggggatc 
acgaaaccca 
aagggcatgg 
gccattatgg 
tcggttatac 
gtctttgcgc 
gcaggtgctg 
cccctgttgg 
gtcctgctcc 
gaactgtggt 
cgctgcatgt 
gacaacgtgc 
ttcgagaagg 
ctagtgcagg 
tcaattatct 
gtccactttt 
attccgttga 
tgcggtgctt 
atgtttggac 
gagcagagtt 
agatgctaaa 
agcccatgat 
gcatggtggt 
tggatgcctc 
aacccggcgt 
cctgtgcatc 
accaattgaa 
gggtttcaca 
ctatagtgca 
aacatggaca 
atggcgccaa 
cagaagtcgg 
cggcaggtgc 
aaggcattga 



aggtatatat 
aaaaggtcgc 
aggatctcca 
cctgccgcag 
cattgggacc 
tctacaagaa 
tggcaaatgt 
ccaagcacaa 
aggtgctcga 
gccaggcggg 
cggtcatgtc 
ttatcaacct 
gccgcaccct 
ccatgggcat 
caacaaatcg 
attgctatat 
tcactgagtg 
gctgtccgcc 
aaaggctcgc 
ggcaatggcc 
tagctgggca 
gaagaccaca 
ggcctcgtcc 
gcgcatgatg 
tatcactccg 
cggccatcga 
tatgatgatg 
tatgtagcca 
gagcatagac 
cgcagcaggc 
tgtccgagcc 
agacggccag 
gcaccgccac 
tggctagcca 



cgagagcgtc 
cttttacggc 
tctggacgaa 
cgacgcaccc 
gcaatccgga 
gtggcaacag 
cccgcgacgt 
tatgcaggtg 
cgaattgtca 
aaatgccgcc 
gatgattggc 
gggaagcggc 
gcgcatcccc 
cacaggaatc 
ctgctggggg 
ctccatttaa 
gccttcggtt 
ggcaattgat 
cgaaagccag 
agagagaacg 
atggccacac 
atctggacgc 
gtgttgtaga 
cgccagtaga 
gcaatgtaac 
ttccagaaca 
ctatatgcca 
ttgaattagg 
gactgctata 
gcagatgagc 
catgtaaaat 
cgacaggatg 
aatggtcaga 
cagcagatgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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ccggcaccga 
tgctgccaca 
ggcgatgagc 
agtggttggc 
cggctcggct 
ttgtttacgt 
gcccagatcc 
cattcaccac 
ccagccaaac 
atggccaaac 
attgataaaa 
tcccaaattt 
taattgttat 
ccactcttgg 
tctctgatct 
tcagaacaga 



aactggtcgc 
ggaacgggac 
tgctggtaaa 
tatctggcaa 
cggatcggat 
ttcgctggtg 
accgtgagaa 
cctcatacct 
tatccattgc 
agatcgctgg 
ggccagggcc 
cctcgataca 
aaatccgaga 
ttcttttaga 
aatcccatat 
ccaggagttt 



ctggagggtt 
gacacccaac 
tgccatgagc 
ccgtgggagc 
tcagattcaa 
gctttacgac 
ctgcaatccg 
aaccactcac 
ccccgagtat 
gggttttagc 
agcaactggg 
ttgcaagcga 
ccaacagtac 
attagaaatc 
ttgaattatc 
gctaaggatg 



atttccgttg 
gcttccgggg 
cacgatggta 
tggagttgtg 
ttgggtttcg 
atcgtcatcg 
ttgataagcc 
ccacccatcg 
ttcgagtgtc 
agattagccc 
atacgtggta 
agaaactata 
aaccagtact 
aggagaatat 
tttaagatga 



gcgtttccgt 
agcttcgata 
aacgaagcgc 
ctgacttttg 
gctcggaccg 
cgccagttaa 
ctccactttg 
agtcaatcca 
cggactcccg 
agctccgcgg 
aacagataca 
tgtaaatagt 
attaggattg 
atagtatata 
gatattaaag 



ggcgttctgg 
atcccaagcg 
acgcgacgcc 
gccaggactt 
tgggcgtcgt 
ctgatagcgc 
gcccaaccac 
accactccaa 
ggcccaatgc 
accaaaatag 
tcattgatat 
gatatcatca 
acatggttat 
taggtacaac 
atttcatgga 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2970 



<210> 134 
<211> 1116 
<212> DNA 
<213> Drosophila 



<400> 134 

atggcattta 

ggtgtcgtcc 

ctccaggcga 

tttgtcctta 

ctgcgatcgg 

ggactggcac 

ggtccctgcc 

ctgatctcga 

tacatgaccc 

gtggctctgc 

gaagttctcg 

atctgtatgt 

ttgcgacagt 

catccagact 

tggctaccgt 

atgatctaca 

tactcgtgga 

gcgaggcagt 

tcatctcagg 



ccagcagctc 
cgttcctgtg 
ccagtttcgg 
tcctgggcgg 
tcatctccaa 
tgccctatca 
tgctgcgctt 
ttgcggtgga 
gtcgagtggc 
tgccagtgtt 
cgcctggtta 
tcctcgtcta 
cggtggtcta 
ggaagagcgt 
acttctgtgt 
agacgacgtt 
agaactctgg 
gcgaggatca 
agatcaaacc 



atcgcccgct 
gcagcaccag 
tgccggccat 
caatattctg 
tctgttcatc 
tctggtattt 
cttcctgctc 
tcggtatata 
atatagcatc 
ctggaatcga 
cattgccgga 
ctggcgcatc 
caacacggac 
ccagattgtg 
ggccattgcc 
ctctctggcc 
ctttcggcga 
attgccggcg 
gaaggccact 



tgggattatc 
aacgccacgg 
ctgctgtggc 
accattgtgg 
ctgtcgctgg 
tacatgggct 
atctgcgcct 
gcagtcgtct 
atcatattca 
tggccggatg 
gtgataacgc 
atgcgcgagg 
gaggccacca 
gtcttcatca 
cagctattta 
attgccaact 
gcctttgtcc 
gacagcaagc 
cagtga 



gaagctcccc 
aaacgccaac 
tagccatcaa 
cggtgcgcac 
ccgtctccga 
cggacattgg 
gctgcgtgtc 
atgctctgca 
attggtgctt 
cacaggcgtg 
cgggttttgt 
catccaagca 
ccatgcgaaa 
tgggctgctt 
gcatctgcca 
ctgctctcaa 
aaacattatg 
accgcatgga 



ggaagcgttg 
ggaaataacc 
tgccttcttg 
ctgccgccac 
cttctgtgtg 
cgccatgaga 
catgttgact 
ctatagaaga 
gggtgcattg 
cgaattcgac 
gatcatctgg 
ggcgctgcga 
tctgctgctc 
cacgctctgt 
gagcagctcg 
tccgattatc 
ctgccgaacg 
ggccacatcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1116 



<210> 135 
<211> 371 
<212> PRT 
<213> Drosophila 



<400> 135 

Met Ala Phe Thr Ser Ser Ser Ser Pro Ala Trp Asp Tyr Arg Ser Ser 

15 10 15 

Pro Glu Ala Leu Gly Val Val Pro Phe Leu Trp Gin His Gin Asn Ala 

20 25 30 

Thr Glu Thr Pro Thr Glu He Thr Leu Gin Ala Thr Ser Phe Gly Ala 

35 40 45 

Gly His Leu Leu Trp Leu Ala He Asn Ala Phe Leu Phe Val Leu He 

50 55 60 

Leu Gly Gly Asn He Leu Thr He Val Ala Val Arg Thr Cys Arg His 
65 70 75 80 

Leu Arg Ser Val He Ser Asn Leu Phe He Leu Ser Leu Ala Val Ser 
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85 
























Asp 


fne 


Cys 


vai 


Gly 


Leu 


nld 


Leu 




iyr 


n jl j 


Leu 


Val 


Phe 


Tur 


Me t 






inn 




















110 






Gly 


Ser 


Asp 


lie 


Gly Ala 


Met 


Arg 


biy 


Pro 


Cys 


Leu 


Leu 


Arg 


rne 


irne 






115 










ion 
IzU 










1 £. O 








Leu 


Leu 


lie 


Cys 


Ala 


Cys 


Cys 


M. 1 

val 


Ser 


Met 


Leu 


J. nr 


Leu 


lie 


Coy 

oer 


lid 
lie 




t n a 
ljU 










J. JO 










X 4 U 










Ala 


Val 


Asp 


Arg 


Tyr 


lie 


Ala 


Val 


vai 


Tyr 


Ala 


Leu 


nis 


Tyr 


Arg 


Arg 


145 










150 










1 J J 










1 ou 


Tyr 


Met 


Thr 


Arg 


Arg 


Val 


Ala 


Tyr 


ber 


lie 


lie 


lie 


rne 


Asn 


Trp 


Cys 










165 










1 / U 










1/0 




Leu 


Gly 


Ala 


Leu 


Val 


Ala 


Leu 


Leu 


Pro 


val 


Phe 


Trp 


Asn 


Arg 


Trp 


Pro 








180 










185 










T OA 

i yu 






Asp 


Ala 


Gin 


Ala 


Cys 


Glu 


Phe 


Asp 


Glu 


Val 


Leu 


Ala 


Pro 


Gly 


Tyr 


lie 






195 










200 










O A C 

zUo 








Ala 


Gly 


Val 


lie 


Thr 


Pro 


Gly 


Phe 


Val 


lie 


lie 


Trp 


lie 


Cys 


Met 


Phe 




210 










215 










o o a 
220 










Leu 


Val 


Tyr 


Trp 


Arg 


lie 


Met 


Arg 


Glu 


Ala 


Ser 


Lys 


Gin 


Ala 


Leu 


Arg 


225 










230 










one 










O A A 
C H U 


Leu 


Arg 


Gin 


Ser 


Val 


Val 


Tyr 


Asn 


Thr 


Asp 


Glu 


Ala 


Thr 


Thr 


Met 


Arg 








245 










250 










o c c 

2oo 




Asn 


Leu 


Leu 


Leu 


His 


Pro 


Asp 


Trp 


Lys 


Ser 


Val 


Gin 


lie 


val 


val 


rne 








260 










1 c c 

£ O 0 










Z t U 






lie 


Met 


Gly 


Cys 


Phe 


Thr 


Leu 


Cys 


Trp 


Leu 


Pro 


Tyr 


Phe 


cys 


val 


Ala 






275 










O O A 

2o\J 










Z O J 








lie 


Ala 


Gin 


Leu 


Phe 


Ser 


lie 


Cys 


Gin 


Ser 


Ser 


Ser 


Met 


Tin 

lie 


Tyr 


Lys 




290 










295 










■3 A A 

JUL) 










Thr 


Thr 


Phe 


Ser 


Leu 


Ala 


lie 


Ala 


Asn 


Ser 


Ala 


Leu 


Asn 


Pro 


lie 


lie 


305 










310 










315 










320 


Tyr 


Ser 


Trp 


Lys 


Asn 


Ser 


Gly 


Phe 


Arg 


Arg 


Ala 


Phe 


Val 


Gin 


Thr 


Leu 










325 










330 










335 




Cys 


Cys 


Arg 


Thr 


Ala 


Arg 


Gin 


Cys 


Glu 


Asp 


Gin 


Leu 


Pro 


Ala 


Asp 


Ser 








340 










345 










350 






Lys 


His 


Arg 


Met 


Glu 


Ala 


Thr 


Ser 


Ser 


Ser 


Gin 


Glu 


lie 


Lys 


Pro 


Lys 




355 










360 










365 









Ala Thr Gin 



370 



<210> 136 
<211> 7659 
<212> DNA 
<213> Drosophila 



<400> 136 

gectaaaega 

cacatgccag 

tgatatccaa 

caaacatttg 

gttgacagaa 

aatgtgcaag 

aatacaagct 

aaaataaatg 

gecactgata 

ttcccaatta 

ttaattacca 

agcaattatt 

aaactcaatt 

cgaccataaa 

cgccaagctg 



taagattaga 
aaatttgtcg 
ctggacatga 
tttgctgttt 
atatgeaage 
ctcaccgaat 
ggtgaagcga 
tggcggtcaa 
ataatagttt 
gaaaactttt 
gatgacagtt 
tttagacttc 
aagttacaca 
taattccget 
ttggaaaacc 



aaatggaaat 
tttgccatcc 
ccagtttccg 
tctgaccaaa 
agacacaagt 
aaatattata 
ttgetctgea 
tttccagctg 
tttttattat 
ttttaacaaa 
tttatttctt 
tgccacgaat 
cgcaacccac 
gcaatcagtg 
aattaaattc 



ggaaatgaag 
aaggccccga 
gaggaatgea 
aacaagccac 
aaatatttat 
ttttaaaatc 
aattccacat 
aacagttctc 
tcgattacat 
atatttaata 
ttgactatac 
gectttaatt 
atcttctgtg 
caacagtatt 
cctttcaaat 



acagccacct 
tttccgaaat 
ctccagttac 
acttatgetg 
ttgattattt 
aataagtaga 
atttaaacta 
aacaattaat 
actttaaata 
ttcegtaaca 
ttacattatt 
aatagctgat 
taaaaaaaaa 
cagtcctgga 
ggagcgtaaa 



gaegcaatet 
cgaaaatcct 
acaactggca 
gcactttttt 
attttaaaga 
aacctaaaca 
ttaatttaca 
gaaaataatt 
tttttttagt 
gtatttatgc 
ttgaaaaaat 
geaatcgett 
tcaatgtatt 
actttaatag 
aegcaaaata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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cttacgtatg 
tttgattgac 
cacgacgacc 
tggcagtggc 
cgatgcactt 
gtcgtgcact 
tttatttgat 
tataaacata 
gccagttatt 
ggaaatataa 
caactgcaat 
ccctttcctt 
ccactgtgcc 
cgcctacgaa 
ggcaggagct 
ggatctgcat 
gttacatata 
ggataagttg 
catccccgag 
ccaaatcaat 
agtattcaca 
attatccaaa 
agagtctaac 
ggcagagtag 
ggatttaaat 
tggcaaggtg 
ttcaccaact 
aattatttct 
atataaatat 
attatgcttg 
taaatgtgga 
gcattaatgg 
accagggcaa 
aaacaaggcg 
cgccaccccc 
cttgtacaag 
cgccggatgt 
tgccacacag 
tcttggccaa 
gacctgataa 
gtgccactcg 
ttttgtcacg 
gattatgcaa 
ggttaacttg 
cgcaaaatgt 
tttagcaagt 
ggtaaagaat 
gcatggttaa 
cccacaagga 
ggtccaacta 
aaatatccta 
cacacagtga 
tcgttggcca 
ttggccacaa 
cgctggtgac 
gatggcaaac 
atgctcattg 
acattgtgtc 
gcagaccaat 
gataaaggat 



atgggggcgc 
taacgaaact 
atgagcgacc 
accaacgtcg 
cgggccgcct 
gcgagaaaat 
gcagttgtaa 
aatcaaaata 
tgtctttact 
cagtttttgg 
ttcggcgtcc 
ttggcccctt 
ggcacctttg 
ctctgcccgg 
gaattgttta 
atgacacgaa 
ggttaaatag 
gttgggttgt 
ctcatcgtta 
tggaaacttg 
aaaccacacg 
agagctcgac 
cgaaatcaaa 
aagtcaaata 
tttatcatgt 
ttgatgacgg 
cattgtcgta 
caactttctg 
aaatataaat 
gcattaaagc 
taaatatttg 
tatacaatat 
cgaaaatagc 
agcgtcaaga 
tttttcatcc 
attatgtgct 
cgatgtggtg 
cgcgtatacg 
agatcatcgg 
aactcacctt 
gtaaattgca 
aacttcgaat 
ttttcaaatt 
gtggcagatg 
aaatatgagc 
gtaaatattt 
taatcgtata 
ttatttttta 
atgcggtctc 
cacaacctgc 
tatatccttt 
atctgatctc 
tattgggtta 
gttgacagct 
ttatattgcg 
aaatgcgaca 
cactcattgc 
cccatctcct 
tagtgctctg 
aatggatgtg 



cacacactcg 
tatttgccac 
agattggcaa 
cctccatcgc 
gcgaaacgcg 
atatcacaca 
aacgctcacc 
aatttttttg 
taagcgaaag 
tatttcacta 
attatgcacg 
ggcaaacagg 
atggctggct 
acttcatcac 
ttaaattcca 
aaaatatata 
catgccaaag 
ttggtaggtg 
tatacacata 
ttatgtttat 
ggctaaaggc 
aaaatgctaa 
tgtaaggcag 
tgtgtgtggg 
ttgtgaattt 
tcataataac 
tcattaacat 
ccacactgcg 
taatcaactg 
aaacgtttaa 
aaccaattta 
gtagccagcc 
gagagttgaa 
aagtgctact 
tttttgcatt 
ttggcactgc 
agctgtggtc 
tgattttctc 
tcgaaaaaaa 
cgaagttcct 
ttgcaaactt 
tgattcgact 
gatatgcttg 
caataaataa 
cggaatattt 
gtgtggtgca 
tacaatattt 
tttatcgcaa 
gatggcgagt 
gtaaatctcg 
atgtaattaa 
cgaggttggc 
tttcaagtga 
tcgttgggag 
acttatatta 
gtggcttaat 
tcatttaatg 
ggtgactcga 
ggtttgttta 
cggactgagg 



cacacgcatt 
tttatttcag 
ccccaatgca 
cgaaagtgtg 
gctaaatgcc 
gcacataact 
agtaaatttc 
cactgtttta 
aaaaaattca 
catatttttg 
agtaacttta 
ctccggagga 
ttgttggccg 
gggattcgat 
caaagctaca 
tatatacata 
ggaaatgtgt 
gagtgtgcca 
aatggaacag 
gctttgattt 
aaagctcctt 
aatgtttcgc 
aaaacacggc 
tatttaaaga 
catatttttt 
ataactgaag 
atcataaata 
tgtatttcat 
attggccaaa 
aaacaaagcg 
tggcataaat 
aacttaaacc 
tgactttttt 
tttgctacat 
tttttttttg 
gccgccatta 
gagaactccc 
ggtctgcgtg 
aaagaaaacg 
aacaaaaatt 
ggtaatctac 
taaacacaca 
caattcgaat 
tggttaattc 
atgcagtagt 
caggatgtga 
attgaggagc 
actgacattg 
ggttcaagca 
aagacctcaa 
ttaatcgcaa 
tacggcacct 
gtatggccaa 
atggcaaaaa 
tgacgactgg 
caactgccaa 
cggcacaatt 
agcaggatat 
tcgcacattc 
ggcgagatcc 



tatgagctcg 
cggacgcagt 
acattcagtg 
gcggagagcg 
agcggtcaac 
attaaagttt 
atttaaattt 
cttacattcc 
ctctgaaatt 
cccgtgcttc 
aattgctgac 
ccaggcgcag 
gatacggctg 
ccagcaagta 
tccgtttcgg 
catatatcac 
ttgcacgagg 
gtctaccctc 
aaaatacgct 
cgtgtcgtat 
ggagctccct 
ctgatttggc 
caaattgatt 
tgctggagca 
taatcattaa 
aagaaaaccc 
aaaatatcaa 
tcaattataa 
cgcacaaaga 
cacgctctag 
catgcgctgc 
tcaacctcgt 
tgttatgaca 
ttagcttgtg 
ccttacgggg 
acttttatgg 
tcaccccccg 
ccaaacggcg 
gaagagccgg 
ctccagtaat 
aatcgaatcg 
catgacccaa 
gcgaaccgca 
aattgaatgc 
caatcatgaa 
aaatgtttgc 
tgtaataagc 
ctttaattat 
tccgctgacc 
cgtaagccga 
cactatatat 
cactgctggc 
gggctttgca 
cacttagggg 
cgccataagt 
cctggatgct 
gacgccggaa 
cgattaattt 
gaatggaact 
catagacggt 



gcaaaaccca 
gcagcagttc 
gcagtggcag 
gaccggactt 
tggcgggtga 
atgcagttat 
tagatgtttt 
ctttctttat 
gtagcactag 
tggctctggc 
ctaaccactc 
aagcaggaac 
tcggcacttc 
agtgcgccgg 
ggtcagctgg 
atgtatgtat 
caatgcggtt 
taattgggct 
ccaagccagg 
ggaaacatca 
gggccactaa 
acaatttgcc 
gaggctgctg 
cgaagaaagg 
tcagtgttct 
tttgaatagt 
aggtgacttg 
atatatatat 
aaactaatta 
aaaatcgtca 
atgataattt 
tttggccatg 
gagcggagaa 
gttagcatgg 
tcgccgccag 
cacttccggt 
gctacttatc 
ataaatggat 
aaaaacgaac 
ttttctacaa 
aatggaaagt 
acaacagaat 
attcttttgc 
tcgggccgac 
atcacacata 
tagaaaaaac 
agagtaaaca 
gcaggatatg 
aataaaacat 
tcccatacat 
ccagtggagg 
cattctgctg 
ctccttcatc 
taatgaggca 
gaaatttact 
ggatgcccaa 
atgggaaaat 
cccagtgcct 
ggggctcaag 
tcggccaact 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 
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gacagaactg 
aatttggtct 
accagccccg 
caaataaaat 
agaagtatta 
agaaagttta 
gtaaagtaga 
aaatcaaact 
aacggaaaat 
cctctttcgg 
atagacaaac 
cgattcgcat 
acatgaatct 
tggtcatgcc 
tgcgtaatgg 
ccagatgcgc 
ctggatgctc 
gaagaggctg 
cgtctatagc 
taatgaatgg 
gaccgacaac 
aatcccacag 
tcagtctgtc 
tcatctcgca 
attcttatgg 
gtgcgagtgg 
tcgcgaacgg 
aagaactgtt 
ccacggggcg 
ccatcaggtc 
ttgaattacg 
aggcattata 
tcggcctcca 
catacgaaat 
gtttctgttc 
ttgttttgca 
gttagttcgc 
ataccgaata 
taatgaaaat 
ttaaattgct 
catgaaatgc 
aactcagtct 
ccgcaactta 
aatacccaaa 
ttttcgtgtt 
atggaggtag 
caaaagtaat 
aacccaactc 
caggaaaatc 
gttgagctct 
ccttttttca 
agccagtcga 
ttatgtcgct 



ttttaaattg 
gccctaaaaa 
gaaaaacgca 
ataagagtat 
ttataaaata 
gaagtgaaaa 
gaagggatta 
agattcagtt 
gggaaaatgg 
ccctatccaa 
ttgcaaattt 
atttgtcttt 
gttcgcctcg 
gaattcggag 
tgaatcaacc 
tgcgttgccc 
tgcgagggat 
gtcaaatggc 
atggctcgcg 
gtacagctta 
agacaaggtg 
acttgagtat 
gagctcttga 
ttccttcgcc 
tgcctgtgtg 
tccttttgaa 
ttttgcaagc 
gggggggaga 
aatgcgtaat 
ttttattgtt 
catacgccgc 
attcaccctc 
gtacatcctt 
catctcggcg 
tcgtttcgtt 
cgctctcata 
ctcatggtgc 
ccaaatacca 
acacgccatg 
ggccgagatt 
actctcgcac 
cgatagacca 
aagacaacga 
gaatctttca 
tttgtgcacg 
aggtgcctaa 
ttctaaggca 
aatcggttca 
cattatcttg 
tgactatttt 
tcattgccaa 
cgggtacttt 
gttttccgtt 



tgtgccaaag 
agtgaataat 
ttcctcaaac 
aaaggaaatt 
taccatcagt 
gtagtatata 
acgaaaacaa 
tggccagatg 
acaaatgaac 
gctgcctcac 
ggccataaaa 
cgattgaaca 
ttcgctgcca 
ctactgcatc 
tcggcttccg 
tgcacataac 
tctatctgca 
tcatcgcatt 
gtctgggcgg 
caccgaaaga 
taaccatatt 
cacagattga 
ccaaatgcta 
aacttgcagc 
tatctccatg 
actgaatgcc 
atttcggtaa 
ttgaaacact 
tatattttgt 
tattcgctct 
gttgacttgc 
tcatcctgtt 
acgccctttc 
tttacggcct 
tttttttcgt 
aaatcacttt 
catccaccac 
ggactagcag 
tgaaataccc 
ggagcgaagt 
acaggaactc 
gcatttacac 
cggcaaactt 
gttcatgttt 
aaatggtttt 
aagtatgttg 
ccccctagct 
aaaaggtagc 
cgagtagaca 
ctataacttg 
tcgataggtg 
tcctttgcat 
ggcttatttg 



gcaaataaaa 
taaaagttta 
agctgtaatg 
gagtcgttgt 
atattctcag 
aaataattta 
tctaacctat 
tcagcaatca 
tttgtccttt 
atgcccctgg 
acttgccaca 
gatccctgaa 
ataactcgtt 
agagtccggt 
atttccatat 
gctacactac 
caccgtcctc 
cggctggggc 
cacgcccgag 
aaatgaggaa 
gtgttgtcta 
tatcaagata 
atcaaatcaa 
tgctggatga 
ttcctgaacc 
ccggccagta 
gttttcccgt 
tataagcgaa 
ttcccttgta 
gcactctgga 
atgttaatac 
ttagggcaac 
gtcctgcacc 
catttcaagt 
tttttttttt 
acgactccga 
tgggaatgca 
ctcattagat 
accattggca 
tcccccgttg 
gtaaaagcaa 
atggttaaca 
catccagcta 
gcagtacaca 
tcattacaga 
cagcttagat 
gttat tcgaa 
ccattgaatt 
taaaatcatc 
gcaaactgaa 
aaatctttta 
tttcctcttg 
ctgcaaaga 



tgggtccaga 
atgacagagc 
aaaataccag 
gagcggccaa 
tcaataattg 
tttagacaga 
tgaattaaat 
atgcaattga 
tggtcgcagg 
ccccgaatgc 
aattgccggc 
gtgcgcccgc 
gtggctggtc 
aagtgtgcac 
tccatactcc 
ttcctcctgt 
gtcgccgcat 
tccccggcca 
gacaatcgtc 
tgccaatagt 
gaagtatcct 
tttctgtgta 
tcagcaattc 
accaaaccaa 
tcctgttcct 
ttcagggcag 
ccaaaacaaa 
gcagagaact 
tttagactct 
agttgggctc 
acagctgaaa 
gctgcttttg 
caaacatccc 
aagtaaaatg 
tgcttttttc 
gctggctaaa 
aataagccag 
cggggctctt 
tttcacttga 
gaaaaatgta 
ctcgagcttg 
cacatacata 
cgttgccttc 
gataaaataa 
ctgttgatat 
ggctttcctt 
aatcttttag 
cataaacagc 
ttgggtgcca 
cattgtcacc 
ttcgactgct 
tcgtatttat 



tacgtgtggg 
aatgggcaac 
agttttcatt 
aaataaataa 
tattgcttaa 
agtaaattaa 
tgctgaaaag 
gcaaaggaga 
actttttctg 
aaatgaagcc 
caaagtaatt 
atcacgctgc 
tggtacctgc 
acctggcgaa 
gtgcgaacac 
ccaattactc 
tcatttccga 
tcgtcatatt 
agtaagtgaa 
tatcaaaagt 
agtttaatga 
gctaagatat 
acccactgac 
ctaccaaaac 
gtgcaacatt 
ctgcggtcca 
agaaaaaaaa 
ctcaactggc 
ttcccatcgg 
attttacatt 
aacgaaaaca 
gttcctctgc 
tgggagaata 
ttttcgtttc 
tttccggctt 
aagttatgat 
cattactgga 
ttcaaatcaa 
tctcagttta 
gcaagttgag 
tccctcggcc 
tatttcttgg 
ttgtttgttg 
tttgcatata 
aacagttgcc 
tacaaaattt 
ttcgatctaa 
cctaaaacaa 
ctaagctgca 
tcggtctgag 
aaaagtttaa 
tcactttttt 



4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7659 



<210> 137 
<211> 1365 
<212> DNA 
<213> Drosophila 
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<400> 137 

cggacgcagt 

acattcagtg 

gcggagagcg 

agcggtcaac 

acctttgatg 

tgcccggact 

gatggcgagt 

gtaaatctcg 

ggcacctcac 

attcgcatat 

atgaatctgt 

gtcatgccga 

acgctacact 

cacaccgtcc 

ttcggctggg 

ggcacgcccg 

atggtgcctg 

gtggtccttt 

acggttttgc 

atccttacgc 

tcggcgttta 

ttcgtttttt 

ctcataaaat 



gcagcagttc 
gcagtggcag 
gaccggactt 
tggcgggctc 
gctggctttg 
tcatcacggg 
ggttcaagca 
aagacctcaa 
tgctggccat 
ttgtctttcg 
tcgcctcgtt 
attcggagct 
acttcctcct 
tcgtcgccgc 
gctccccggc 
aggacaatcg 
tgtgtatctc 
tgaaactgaa 
aagcatttcg 
cctttcgtcc 
cggcctcatt 
tttcgttttt 
cactttacga 



cacgacgacc 
tggcagtggc 
cgatgcactt 
cggaggacca 
ttggccggat 
attcgatcca 
tccgctgacc 
ctggaggcac 
tctgctgtcg 
attgaacaga 
cgctgccaat 
actgcatcag 
gtccaattac 
attcatttcc 
catcgtcata 
tcactgctgg 
catgttcctg 
tgccccggcc 
ggcaacgctg 
tgcacccaaa 
tcaagtaagt 
tttttttgct 
ctccgagctg 



atgagcgacc 
accaacgtcg 
cgggccgcct 
ggcgcagaag 
acggctgtcg 
gcaagatatg 
aataaaacat 
acagtgaatc 
ttggccatat 
tccctgaagt 
aactcgttgt 
agtccgatgc 
tcctggatgc 
gagaagaggc 
ttcgtctata 
atgaaccaaa 
aacctcctgt 
agtattcagg 
cttttggttc 
catccctggg 
aaaatgtttt 
tttttctttc 
gctaaaaagt 



agattggcaa 
cctccatcgc 
gcgaaacgcg 
caggaaccca 
gcacttccgc 
cccacaagga 
ggtccaacta 
tgatctccga 
tgggttattt 
gcgcccgcat 
ggctggtctg 
gctgcgttgc 
tctgcgaggg 
tggtcaaatg 
gcatggctcg 
ccaactacca 
tcctgtgcaa 
gcagctgcgg 
ctctgctcgg 
agaatacata 
cgtttcgttt 
cggcttttgt 
tatga 



ccccaatgca 
cgaaagtgtg 
gctaaatgcc 
ctgtgccggc 
ctacgaactc 
atgcggtctc 
cacaacctgc 
ggttggctac 
caataattcg 
cacgctgcac 
gtacctgctg 
cctgcacata 
attctatctg 
gctcatcgca 
cggtctgggc 
aaacattctt 
cattgtgcga 
tccatcgcga 
cctccagtac 
cgaaatcatc 
ctgttctcgt 
tttgcacgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1365 



<210> 138 
<211> 444 
<212> PRT 
<213> Drosophila 



<400> 138 

Met Ser Asp Gin He Gly Asn Pro Asn Ala Thr Phe Ser Gly Ser Gly 

15 10 15 

Ser Gly Ser Gly Thr Asn Val Ala Ser He Ala Glu Ser Val Ala Glu 

20 25 30 

Ser Gly Pro Asp Phe Asp Ala Leu Arg Ala Ala Cys Glu Thr Arg Leu 

35 40 45 

Asn Ala Ser Gly Gin Leu Ala Gly Ser Gly Gly Pro Gly Ala Glu Ala 

50 55 60 

Gly Thr His Cys Ala Gly Thr Phe Asp Gly Trp Leu Cys Trp Pro Asp 
65 70 75 80 

Thr Ala Val Gly Thr Ser Ala Tyr Glu Leu Cys Pro Asp Phe He Thr 

85 90 95 

Gly Phe Asp Pro Ala Arg Tyr Ala His Lys Glu Cys Gly Leu Asp Gly 

100 105 110 

Glu Trp Phe Lys His Pro Leu Thr Asn Lys Thr Trp Ser Asn Tyr Thr 

115 120 125 

Thr Cys Val Asn Leu Glu Asp Leu Asn Trp Arg His Thr Val Asn Leu 

130 135 140 

He Ser Glu Val Gly Tyr Gly Thr Ser Leu Leu Ala He Leu Leu Ser 
145 150 155 160 

Leu Ala He Leu Gly Tyr Phe Asn Asn Ser He Arg He Phe Val Phe 

165 170 175 

Arg Leu Asn Arg Ser Leu Lys Cys Ala Arg He Thr Leu His Met Asn 

180 185 190 

Leu Phe Ala Ser Phe Ala Ala Asn Asn Ser Leu Trp Leu Val Trp Tyr 

195 200 205 

Leu Leu Val Met Pro Asn Ser Glu Leu Leu His Gin Ser Pro Met Arg 
210 215 220 
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Cys 


Vdi 




Leu 


HIS 


Tl fl 

i le 


i nr 


Le u 


n 1 S 


Tyr 


rne 


Leu 


Leu 


oer 


Asn 


iyr 


ZZO 










£. J u 










z. J J 












Ser 


Trp 


Met 


Leu 


Cys 


GlU 


biy 


rne 


Tyr 


Leu 


LI -J c- 

HIS 


i nr 


val 


Leu 


val 


Ala 










24 5 










Z O U 










zoo 




Ala 


Phe 


lie 


Ser 


Glu 


Lys 


Arg 


Leu 


val 


Lys 


Trp 


Leu 


lie 


Ala 


rne 


biy 








260 










1 c c 
Z DD 










Z IU 






Trp 


Gly 


Ser 


Pro 


Ala 


He 


Val 


He 


Phe 


Va 1 


Tyr 


Ser 


Met 


Ala 


Arg 


biy 






275 










280 










"~i o c 

Zoo 








Leu 


Gly 


Gly 


Thr 


Pro 


Glu 


Asp 


Asn 


Arg 


His 


Cys 


Trp 


Met 


Asn 


Gin 


Thr 




290 










295 










3U0 










Asn 


Tyr 


Gin 


Asn 


He 


Leu 


Met 


Val 


Pro 


Val 


Cys 


He 


Ser 


Met 


Phe 


Leu 


305 










310 










315 










320 


Asn 


Leu 


Leu 


Phe 


Leu 


Cys 


Asn 


He 


Val 


Arg 


Val 


Val 


Leu 


Leu 


Lys 


Leu 










325 










330 










335 




Asn 


Ala 


Pro 


Ala 


Ser 


He 


Gin 


Gly 


Ser 


Cys 


Gly 


Pro 


Ser 


Arg 


Thr 


Val 








34 0 










34 5 










350 






Leu 


Gin 


Ala 


Phe 


Arg 


Ala 


Thr 


Leu 


Leu 


Leu 


Val 


Pro 


Leu 


Leu 


Gly 


Leu 






355 










360 










365 








Gin 


Tyr 


He 


Leu 


Thr 


Pro 


Phe 


Arg 


Pro 


Ala 


Pro 


Lys 


His 


Pro 


Trp 


Glu 




J / u 










j i o 










JOU 










Asn 


Thr 


Tyr 


Glu 


He 


He 


Ser 


Ala 


Phe 


Thr 


Ala 


Ser 


Phe 


Gin 


Val 


Ser 


385 










390 










395 










400 


Lys 


Met 


Phe 


Ser 


Phe 


Arg 


Phe 


Cys 


Ser 


Arg 


Phe 


val 


Phe 


Phe 


Ser 


Phe 










405 










410 










415 




Phe 


Phe 


Phe 


Ala 


Phe 


Phe 


Phe 


Pro 


Ala 


Phe 


Val 


Leu 


His 


Ala 


Leu 


He 








420 










425 










430 






Lys 


Ser 


Leu 


Tyr 


Asp 


Ser 


Glu 


Leu 


Ala 


Lys 


Lys 


Leu 











435 



440 



<210> 139 
<211> 4185 
<212> DNA 
<213> Drosophila 



<400> 139 

attcgttcgg 

atgatcaaca 

cgctcgggaa 

gtaagcccac 

atttgaacaa 

gtatggctaa 

tatttgctgt 

cgcaaagcgt 

gtcaagtttg 

gctacttaat 

gctgctcgag 

aaccggaaac 

tagaattaga 

atatacgtgg 

cggtgaatca 

atgaatatgt 

ctgtatggtg 

tgactgtggg 

caactatatg 

agagttaagg 

ttcttctcaa 

gtattttttt 

gattagctct 

caaagcgata 



tattttcgca 
atattcaacg 
actgaatttg 
caaatcgaag 
cgatcgaaac 
gaactactgg 
gttttatatt 
gcgaagattg 
gcaaattgtt 
atggactttg 
tagcatatga 
ggaaatgttg 
tcgctgctac 
taagctcata 
atcataattg 
ctttcaacgt 
gctgagtggt 
gtggtgcttt 
gcaaacaggc 
aaatggctca 
agggattacg 
ttgaggagaa 
gcttgcaaac 
cggaattgcg 



tttggcatgt 
acaaacactc 
acccactaag 
cgttcagggg 
ttcacactag 
aaatgtcttt 
cgtttagtgg 
aagattttcc 
taataaacgc 
aaaaaggttt 
ttttcgtata 
ctcttgccta 
atatgtatat 
aatacacaat 
gatacgctaa 
gtctgtggtt 
tgagtggttg 
cattggcatg 
aagtaagtgt 
atgcaattac 
gaggatgggt 
tgaaaatcta 
ggcccttagt 
gcgacatact 



ttgtggctaa 
gacttgggac 
ggaactaggc 
cggccaaaag 
cctatcattt 
ttgggcttat 
gccttggttt 
gaattttgaa 
tacacgattt 
atacttatgc 
ttgtttttgt 
gtgaattgtc 
gtatatggct 
aataatcgtc 
aaactcttag 
ttggtgtttt 
cgtttttagt 
tcgcgtagcg 
ttatatatat 
aaccaattat 
ttgggagggg 
aatctaactc 
cttacaacaa 
ggtcatgatc 



gatatcatca 
cctggatgct 
aaaattgcaa 
gaaaatgaat 
cagtatcata 
ttgctttttg 
ttcgtttctt 
ttctcatatt 
cattttgtaa 
aatctttaga 
tgtagcaagt 
tcttgtctac 
agagttaagt 
aatcaattca 
cttagcttag 
tggtgtttct 
tttcagtttt 
ccaaacggcc 
tgagagagag 
ttacttgcag 
gatttttacg 
tagtcatctg 
atggttaagc 
agaattttcc 



attcaacaca 
tcatcggcgg 
cgctctgttg 
cattctaccg 
gttgtatgtt 
gattttaaca 
aagcgcaatt 
tcggcccgtc 
aattctaaat 
taataatcct 
tcgaattgta 
gaattagaat 
acaagtatat 
atcaatcaat 
gtctagaaat 
gtgtggtttg 
tctgtggctg 
cgtaaatatg 
agatatatag 
gactttgtag 
gataattatt 
ggtccaaccc 
aaaatactct 
aaaacgaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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caatatgccg 
gaccacctcg 
taatgcgaat 
ttctttatcg 
gtagttgtgt 
atatgctgcc 
catcgagtag 
catccagcat 
aatccagatg 
ccgtttcata 
aggtgtatac 
tgtcgccgta 
gtccgcagag 
gtcctgcgat 
gcaaatgcaa 
cgacattcac 
tttataggat 
ggtttttgaa 
gggaggatgc 
aagtataaac 
cttatgaaaa 
aaagtgttat 
taatgaggag 
tgagggcctc 
ggcactgtcg 
tccgtttccc 
gctttctccg 
gaccgtactt 
tgtgtgtcgg 
tgtgtgtgtg 
acacgcgaag 
ccaaatttgt 
ccaccttttg 
taaggttaag 
ttaaaatgct 
tttctatctt 
acgctctttg 
ctactagttt 
atgattatta 
tttcggggtt 
tgttttgtaa 
cggttgttaa 
ttgcttgtaa 
cgttttaagc 
gcttcataaa 
cggcagcagg 



atccagacga 
tacagacgca 
gacatcatat 
tgtatcggct 
acgattgaaa 
atattccccc 
cccaggattg 
tggcatctgc 
aggaggaatt 
tctgagtggc 
cgataccgcc 
catccattct 
cagatcggcg 
gacagcaata 
acaataatat 
tagaactatt 
aaaagtttag 
ttttttcaaa 
cgtgttactt 
attggactta 
tattatgctt 
gtgagtgtga 
gttggacagg 
caccttggaa 
gttgaaatgg 
tcctccttgg 
caccttcttg 
tacttccggt 
tgtgtctgtg 
tgtctgggtg 
gtcgtcggag 
ataaattttc 
gcgccgagcc 
gataaggttg 
gcttgggcta 
agaggatcgt 
ctaagtttta 
taatatcgat 
atttaactat 
agttgaaata 
ggcttctgtt 
gtatcggttt 
attctggctc 
taattgtgcg 
tagtttacag 
cggaaaaagc 



cggcgaagtt 
acagaatcca 
gactagggtt 
cgccggacct 
tgaggctggg 
agtccagcat 
ggggacagca 
acgataaagg 
cggggttcca 
ttccgcaccg 
cacagagtga 
ccggtcaaag 
atggccaggg 
tcgttagaaa 
ttaaatggat 
tagaaaccta 
gtacaaaaag 
gcaattaaga 
cattcctttc 
agtggtcaaa 
gaggccaaac 
tattcgtgca 
acgatggcca 
ttatcctgta 
gcgctcctgt 
ctcctcctgc 
gcgggctgat 
catggcggac 
agtgtctatg 
cgagtgttgg 
tccttggcgc 
tactgcctac 
aagcatttga 
tgctggtgtg 
aaatcttaca 
taatgattta 
atgaacgtca 
ttaaaaatgt 
ttaatgtgta 
tctttcgact 
gtgctacatt 
tgctatttaa 
ttgttgctgt 
catctgaaag 
aatcttccat 
cttttagctg 



gataagagta 
tggcagccag 
cgataaattt 
aatcttacgc 
acctaagact 
gcaaatgtgc 
cagcagggcc 
tcacaagaga 
ttggcgctac 
caagatagcg 
cctccagata 
cgggatacac 
acaagaggta 
acgaagtaga 
cattcaataa 
aggttcttaa 
gggaaagaaa 
caacaagaaa 
actttcacat 
ataaattata 
ccaaccgcga 
acgtaccctt 
cgcccagtat 
ggtagctcat 
gctcgttgta 
tttcacctgc 
tgttatgcaa 
gcccatgcaa 
tgtgcttagg 
tggcgtcctt 
tgttttttcg 
ttttatttgg 
ttgttaccct 
tgtgtgtcta 
cgtaaaatct 
aagaattcgt 
aaactgttag 
acacatttca 
ataattacgc 
ataattaaag 
taagtttgtt 
tttcgctgct 
tgttggtgtt 
gacaaaggat 
cttgagccaa 
cacaaacgct 



ctttggataa 
gacacccaga 
tcagccagag 
atcattacaa 
aatatcgcca 
gtcatgttgt 
gccgatatcc 
tcaaagatta 
tcacctcgtt 
atccactgat 
gccggtgaag 
ggagaagggc 
gtagttgatc 
ggtgtattca 
tttgggcgag 
atttatccaa 
taatatgctt 
acaacacatt 
caaaatcatg 
ttcaacacat 
gattcgagat 
gaagttggca 
cgatatgagc 
ttcctgcatt 
tacgctccct 
tttcgctgct 
aggccagtgg 
ttgtcctgtg 
tgagtgtgtc 
tattgtgcaa 
tcctttctcc 
gctgcaactt 
cacaatcctg 
tgtgtgtgtc 
agtaattgcg 
tttgttttgc 
ttgtatgctt 
aactcgcaag 
ggttttcagg 
cacagcacaa 
aaaatagttc 
tggtgattgt 
taaactccac 
agcaattgta 
tcaataaatt 
ttaca 



catcgcccgt 
atgcaaagac 
cagttgcata 
agatatagcc 
aggtggcgct 
tctcgatggg 
aagtgaacac 
atccagtttt 
ttcgtctgca 
atccacatga 
cgacacacga 
accaccagga 
acctctgtgg 
aaaataatag 
tctatttgta 
taacaatcaa 
aagcacaatt 
tgaatttttg 
gatgcatttt 
atgtaggtta 
gttcctgtgg 
taggtggcga 
accgccttgg 
tttacgggct 
tagctccttg 
gctttcaccc 
acgtggacgt 
ccttcggttg 
gatgtgtgag 
cgaggcttgc 
aacgttggcg 
ctactactgc 
gcgtcgtggt 
tgaagctcgt 
tacaatttgg 
taaattcaca 
gaataattcg 
tggaagaata 
gctctttttc 
atggttttgt 
gataatagtt 
gattatcctt 
gtcctgctgt 
tgcctttagc 
atcccacata 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4185 



<210> 140 
<211> 1584 
<212> DNA 
<213> Drosophila 



<400> 140 

acacactcac 

acacacaacc 

cggtcacgtc 

aaagcgggtg 

acggacaagg 

gtgccagccc 

cctcaccaag 



acatcgacac 
gaaggcacag 
cacgtccact 
aaagcagcag 
agctaaggga 
gtaaaaatgc 
gcggtgctca 



actcacctaa 
gacaattgca 
ggcctttgca 
cgaaagcagg 
gcgtatacaa 
aggaaatgag 
tatcgatact 



gcacacatag 
tgggcgtccg 
taacaatcag 
tgaaagcagg 
cgagcacagg 
ctacctacag 
gggcgtggcc 



acactcacag 
ccatgaccgg 
cccgccaaga 
aggagccaag 
agcgcccatt 
gataattcca 
atcgtcctgt 



acacaccgac 60 
aagtaaagta 120 
aggtgcggag 180 
gaggagggaa 240 
tcaaccgaca 300 
aggtggaggc 360 
ccaacctcct 420 
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cattatcgcc acctatgcca acttcaaggg acccacagag gtgatcaact actacctctt 480 

gtccctggcc atcgccgatc tgctctgcgg actcctggtg gtgcccttct ccgtgtatcc 540 

cgctttgacc ggagaatgga tgtacggcga catcgtgtgt cgcttcaccg gctatctgga 600 

ggtcactctg tgggcggtat cggtatacac cttcatgtgg atatcagtgg atcgctatct 660 

tgcggtgcgg aagccactca gatatgaaac ggtgcagacg aaaacgagat gccaatgctg 720 

gatggtgttc acttggatat cggcggccct gctgtgctgt cccccaatcc tgggctactc 780 

gatgcccatc gagaacaaca tgacgcacat ttgcatgctg gactggggga atatggcagc 840 

atatagcgcc accttggcga tattagtctt aggtcccagc ctcatttcaa tcgtacacaa 900 

ctacggctat atctttgtaa tgatgcgtaa gattaggtcc ggcgagccga tacacgataa 960 

agaatatgca actgctctgg ctgaaaattt atcgaaccct agtcatatga tgtcattcgc 1020 

attagtcttt gcattctggg tgtcctggct gccatggatt ctgttgcgtc tgtacgaggt 1080 

ggtcacgggc gatgttatcc aaagtactct tatcaacttc gccgtcgtct ggatcggcat 1140 

attgaattcg ttttggaaaa ttctgatcat gaccagtatg tcgccgcaat tccgtatcgc 1200 

tttgagagta ttttgcttaa ccatttgttg taagactaag ggccgtttgc aagcagagct 1260 

aatcgggttg gacccagatg actagagtta gatttagatt ttcattctcc tcaaaaaaaa 1320 

atacaataat tatccgtaaa aatccccctc ccaaacccat cctccgtaat ccctttgaga 1380 

agaactacaa agtcctgcaa gtaaataatt ggttgtaatt gcattgagcc atttccttaa 1440 

ctctctatat atctctctct ctcaatatat ataaacactt acttgcctgt ttgccatata 1500 

gttgcatatt tacgggccgt ttggcgctac gcgacatgcc aatgaaagca ccaccccaca 1560 

gtcacagcca cagaaaaact gaaa 1584 



<210> 141 
<211> 322 
<212> PRT 
<213> Drosophila 



<400> 141 



Met 


Gin 


Glu 


Met 


1 








Thr 


Lys 


Ala 


val 








20 


Asn 


Leu 


Leu 


He 






35 




Val 


He 


Asn 


Tyr 




50 






Gly 


Leu 


Leu 


Val 


65 








Trp 


Met 


Tyr 


Gly 


Thr 


Leu 


Trp 


Ala 








100 


Arg 


Tyr 


Leu 


Ala 






115 




Lys 


Thr 


Arg 


Cys 




130 






Leu 


Leu 


Cys 


Cys 


145 








Asn 


Met 


Thr 


His 


Ser 


Ala 


Thr 


Leu 








180 


Val 


His 


Asn 


Tyr 






195 




Gly Glu 


Pro 


He 




210 






Leu 


Ser 


Asn 


Pro 


225 








Trp 


Val 


Ser 


Trp 


Thr 


Gly 


Asp 


Val 



Ser Tyr Leu Gin 
5 

Leu He Ser He 

He Ala Thr Tyr 
40 

Tyr Leu Leu Ser 
55 

Val Pro Phe Ser 
70 

Asp He Val Cys 
85 

Val Ser Val Tyr 

Val Arg Lys Pro 
120 

Gin Cys Trp Met 
135 

Pro Pro He Leu 
150 

He Cys Met Leu 
165 

Ala He Leu Val 

Gly Tyr He Phe 
200 

His Asp Lys Glu 
215 

Ser His Met Met 
230 

Leu Pro Trp He 
245 

He Gin Ser Thr 



Asp Asn Ser Lys 
10 

Leu Gly Val Ala 
25 

Ala Asn Phe Lys 

Leu Ala lie Ala 
60 

Val Tyr Pro Ala 
75 

Arg Phe Thr Gly 
90 

Thr Phe Met Trp 
105 

Leu Arg Tyr Glu 

Val Phe Thr Trp 
140 

Gly Tyr Ser Met 
155 

Asp Trp Gly Asn 
170 

Leu Gly Pro Ser 
185 

Val Met Met Arg 

Tyr Ala Thr Ala 
220 

Ser Phe Ala Leu 
235 

Leu Leu Arg Leu 
250 

Leu He Asn Phe 



Val Glu Ala Leu 
15 

He Val Leu Ser 
30 

Gly Pro Thr Glu 
45 

Asp Leu Leu Cys 

Leu Thr Gly Glu 
80 

Tyr Leu Glu Val 
95 

He Ser Val Asp 
110 

Thr Val Gin Thr 
125 

He Ser Ala Ala 

Pro He Glu Asn 
160 

Met Ala Ala Tyr 
175 

Leu lie Ser He 
190 

Lys He Arg Ser 
205 

Leu Ala Glu Asn 

Val Phe Ala Phe 
240 

Tyr Glu Val Val 
255 

Ala Val Val Trp 
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260 

He Gly He Leu Asn Ser 
275 

Ser Pro Gin Phe Arg He 
290 

Cys Lys Thr Lys Gly Arg 
305 310 
Asp Asp 



265 

Phe Trp Lys He Leu 
280 

Ala Leu Arg Val Phe 
295 

Leu Gin Ala Glu Leu 
315 



270 

He Met Thr Ser Met 
285 

Cys Leu Thr He Cys 
300 

He Gly Leu Asp Pro 
320 



<210> 142 
<211> 4900 
<212> DNA 
<213> Drosophila 



<400> 142 

taaatatgca 

tgcgaaatga 

cttcgacgac 

gtgtcagatg 

gtaatgtata 

ttctatgaca 

gtaattataa 

ttttaaccat 

gggtccgcat 

catttggaca 

tcgctttgac 

ttgataataa 

tagtattgat 

attttcactt 

atttcgatga 

ataattctaa 

agcacatata 

tcaaatggga 

ctatcatctc 

gaagaacatg 

ctggttgcga 

tgattacaac 

gactaatgat 

gatgaatgta 

taaagttata 

aatctcggtc 

aaattatatt 

tcagtcgtaa 

aataccgaga 

caataaatca 

atagtttacc 

tttttttttt 

cagttgagtt 

gggagtgcat 

ttttatgttg 

gcaaacatct 

aatatactta 

acatcgtatc 

taatagtctt 

tattataaag 

tatattttgt 

tttagttgag 

ttcctttctt 



tattgaacga 
gttgcgaaca 
gtgtggacac 
ccaacattac 
attgggctgc 
tcacgacatt 
tataatataa 
attctcaaat 
ttgccaatta 
ggtttgctca 
gagttgcagt 
caaagtaatg 
tttatacata 
taatggtcct 
caagaagtcc 
gcaatggttt 
catcaaaaga 
cagacccata 
acgagcaccg 
acccctcagg 
gcctacaaca 
tccccctggg 
gagcaaggga 
ggggacttct 
gcatgcgctc 
ttttgtttat 
agatgtagtt 
gtcattttgc 
atccatttcc 
aatacagttg 
tcttaaattg 
tttttgcttg 
gctgctcacg 
tcagttgaaa 
attctttctt 
tgaacataat 
agtggctggc 
agtatatata 
ggttatagct 
gtatactctt 
taacaatttt 
taacgagtat 
ttttaatttt 



ggcaatgcaa 
ggatttatat 
gtcgactttg 
ttatgcactg 
atgacatgca 
catacggaca 
taaagtatct 
gcaaaatgtg 
attgcctaca 
gctcactggc 
agcaatttcg 
ataaattaga 
aatgacacaa 
gtaaaacaca 
tttaaaattc 
atgtatttaa 
gcggaactag 
tgcaccagcg 
atatcgagaa 
tggatgagtg 
gcatccatgg 
cacgaaaccc 
agccaagacg 
ttacctttga 
agcttaacca 
gtccgggttt 
aagccgttga 
gttgtagcag 
atcaaggcta 
aaaaatgtaa 
aaataaaatt 
tgtgagcaat 
agctgaataa 
gtatgtgagc 
gtttctttaa 
ggtcctgacc 
ggtggccgat 
tatggggcca 
ttacttgtta 
gatcaggatc 
gttaacaatt 
gtggtagtcg 
aagccgttta 



tgcgattgag 
tcagatttag 
attcggccaa 
aaagaaataa 
tgataatgat 
gaaagacgaa 
aaagaatttt 
aagttcttgc 
acgcccaggt 
agctccactt 
acaccgtatg 
tgctgttgac 
tttccctttt 
agttagttgt 
aaataaatat 
actccttaaa 
atagtctaca 
ggttcctttc 
gtttgtaaag 
ccagggctat 
atacgtttcg 
gcaatgttgg 
atacacacag 
actcttgcca 
tcaaaatcac 
cccttttcag 
ggcgttaatg 
acggccttta 
gttcatagcc 
ctttattgct 
ctgaaccaaa 
gttaagaaat 
ataaaatatt 
gctcacagtt 
atcacctaga 
agaaaggaaa 
atgtttgtga 
ggtaagcata 
tttatttatt 
tacacacgtc 
cccatatatc 
aggcaatcgg 
aatttttttg 



gtccggctgt 
ccattagcca 
ctgagtgatc 
ataaaggatt 
caactttcaa 
catggattta 
ttacttggat 
aaatagatac 
tgctttgaat 
tggccattca 
cccataatga 
gtctaattca 
attttgtaaa 
aaagcacaga 
ataattctaa 
ttttgttcca 
atgatgcaag 
aacgacacgg 
ctatggcagg 
tgccaggggg 
ctgatggtaa 
ccaaaacaaa 
accttcgccg 
aagtcgtcct 
aaatgaaatt 
ggcttttttg 
tgcttttttc 
ccacaggaat 
catcttttca 
ttccttcgtt 
cttcggttac 
gtaaatacga 
catccacttt 
tcgatttgac 
tttgcatttt 
tggccaagac 
tgctggaata 
catttagtaa 
agtgttatca 
tgtctgtccg 
ctttcccctt 
ctatgacctt 
aattatttaa 



gcgtttcagt 
aatgcaacac 
cggggtagta 
tacatgtgta 
gcaataataa 
ccattaaaat 
ctcctagcaa 
acagactaat 
cacaggtgga 
aaattgttta 
agaaacggca 
tttttgatga 
acattatggc 
aaaactttcc 
ttctaaataa 
gtttttaact 
agaccggcaa 
ttctgaagga 
agtaccagat 
aaatatacaa 
gtaatactcg 
ggaatcggtt 
acagaccact 
tttttcgaat 
agtggccggg 
aaaaggatat 
attataattt 
tgggtcagaa 
aataaatata 
cgcacaaaaa 
tgtagttcca 
gaggtcacga 
ggtgtccaag 
aaaggacgca 
cgggacaata 
gcccataaat 
tataccctat 
taatatttaa 
gctagattat 
tttctaaggg 
tgcacttact 
ctctgtaggt 
tattctttgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 
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atggctgttg 
atgcacttca 
tggcggtatg 
gaattcgccg 
gcaattcgaa 
cttctcttcc 
cgaacggggg 
tcatcctgtc 
acgacgtgga 
tgctctacag 
aattgtaaaa 
gctcatcaaa 
tttgtgcgag 
tgccatccag 
ttagtgctta 
tttctgtttt 
cttttttata 
aacgatcgga 
ccgcagggat 
tatccgccgt 
gtgctgtacg 
cggcactttg 
ttgggcacat 
gtgagggaca 
tttataaaag 
attgtacggg 
cctaaggcgt 
gtttcgcaaa 
ttcaaattcg 
atacgagaca 
aaactgtagc 
ttagctggaa 
agtgccaatg 
gcttatacta 
gcatacaaat 
gacaacagca 
tcccaaatca 
gatgcgcagc 
gataaaatct 



caggtgaaaa 
cccagatcct 
tggctatcag 
tcatccatgt 
cagacatccg 
cccgtctttt 
ctgtgccaat 
gccgatcctc 
caaggtcaac 
gtgcgcttag 
tatttcatcg 
cttctgccgt 
gcatcgagac 
cttaatctga 
agaattcgcc 
tgaatggtct 
aatttgtctt 
ctaccctttt 
tacttggtct 
ttggagagct 
gtctgatgtc 
gcagcaccga 
ctaaagatgt 
tccaagatgt 
tttcgttgac 
ctgaccagtt 
tgactggtgc 
gtgccatttc 
tattagtata 
tttacttttc 
catgctaaga 
tcacgaaaga 
cattttaaat 
tttataaata 
gaaacgttta 
ttcacctcca 
gactcgccca 
agggatttca 
gctctcgagg 



agatttgagc 
gcatacgatt 
gtgagatgat 
agagtttgtc 
gctaatctga 
gtttatagcc 
ttcctgctag 
tgcctgccca 
tcggaggcca 
ataagaagtc 
tgaaaatcct 
gcggcatatt 
gtcgcttgaa 
atgaaaccaa 
tcatttgcag 
cggattttgt 
tttgtttttg 
gctggttgcg 
gctgtccgga 
gatggatctg 
gaaacagttc 
aatgaccaga 
gatcatgggc 
tccggcccgg 
taaacaaagc 
gatcagttga 
atcgatcggt 
atttccaaat 
taagtttcat 
cataagatgc 
ggactgaaga 
attctatttt 
tggctaattg 
aatgtgaaca 
ttgctttcca 
gtggcaattc 
tatccgcgga 
gcttggtggc 
gagtttcgac 



tacacctggg 
tccatcggat 
ccttcggcca 
tccggttcac 
ttaccacatc 
aaatgttacc 
cacattcccg 
cgtactttgt 
tgtatcatgt 
tgatgtcatt 
taacagattc 
aattgtgatc 
gctgagggat 
atccaagaag 
tttgccttgt 
caaatctctt 
tatcttaggc 
gtactggttt 
gtgatggaga 
ctggcactga 
cggaccacct 
ttgacccgcg 
aggagcaaac 
cattgaattt 
ctaaatatga 
tttccacaat 
ttagaaattg 
aattacattg 
aggctaagtc 
agaaaaaaaa 
actcactgag 
gattgtatgg 
aataaccata 
ctttcaaaaa 
ttaccgaaaa 
gtccggcaga 
atccagcagg 
ttggaatttg 
cataatgtaa 



cggtttgtct 
tgactgtcac 
gctatgcatg 
attcatgata 
attgtggcaa 
gtctctattc 
ggaggcgatc 
gttccaggtg 
ttatttcgac 
tatgcatcct 
aacttctgga 
agtgcagtac 
tacaacaatc 
taagtacgca 
tagtttcatg 
cttaggttcc 
cgcctcgctg 
tgtttctgat 
agtgcttctt 
tcaacgccgc 
tccgatcgct 
taaccacgac 
actagtgagg 
gaattttcta 
gaaactgtga 
ggtttacttt 
attctgcttg 
attagtttac 
agctgtacaa 
acaggaatcc 
ccccgagttg 
cgttaactgg 
ttttgagtac 
aaatatgttt 
atgaacatct 
agttccgtgg 
atgccattgc 
tactgcagga 



gctggttcac 
gctggcggtg 
acttattccg 
atccacatcc 
tgtggcttct 
cacagacatc 
ctcctgccct 
cgtgagacgt 
aaggactcgg 
gcgaagaacc 
tacattccgt 
ttatgcacgt 
ccgccaaata 
cttgactcgt 
atctttgatt 
tttttgccat 
tgatcgccgt 
cacggagttt 
cgcccactgc 
cgtcggattc 
gttcatgaag 
ttgcgtctag 
gatgaagtag 
aagcataaac 
gtgtggaatt 
taggtgtagt 
tgatctggct 
cagttggtgt 
agctgtacaa 
atcatcagca 
ctctctcttc 
ttttcaacaa 
tttttaagtc 
gatgtgagtt 
aattctgact 
cccgattgtg 
cctgattcga 
taatgagatt 



<210> 143 
<211> 1245 
<212> DNA 
<213> Drosophila 



<400> 143 

atgatgcaag 

aacgacacgg 

ctatggcagg 

tgccaggggg 

ctgatgattt 

aaggaaatgg 

tttgtgatgc 

gaaaaagatt 

atcctgcata 

atcagacatc 

ctcctgccct 

cgtgagacgt 

aaggactcgg 

ccgtgcggca 



agaccggcaa 
ttctgaagga 
agtaccagat 
aaatatacaa 
gcattttcgg 
ccaagacgcc 
tggaatatat 
tgagctacac 
cgatttccat 
cgaacggggg 
tcatcctgtc 
acgacgtgga 
tgctctacag 
tattaattgt 



tcaaatggga 
ctatcatctc 
gaagaacatg 
ctggttgcga 
gacaatagca 
cataaataat 
accctataca 
ctgggcggtt 
cggattgact 
ctgtgccaat 
gccgatcctc 
caaggtcaac 
attcaacttc 
gatcagtgca 



cagacccata 
acgagcaccg 
acccctcagg 
gcctacaaca 
aacatcttga 
atacttaagt 
tcgtatcagt 
tgtctgctgg 
gtcacgctgg 
ttcctgctag 
tgcctgccca 
tcggaggcca 
tggatacatt 
gtacttatgc 



tgcaccagcg 
atatcgagaa 
tggatgagtg 
gcatccatgg 
acataatggt 
ggctggcggt 
atatatatat 
ttcacatgca 
cggtgtggcg 
cacattcccg 
cgtactttgt 
tgtatcatgt 
ccgtgctcat 
acgttttgtg 



ggttcctttc 
gtttgtaaag 
ccagggctat 
atacgtttcg 
cctgaccaga 
ggccgatatg 
ggggccaggt 
cttcacccag 
gtatgtggct 
ggaggcgatc 
gttccaggtg 
ttatttcgac 
caaacttctg 
cgaggcatcg 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4900 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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agacgtcgct tgaagctgag ggattacaac aatcccgcca aatatgccat ccagcttaat 900 

ctgaatgaaa ccaaatccaa gaagccgcct cgctgtgatc gccgtaacga tcggactacc 960 

cttttgctgg ttgcggtact ggttttgttt ctgatcacgg agtttccgca gggattactt 1020 

ggtctgctgt ccggagtgat ggagaagtgc ttcttcgccc actgctatcc gccgtttgga 1080 

gagctgatgg atctgctggc actgatcaac gccgccgtcg gattcgtgct gtacggtctg 1140 

atgtcgaaac agttccggac caccttccga tcgctgttca tgaagcggca ctttggcagc 1200 

accgaaatga ccagattgac ccgcgtaacc acgacttgcg tctag 1245 



<210> 144 
<211> 414 
<212> PRT 
<213> Drosophila 



<400> 144 



Met 


Met 


Gin 


Glu 


Thr 


Gly 


Asn 


Gin 


Met 


Gly 


Gin 


Thr 


His 


Met 


His 


Gin 


1 








5 










10 










15 




Arg 


Val 


Pro 


Phe 


Asn 


Asp 


Thr 


Val 


Leu 


Lys 


Asp 


Tyr 


His 


Leu 


Thr 


Ser 








20 










25 










30 






Thr 


Asp 


He 


Glu 


Lys 


Phe 


Val 


Lys 


Leu 


Trp 


Gin 


Glu 


Tyr 


Gin 


Met 


Lys 






35 










40 










45 








Asn 


Met 


Thr 


Pro 


Gin 


Val 


Asp 


Glu 


Cys 


Gin 


Gly 


Tyr 


Cys 


Gin 


Gly 


Glu 




50 










55 










60 










He 


Tyr 


Asn 


Trp 


Leu 


Arg 


Ala 


Tyr 


Asn 


Ser 


He 


His 


Gly 


Tyr 


Val 


Ser 


65 










70 










75 










80 


Leu 


Met 


He 


Cys 


He 


Phe 


Gly 


Thr 


He 


Ala 


Asn 


He 


Leu 


Asn 


He 


Met 










85 










90 










95 




Val 


Leu 


Thr 


Arg 


Lys 


Glu 


Met 


Ala 


Lys 


Thr 


Pro 


He 


Asn 


Asn 


He 


Leu 








100 










105 










110 






Lys 


Trp 


Leu 


Ala 


Val 


Ala 


Asp 


Met 


Phe 


Val 


Met 


Leu 


Glu 


Tyr 


He 


Pro 






115 










120 










125 








Tyr 


Thr 


Ser 


Tyr 


Gin 


Tyr 


He 


Tyr 


Met 


Gly 


Pro 


Gly 


Glu 


Lys 


Asp 


Leu 




130 










135 










140 










Ser 


Tyr 


Thr 


Trp 


Ala 


Val 


Cys 


Leu 


Leu 


Val 


His 


Met 


His 


Phe 


Thr 


Gin 


145 










150 










155 










160 


He 


Leu 


His 


Thr 


He 


Ser 


He 


Gly 


Leu 


Thr 


Val 


Thr 


Leu 


Ala 


Val 


Trp 










165 










170 










175 




Arg 


Tyr 


Val 


Ala 


He 


Arg 


His 


Pro 


Asn 


Gly 


Gly 


Cys 


Ala 


Asn 


Phe 


Leu 








180 










185 










190 






Leu 


Ala 


His 


Ser 


Arg 


Glu 


Ala 


He 


Leu 


Leu 


Pro 


Phe 


He 


Leu 


Ser 


Pro 






195 










200 










205 








He 


Leu 


Cys 


Leu 


Pro 


Thr 


Tyr 


Phe 


Val 


Phe 


Gin 


Val 


Arg 


Glu 


Thr 


Tyr 




210 










215 










220 










Asp 


Val 


Asp 


Lys 


Val 


Asn 


Ser 


Glu 


Ala 


Met 


Tyr 


His 


Val 


Tyr 


Phe 


Asp 


225 










230 










235 










240 


Lys 


Asp 


Ser 


Val 


Leu 


Tyr 


Arg 


Phe 


Asn 


Phe 


Trp 


He 


His 


Ser 


Val 


Leu 










245 










250 










255 




He 


Lys 


Leu 


Leu 


Pro 


Cys 


Gly 


He 


Leu 


He 


Val 


He 


Ser 


Ala 


Val 


Leu 








2 60 










265 










270 






Met 


His 


Val 


Leu 


Cys 


Glu 


Ala 


Ser 


Arg 


Arg 


Arg 


Leu 


Lys 


Leu 


Arg 


Asp 






275 










280 










285 








Tyr 


Asn 


Asn 


Pro 


Ala 


Lys 


Tyr 


Ala 


He 


Gin 


Leu 


Asn 


Leu 


Asn 


Glu 


Thr 




290 










295 










300 










Lys 


Ser 


Lys 


Lys 


Pro 


Pro 


Arg 


Cys 


Asp 


Arg 


Arg 


Asn 


Asp 


Arg 


Thr 


Thr 


305 










310 










315 










320 


Leu 


Leu 


Leu 


Val 


Ala 


Val 


Leu 


Val 


Leu 


Phe 


Leu 


He 


Thr 


Glu 


Phe 


Pro 










325 










330 










335 




Gin 


Gly 


Leu 


Leu 


Gly 


Leu 


Leu 


Ser 


Gly 


Val 


Met 


Glu 


Lys 


Cys 


Phe 


Phe 








340 










345 










350 






Ala 


His 


Cys 


Tyr 


Pro 


Pro 


Phe 


Gly 


Glu 


Leu 


Met 


Asp 


Leu 


Leu 


Ala 


Leu 






355 










360 










365 
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lie Asn Ala Ala Val Gly Phe Val Leu Tyr Gly Leu Met Ser Lys Gin 

370 375 380 

Phe Arg Thr Thr Phe Arg Ser Leu Phe Met Lys Arg His Phe Gly Ser 

385 390 395 400 

Thr Glu Met Thr Arg Leu Thr Arg Val Thr Thr Thr Cys Val 
405 410 



<210> 145 
<211> 4116 
<212> DNA 
<213> Drosophila 

<400> 145 

gttaaatgat ttacaaacaa gccgacatct cgggaaccat taaagagaca tttaaagtca 60 
aaaagtggcc aactgaaacc cgttaaaatc cgcttgcgac cgcaacttgg cccgactaag 120 
gcaaaaaata acataatcct gtcaagtcat gcagacaggg cgatttacat tataatttta 180 
aatggcaatg gattagtcat catcagtacc tgcttatctg gttttcctga tttttaaatc 240 
aaatctgtga ctgtatctct tacctttcct cttgtacctc gttattataa aatgcgaaaa 300 
ccctctgcca aaggacacta taaaacgaat tttttaaaag acgccaaaca aaatggcact 360 
ttttgttgag aagaaaatat gccgataaga acgttttact tgtaagacgc tgcagaaaga 420 
ttaggaacga aaactatcga caatgtttac ttgtataatc tgagctttct aaaacacaaa 480 
aatagtgtac ttttaccata cgcttaattt cgaaattaat ccgggatata cagttgggga 540 
tccagatttt aaattcgtga acacagctgc tggagtaggg catttcctga tcgatctgtt 600 
agttttcgga cttttacttt ggttgttgta aacaatatcc gcaagtttct cgcaaagtgg 660 
gttgcgttgt gccgtgcctc agtagagcac tacacactac cttcgattga gttcgctggg 720 
atttggattt cgttcacagt ccaagtttgg accgtgctag gagaagataa tgatccgatc 780 
actcaattaa cttcatgcac ccagtgacgt catcagtgct ttactattcg gtgtctatat 840 
agtctatttt gtttaatttc ttacacccta catttcgata cgattcgatt ttcgtttgga 900 
aattgtctga aggtaaacaa gcataaattg gacgttcaag cgtaaacaaa tcggttttcg 960 
gagcaaaata caaaaatcgt cagttgagaa agtctctcaa atggcgcagt tctgcattct 1020 
tggagtcctc ctcattcttt ctggaacaca ctgctcgtgg ggattccatg aagagacaca 1080 
ttatccgtgt gcctttatcg atacggcaaa cataacgggg agctacggct tagatggccc 1140 
atttgtgcac aactggacgg tgattccccg tcacttcgta gccgtgtacg actttgtcat 1200 
cgaaaatggc atccgcatcc ccgcaagcag gcatctcaga gcctgtgtct gtaagaccaa 1260 
accctgtgta cggatttgct gtttacgggg agaaatctac gacctggaga agaggcagtg 1320 
cttagtccct gtggccgggg tgtccagcct tccctcccat agccatatgg aggtggagtt 1380 
gggcaacggc agtctgcgtc tagtgaaact gcaaccccgt tttagcatcc acgtggaaac 1440 
accatgtgag cacatgaaag ccgtaaccaa gggctcggaa tacgttcact ggacacttca 1500 
tgaggtcggt tgtgaaataa accttccatc acattaatgg ctttacatcg ccttttcaga 1560 
acggaaccat cagccaccga ggacacatat tttctaagca ttactgcttc actccgctgc 1620 
tccatgggaa ctccacctgg gagtggcagc cattggcctg tgcaccagaa aaactatatt 1680 
tcgtcttggg ggtgcgggag tggacatacg ccatatgtaa gtagcaagag gaaacatttt 1740 
tccagcaact ttcagcaatt cattatccca aatatgcaca atactgtttt cacaacaata 1800 
cataaaagta attgctaaag caccaaaaaa caatcgatga tagcattaat agtttgatag 1860 
tcagaattta atttacccag cgtattcact ttccaggtct cttgatagct attctatcaa 1920 
tgtttatcgt gctgatggtc taccttatgt gttcggaaat gcgcaacagt ttctacggtg 1980 
tggccataaa ggcgtacgcc atttgtatga tcttgggata tgccctactg gcttatctca 2040 
cgctccacaa cccagcaaat ctctcaaatg cggcttgtcg tattctacgt aagtggtaaa 2100 
tttgagtatt ctgtgatatt taaacattcc cttctttttt ttttttagca agtttggcgt 2160 
tgatgaactt ggtgctttcc ttttacattc tgagctttat tgcatttaag ctgtacttga 2220 
gcttttatgg cgttgtcttc acaaaactga tgttttggct aatatttact cccatcgtac 2280 
tggtagccgt gggctggtct ttctttgtgg gtttcagtta ttacggctct aggctcatct 2340 
tcggaggcga tacatgctgg tttgatcgta agcactgcca atccatatat gttttataat 2400 
aattaattgg tattccacat cgctagcacg caattggtcc gttatgatat atttttacgc 2460 
tcctgttttc gtggcttgcg ccattagcgg tttcttctat gtcctgagcc agatctacat 2520 
ccgtgatcag ccagatatcg agacggagaa gagcttcgag tccatagaga agaaccgatt 2580 
caaatcattt tggaagtact ttggctacac ggcagtcgta tgggtcgttt gcatatgctc 2640 
ctttgccttt aactactact gggagaacag atcccacctt aactacgctg taagcttctg 2700 
catggccttc cacgggtttg cggccctcta tgcgctgatt ggaaagaatc agcaaatcca 2760 
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aaattttctg 
aatatatact 
ggttaacttt 
tcgacaatat 
tttaaatgta 
aaaaaagtac 
atatctgaaa 
caacaatcct 
ataacatatt 
ttatccttca 
atttcaagtc 
gatctctcta 
aaacagacag 
gggggctgtc 
cgcaaacttt 
tcggagtccc 
ggctgggcgt 
atagacaagg 
ctggaaataa 
acgtgcgcac 
tgtgcgagtg 
gcgcccggta 
catttagttt 



cggcgcatag 
gttgcagtgg 
tcttgttgtg 
ctgtgcaatg 
aaattttgta 
acttctaccc 
aaaaaatcag 
ttcagtatag 
aatattctaa 
cctttcttga 
ccgattttac 
tattataaat 
agagagagag 
aagagcaagg 
ccccctgtac 
caattcattt 
acagcagcag 
aaagcagatg 
catggcacaa 
tggaaaatcc 
ttctgttttg 
gatcgcatag 
gggacacatt 



ataagtaagt 
cgaagatacc 
cccactgtag 
tttttaaaat 
ggatgtgtgt 
tttaaacgaa 
gtctggaatt 
aacaaaacat 
acaatttgag 
gttccaaatt 
tgtaccctat 
gtgcgccgat 
agagaaagtt 
aatcaagtgg 
acccccttgc 
tcatgccgat 
gactcggggt 
gatcgataaa 
acaggcccca 
cgagaggtag 
gttagatacg 
gcaatatcga 
aaagttaatt 



tgataactat 
tgtgaaaact 
cgttagtgta 
cacttatcag 
aaaatatatt 
attacgataa 
tctacaattg 
ggaatatact 
cagttttctt 
cctggaccgt 
ccgcatcgcc 
tttatttaac 
ggtcagagcc 
aatgttggtg 
caaggcctaa 
tatgcttcgg 
acacactaag 
acgtagtaaa 
aactgcatcg 
gacatgtttg 
tatgaaaaag 
tatgggaaat 
tgactc 



gaattttaag 
cagttccgct 
actggaagca 
catagacaaa 
tgtaaaattt 
taaagataca 
ttaaatatga 
taagccatgt 
gagtttacgc 
gatcaagttt 
aagcccggtg 
atcagaggtc 
acgagaactg 
gtgctctttt 
tcgcagaaaa 
gttttggtcc 
agaggtctaa 
aatgctcgta 
acaacaccat 
gcgcgttttt 
tgaaaaagcc 
cattgactaa 



<210> 146 
<211> 1709 
<212> DNA 
<213> Drosophila 



<400> 146 

atggcgcagt 

ggattccatg 

agctacggct 

gccgtgtacg 

gcctgtgtct 

gacctggaga 

agccatatgg 

tttagcatcc 

tacgttcact 

aagcattact 

gcctgtgcac 

tgtctcttga 

gaaatgcgca 

ggatatgccc 

tgtcgtattc 

tttattgcat 

tggctaatat 

agttattacg 

tggtccgtta 

ttctatgtcc 

ttcgagtcca 

gtcgtatggg 

caccttaact 

ctgattggaa 

acctgtgaaa 

tagcgttagt 

aatcacttat 

tgtaaaatat 

gaaattacga 



tctgcattct 
aagagacaca 
tagatggccc 
actttgtcat 
gtaagaccaa 
agaggcagtg 
aggtggagtt 
acgtggaaac 
ggacacttca 
gcttcactcc 
cagaaaaact 
tagctattct 
acagtttcta 
tactggctta 
taccaagttt 
ttaagctgta 
ttactcccat 
gctctaggct 
tgatatattt 
tgagccagat 
tagagaagaa 
tcgtttgcat 
acgctgtaag 
agaatcagca 
actcagttcc 
gtaactggaa 
cagcatagac 
atttgtaaaa 
taataaagat 



tggagtcctc 
ttatccgtgt 
atttgtgcac 
cgaaaatggc 
accctgtgta 
cttagtccct 
gggcaacggc 
accatgtgag 
tgagaacgga 
gctgctccat 
atatttcgtc 
atcaatgttt 
cggtgtggcc 
tctcacgctc 
ggcgttgatg 
cttgagcttt 
cgtactggta 
catcttcgga 
ttacgctcct 
ctacatccgt 
ccgattcaaa 
atgctccttt 
cttctgcatg 
aatccaaaat 
gctctcgagt 
gcagttgagt 
aaaaaccatg 
tttgaactcg 
acaaaaaat 



ctcattcttt 
gcctttatcg 
aactggacgg 
atccgcatcc 
cggatttgct 
gtggccgggg 
agtctgcgtc 
cacatgaaag 
accatcagcc 
gggaactcca 
ttgggggtgc 
atcgtgctga 
ataaaggcgt 
cacaacccag 
aacttggtgc 
tatggcgttg 
gccgtgggct 
ggcgatacat 
gttttcgtgg 
gatcagccag 
tcattttgga 
gcctttaact 
gccttccacg 
tttctgcggc 
tttggttaac 
gtatcgacaa 
tattttaaat 
aacaaaaaag 



ctggaacaca 

atacggcaaa 

tgattccccg 

ccgcaagcag 

gtttacgggg • 

tgtccagcct 

tagtgaaact 

ccgtaaccaa 

accgaggaca 

cctgggagtg 

gggagtggac 

tggtctacct 

acgccatttg 

caaatctctc 

tttcctttta 

tcttcacaaa 

ggtctttctt 

gctggtttga 

cttgcgccat 

atatcgagac 

agtactttgg 

actactggga 

ggtttgcggc 

gcatagataa 

ttttcttgtt 

tatctgtgca 

gtaaaatttt 

tacacttcta 



tactactgtt 
ctcgagtttt 
gttgagtgta 
aaccatgtat 
gaactcgaac 
aaaaataaat 
taacaagtcc 
gaaagaacac 
ctccaacaat 
ctccagtctt 
ttacttcccc 
aatggccgca 
catcccgaca 
gaaaaggctt 
cccacccggg 
tgctcttttt 
gaatatagag 
gtaatggaag 
cacaatccag 
cgactgcatg 
tatattgacc 
catttgcaga 



ctgctcgtgg 
cataacgggg 
tcacttcgta 
gcatctcaga 
agaaatctac 
tccctcccat 
gcaaccccgt 
gggctcggaa 
catattttct 
gcagccattg 
atacgccata 
tatgtgttcg 
tatgatcttg 
aaatgcggct 
cattctgagc 
actgatgttt 
tgtgggtttc 
tccacgcaat 
tagcggtttc 
ggagaagagc 
ctacacggca 
gaacagatcc 
cctctatgcg 
tggcgaagat 
gtgcccactg 
atgtttttaa 
gtaggatgtg 
ccctttaaac 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4116 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1709 
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<210> 1-37 
<211> 492 
<212> PRT 
<213> Drosophila 



<400> 147 



Met 


Ala 


Gin 


Phe 


Cys 


He 


Leu 


Gly 


Val 


Leu 


Leu 


He 


Leu 


Ser 


Gly 


Thr 


1 








5 










10 










15 




His 


Cys 


Ser 


Trp 


Gly 


Phe 


His 


Glu 


Glu 


Thr 


His 


Tyr 


Pro 


Cys 


Ala 


Phe 








20 










25 










30 






He 


Asp 


Thr 


Ala 


Asn 


He 


Thr 


Gly 


Ser 


Tyr 


Gly 


Leu 


Asp 


Gly 


Pro 


Phe 






35 










40 










45 








Val 


His 


Asn 


Trp 


Thr 


Val 


He 


Pro 


Arg 


His 


Phe 


Val 


Ala 


Val 


Tyr 


Asp 




50 










55 










60 










Phe 


Val 


He 


Glu 


Asn 


Gly 


He 


Arg 


He 


Pro 


Ala 


Ser 


Arg 


His 


Leu 


Arg 


65 










70 










75 










80 


Ala 


Cys 


Val 


Cys 


Lys 


Thr 


Lys 


Pro 


Cys 


Val 


Arg 


lie 


Cys 


Cys 


Leu 


Arg 










85 










90 










95 




Gly 


Glu 


He 


Tyr 


Asp 


Leu 


Glu 


Lys 


Arg 


Gin 


Cys 


Leu 


Val 


Pro 


Val 


Ala 








100 










105 










110 






Gly 


Val 


Ser 


Ser 


Leu 


Pro 


Ser 


His 


Ser 


His 


Met 


Glu 


Val 


Glu 


Leu 


Gly 




115 










120 










125 








Asn 


Gly 


Ser 


Leu 


Arg 


Leu 


Val 


Lys 


Leu 


Gin 


Pro 


Arg 


Phe 


Ser 


He 


His 




130 










135 










140 










Val 


Glu 


Thr 


Pro 


Cys 


Glu 


His 


Met 


Lys 


Ala 


Val 


Thr 


Lys 


Gly 


Ser 


Glu 


145 










150 










155 










160 


Tyr 


Val 


His 


Trp 


Thr 


Leu 


His 


Glu 


Asn 


Gly 


Thr 


He 


Ser 


His 


Arg 


Gly 








165 










170 










175 




His 


He 


Phe 


Ser 


Lys 


His 


Tyr 


Cys 


Phe 


Thr 


Pro 


Leu 


Leu 


His 


Gly Asn 








180 










185 










190 






Ser 


Thr 


Trp 


Glu 


Trp 


Gin 


Pro 


Leu 


Ala 


Cys 


Ala 


Pro 


Glu 


Lys 


Leu 


Tyr 






195 










200 










205 








Phe 


Val 


Leu 


Gly 


Val 


Arg 


Glu 


Trp 


Thr 


Tyr 


Ala 


He 


Cys 


Leu 


Leu 


He 




210 










215 










220 










Ala 


He 


Leu 


Ser 


Met 


Phe 


He 


Val 


Leu 


Met 


Val 


Tyr 


Leu 


Met 


Cys 


Ser 


225 










230 










235 










240 


Glu 


Met 


Arg 


Asn 


Ser 


Phe 


Tyr 


Gly 


Val 


Ala 


He 


Lys 


Ala 


Tyr 


Ala 


He 










245 










250 










255 




Cys 


Met 


He 


Leu 


Gly 


Tyr 


Ala 


Leu 


Leu 


Ala 


Tyr 


Leu 


Thr 


Leu 


His 


Asn 








260 










265 










270 






Pro 


Ala 


Asn 


Leu 


Ser 


Asn 


Ala 


Ala 


Cys 


Arg 


He 


Leu 


Pro 


Ser 


Leu 


Ala 






275 










280 










285 








Leu 


Met 


Asn 


Leu 


Val 


Leu 


Ser 


Phe 


Tyr 


He 


Leu 


Ser 


Phe 


He 


Ala 


Phe 




290 










295 










300 










Lys 


Leu 


Tyr 


Leu 


Ser 


Phe 


Tyr 


Gly 


Val 


Val 


Phe 


Thr 


Lys 


Leu 


Met 


Phe 


305 










310 










315 










320 


Trp 


Leu 


He 


Phe 


Thr 


Pro 


He 


Val 


Leu 


Val 


Ala 


Val 


Gly 


Trp 


Ser 


Phe 








325 










330 










335 




Phe 


Val 


Gly 


Phe 


Ser 


Tyr 


Tyr 


Gly 


Ser 


Arg 


Leu 


He 


Phe 


Gly 


Gly Asp 








340 










345 










350 






Thr 


Cys 


Trp 


Phe 


Asp 


Pro 


Arg 


Asn 


Trp 


Ser 


Val 


Met 


He 


Tyr 


Phe 


Tyr 






355 










360 










365 








Ala 


Pro 


Val 


Phe 


Val 


Ala 


Cys 


Ala 


He 


Ser 


Gly 


Phe 


Phe 


Tyr 


Val 


Leu 




370 










375 










380 










Ser 


Gin 


He 


Tyr 


He 


Arg 


Asp 


Gin 


Pro 


Asp 


He 


Glu 


Thr 


Glu 


Lys 


Ser 


385 










390 










395 










400 


Phe 


Glu 


Ser 


He 


Glu 


Lys 


Asn 


Arg 


Phe 


Lys 


Ser 


Phe 


Trp 


Lys 


Tyr 


Phe 










405 










410 










415 




Gly 


Tyr 


Thr 


Ala 


Val 


Val 


Trp 


Val 


Val 


Cys 


He 


Cys 


Ser 


Phe 


Ala 


Phe 








420 










425 










430 
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Asn Tyr Tyr Trp Glu 
435 

Cys Met Ala Phe His 
450 

Asn Gin Gin lie Gin 
4 65 

Thr Cys Glu Asn Ser 
485 



Asn Arg Ser His Leu Asn 
440 

Gly Phe Ala Ala Leu Tyr 
455 

Asn Phe Leu Arg Arg lie 
470 475 
Val Pro Leu Ser Ser Phe 
4 90 



Tyr Ala Val Ser Phe 
445 

Ala Leu He Gly Lys 
460 

Asp Asn Gly Glu Asp 
480 

Gly 



<210> 148 
<211> 3772 
<212> DNA 
<213> Drosophila 



<400> 148 

agtcagtgcc 

atcccccaat 

tatccgtcgt 

atgctggtta 

ggcatgtgtt 

attccaagta 

aaactgtttc 

ctttagcact 

gaggcgtctt 

ccatgtcgaa 

tattttttga 

gacgaaagtg 

ttttactcgg 

actcgtacca 

atggcaatct 

tcgctggcca 

cgcggcatac 

cttcgaccag 

gattccctcg 

cttcaagcga 

caacctgagc 

cctcatctca 

ggcccagttc 

caacacaagc 

cgttcataac 

accgctactt 

gcactctgag 

gcatctacga 

aggaggtcgt 

tgtgcctggc 

ttaaggggat 

agccgcgtcg 

ctctactacg 

ctgaatgccg 

gagcagcatc 

ttgatgccct 

actcccttgc 

ctagccaggg 

ttcatatgcc 

tccgagccca 

gccctaaatc 

aaacacaaaa 

acggatcaga 

aggtgccatc 



tggacggatc 
ccaaacccat 
tgtaaagcta 
aaagactata 
gtgtgggcga 
ttttcgagtg 
attggtttcc 
cttagtttaa 
gagtagttct 
cctacgagtc 
agccgaactt 
tcgcaaataa 
cggcggcggc 
tcatcatgac 
gaacgataag 
taaatctgtc 
tttcgggctt 
ctcgagttgt 
atccagccgt 
catgccttct 
accctctatg 
ctggcctgca 
caggcgcagt 
taattacaaa 
caccacgtcg 
ggccgtgatg 
catgctgctc 
ctaccgaaag 
gaccgctgtg 
cggggacgta 
ttggacgggg 
aaaacaattt 
tcatactgtt 
tgattgcgag 
aggagcccaa 
ccaccttggt 
cgcccaagag 
aggccaggca 
ttcgactgcc 
tcgactggct 
ccatcttcta 
ttcagggatt 
tggacaaaag 
ccagccgcga 



gcacgagtcc 
acccaaagct 
ccttatccac 
cacctagctc 
gtgagtgtgt 
cttagttttg 
cactccgctt 
tgtttaaagt 
tttgtacatt 
tacgaacata 
atgcaaccga 
atgtgagcca 
aagaagcgtt 
gatggcgtgt 
cggcacccga 
ggcagtctgc 
ccgttccgtt 
acctagaatg 
atccgggcgt 
ccgccgtcgt 
tgaacagccg 
gtgaccttgt 
atctgcagct 
tctccttaga 
gtgttgtccg 
cgacccgtgc 
atctgggcat 
atctatctcc 
cccgaggagc 
agtcaacgac 
ggatgtcaag 
atgcaattct 
caccctgatc 
gcagctctgg 
ggagggtcag 
cagtgccatg 
taccgtcaac 
ccgcaagatg 
cgcctgggtc 
cctgtacttc 
caccttcctc 
tcttggctgt 
cgggtgctgc 
tagggcagca 



ttctactaag 
tcgctgcatt 
tacgattact 
catatactat 
gcgtgtgact 
taatttgtag 
tattcgataa 
tagtttcgct 
aagcttacac 
cgaacatata 
aaggagacaa 
ctgctaaaga 
tggggcacaa 
ggaacgacgt 
taatgaaaca 
tcgcagaagc 
ccgttcgcca 
ggcgttcgcc 
gttcgtgggc 
gggcacgctc 
ccggaagctg 
ctccagcata 
ctgggtgagg 
caattggtgg 
gcagcctgac 
tggggttctg 
gttcgatagg 
tggatgtgga 
tggtggtcac 
tggattatcg 
ttgttgaatc 
ataaatattc 
ttcctgccgt 
ctgcggagac 
tttaagacca 
ggagtggccg 
gctcctggca 
gtggtggtgg 
ttcctgatca 
agcttcggca 
acccagacca 
ccgcctggta 
tgcggcctga 
gccaccgtaa 



cgctaccagt 
ttcgaataca 
acactatcct 
atatatatat 
tctgattgag 
ctagccgtta 
gctcgttttc 
gttaagttta 
ttagctctac 
tatgtgcgtt 
taaacaaatg 
agaccgattt 
acgacggatg 
gcgcatagta 
ttaaactaaa 
ttttaggtgc 
atggccctgc 
cagcacggag 
gatctcagcc 
ttcgtcctcg 
cggcccttct 
ttctgcaccg 
cggcagggat 
cttcatgtgc 
cctggtggcc 
gagtcccgac 
atcctctggc 
ggactccagc 
cgagctggaa 
cttaatacag 
aatgttccga 
cgttttagca 
gcattgtgtc 
actaccacca 
tggcgaatgg 
ttccctttgc 
agaagaccac 
tgctgctaat 
tgcggcttta 
tactgaacct 
ttaggacctt 
aagtgccgga 
ggccacccac 
tccgggatgt 



ggtggtcgta 
aacactgaac 
acaaaaaact 
acaatatcga 
ttgccatttg 
atgtttgtat 
tagattttag 
caccgtagtc 
attagatata 
gaaagcgttg 
tgctataaag 
gcgcaaggtg 
agtgattgcc 
atgacagcgg 
ccgtcggtag 
ggaccccttg 
tccactacac 
aggccacgcc 
agctgaaccg 
ccttctgcgg 
tccgggcctg 
tatcctacat 
tgagcagact 
aagtttgtgc 
atcgccttgg 
aaacgcttca 
ccgctgctgg 
gaggagagcg 
atggttcaca 
ttgacagcga 
ttatcttatc 
cgacgtcgga 
cttcctatgg 
ggagcagcag 
cggggatctc 
gctggacaac 
tgctgctgct 
gatggcggtt 
tggatcctac 
tttcagttgc 
gacgctggtg 
tggtatgccc 
cttcacatgg 
ggaccagcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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gatccgccct 
aagcaggagg 
tcctccgcct 
caacggttag 
tacaggcgca 
tataagatat 
gaatgggtgt 
tcagctatta 
agaacttaag 
tattcttaaa 
gacagactat 
ccgaaactta 
tagagtcagc 
catatactag 
aaagcagttg 
gctctctccc 
caaagctgaa 
ttcggattct 
gaagaaacaa 



ccgaccaggt 
tgttcaccat 
gataggcggt 
cagaacccca 
aaaacggt tt 
aagctcacat 
tggcagctaa 
ttgtgacgaa 
caaattgtga 
tatttttact 
ctttgttatt 
aaattgagtc 
agatcgtgga 
atacttccgc 
ttaattgtca 
atactctctt 
ttctgcgttt 
ttggtcggta 
caaagcgaca 



gcaaccggat 
ttacaagcag 
ggtcataaat 
gaacttgatt 
gtcacggtgg 
attatgtacc 
ctaaccatta 
aacctaagac 
gattaaaatg 
ctagaaatac 
tctaagaacg 
atgccaaaaa 
cgagacaatg 
ttggaacaat 
aatgttgagc 
cccatgaact 
tctggcattc 
aaacgcgaaa 
agtgtggagt 



ccgagtagcc 
tgcggtgact 
cacccgggtg 
gatggatgga 
gtatatcacc 
tcatcaaatg 
acacatggcg 
ctttcaagta 
caatgcaact 
tgacatcttt 
tagtaattag 
aagagaattg 
agcccgcaat 
ggttaacaga 
acgccagtta 
tttttaagcc 
agtgtcggcg 
tcttctgtgc 
gtggagtgca 



tacggcggtt 
cgtcgagtgc 
catttgtcgg 
tccagccccc 
cacgaataca 
tgtaaatatg 
caatcgaaat 
ccacttaaaa 
cctaaaggtg 
atgtgcttag 
ttaatggaat 
gcctgacaag 
gcaatggaac 
tctcgtgaga 
aatttagcta 
gcagagcccg 
gtttgcggcg 
ggcgggtgaa 
taccgtacca 



cctcagctac 
gtcaatcgag 
atacgggggc 
attcactctg 
ctagcacgca 
aggaataaac 
cgggaaggtc 
ctatatttag 
cgaatatacc 
ttttcgttca 
taattagtaa 
acacacatta 
tagattgtat 
gatggggctg 
tgcagcgaaa 
aagctcgact 
agcggttgga 
atcttcggca 
eg 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3772 



<210> 149 
<211> 1566 
<212> DNA 
<213> Drosophila 



<400> 149 

atggccctgc 

cagcaeggag 

gatctcagcc 

ttcgtcctcg 

cggcccttct 

ttctgcaccg 

ggtggcttca 

ctgaccctgg 

ttctggagtc 

ataggatcct 

gtggaggact 

gtcaccgagc 

taegtcatae 

gccgtgattg 

catcaggagc 

ccctccacct 

ttgccgccca 

agggaggeca 

tgccttcgac 

cccatcgact 

aatcccatct 

aaaattcagg 

cagatggaca 

catcccagcc 

ccctccgacc 

gaggtgttca 

gectga 



tccactacac 
aggccacgcc 
agctgaaccg 
ccttctgcgg 
tccgggcctg 
tatcctacat 
tgtgcaagtt 
tggccatcgc 
ccgacaaacg 
ctggcccgct 
ecagegagga 
tggaaatggt 
tgttcaccct 
cgaggcagct 
ccaaggaggg 
tggtcagtgc 
agagtaccgt 
ggcaccgcaa 
tgcccgcctg 
ggctcctgta 
tctacacctt 
gatttcttgg 
aaagcgggtg 
gegataggge 
aggtgeaace 
ccatttacaa 



cttcgaccag 
gattccctcg 
ettcaagega 
caacctgagc 
cctcatctca 
ggcccagttc 
tgtgccgttc 
cttggaccgc 
cttcagcact 
gctgggcatc 
gagegaggag 
tcacatgtgc 
gatcttcctg 
ctggctgcgg 
tcagtttaag 
catgggagtg 
caacgctcct 
gatggtggtg 
ggtcttcctg 
cttcagcttc 
cctcacccag 
ctgtccgcct 
ctgctgcggc 
agcagccacc 
ggatccgagt 
gcagtgcggt 



ctcgagttgt 
atccagccgt 
catgccttct 
accctctatg 
ctggcctgca 
caggegcagt 
ataaccacca 
tacttggccg 
ctgagcatgc 
tacgactacc 
gtcgtgaccg 
ctggccgggg 
ccgtgcattg 
agacactacc 
accatggega 
gccgttccct 
ggcaagaaga 
gtggtgctgc 
ateatgegge 
ggcatactga 
accattagga 
ggtaaagtgc 
ctgaggccac 
gtaatccggg 
agcctacggc 
gaetegtega 



acctagaatg 
ateegggegt 
ccgccgtcgt 
tgaacagccg 
gtgaccttgt 
atetgeaget 
cgtcggtgtt 
tgatgegace 
tgctcatctg 
gaaagatcta 
ctgtgcccga 
accacgacgt 
tgtccttcct 
accaggagca 
atggcgggga 
ttgcgctgga 
ccactgctgc 
taatgatggc 
tttatggatc 
accttttcag 
ccttgacgct 
eggatggtat 
ccaccttcac 
atgtggacca 
ggttcctcag 
gtgcgtcaat 



ggcgttcgcc 
gttcgtgggc 
gggcacgctc 
ccggaagctg 
ctccagcata 
ctggacaatt 
gtccggcagc 
cgtgctgggg 
ggcatgttcg 
tctcctggat 
ggagctggtg 
cggactctac 
atggctgaat 
gcaggagcag 
tctcttgatg 
caacactccc 
tgctctagcc 
ggttttcata 
ctactccgag 
ttgcgcccta 
ggtgaaacac 
gcccacggat 
atggaggtgc 
gccagatccg 
ctacaagcag 
cgagtcctcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1566 



<210> 150 
<211> 521 
<212> PRT 
<213> Drosophila 



<400> 150 
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Met 


7\ 1 ^ 

AJLa 


Leu 


Leu 


HIS 


Tyr 


i nr 


rhe 


Asp 


1 
1 








0 










Trp 


Aia 


D W 

rile 


A 1 

Aia 


P 1 ft 

bin 


HIS 


oi y 


Pin 

blU 


7A 1 a 
Ala 




















Pro 


Tyr 


Pro 


p i w 
biy 


val 


rne 


V 3 1 


biy 


Asp 






JO 










a n 
4 U 




Lys 


Arg 


HIS 


Ala 


Phe 


Ser 


Ala 


val 


val 














DO 






Phe 


Cys 


p i . . 

biy 


Asn 


Leu 


Ser 


Thr 


Leu 


Tyr 


65 










70 








Arg 


Pro 


Phe 


Phe 


Arg 


Ala 


Cys 


Leu 


lie 










85 










Val 


Ser 


Ser 


lie 


Phe 


Cys 


Thr 


Val 


Ser 








100 










i n c 

105 


Gin 


Tyr 


Leu 


Gin 


Leu 


Trp 


Thr 


lie 


Gly 






115 










120 




Pro 


Phe 


lie 


Thr 


Thr 


Thr 


Ser 


Val 


Leu 




130 










135 






Ala 


lie 


Ala 


Leu 


Asp 


Arg 


Tyr 


Leu 


Ala 


145 










150 








Phe 


Trp 


Ser 


Pro 


Asp 


Lys 


Arg 


Phe 


Ser 










165 










Trp 


Ala 


Cys 


Ser 


lie 


Gly 


Ser 


Ser 


Gly 








180 










HOC 

185 


Tyr 


Arg 


Lys 


lie 


Tyr 


Leu 


Leu 


Asp 


Val 






195 










o ft n 




Glu 


Glu 


Val 


Val 


Thr 


Ala 


Val 


Pro 


Glu 




210 










215 






Glu 


Met 


Val 


His 


Met 


Cys 


Leu 


Ala 


Gly 


225 










£ JU 








Tyr 


Val 


lie 


Leu 


Phe 


Thr 


Leu 


lie 


Phe 










245 










Leu 


Trp 


Leu 


Asn 


Ala 


Val 


lie 


Ala 


Arg 








260 










265 


Tyr 


His 


Gin 


Glu 


Gin 


Gin 


Glu 


Gin 


His 






275 










280 




Phe 


Lys 


Thr 


Met 


Ala 


Asn 


Gly 


Gly 


Asp 




290 










295 






Val 


Ser 


Ala 


Met 


Gly Val 


Ala 


Val 


Pro 


305 










310 








Leu 


Pro 


Pro 


Lys 


Ser 


Thr 


Val 


Asn 


Ala 










325 










Ala 


Ala 


Leu 


Ala 


Arg 


Glu 


Ala 


Arg 


•His 








*5 A A 
J4U 










TIC 

J4 0 


Leu 


Leu 


Met 


Met 


Ala 


Val 


OK ft 

rne 


lie 


Cys 






o c c 
3 O O 














Phe 


Leu 


T 1 ft 

lie 


Met 


Arg 


Leu 


Tyr 


biy 


Oft v 

ber 














J / 0 






Leu 


Leu 


Tyr 


rhe 


Ser 


Phe 


p l . * 

biy 


lie 


T ft.. 

Leu 


-Job 










390 








Asn 


Pro 


lie 


Phe 


Tyr 


Thr 


Phe 


Leu 


Thr 










405 










Leu 


Val 


Lys 


His 


Lys 


lie 


Gin 


Gly 


Phe 








420 










425 


Val 


Pro 


Asp 


Gly 


Met 


Pro 


Thr 


Asp 


Gin 






435 










440 




Cys 


Gly 


Leu 


Arg 


Pro 


Pro 


Thr 


Phe 


Thr 




450 










455 






Asp 


Arg 


Ala 


Ala 


Ala 


Thr 


Val 


lie 


Arg 



465 470 



PCT/US01/09341 



Gl n 

OJ.ll 


Leu 


Pin 


Leu 


iyr 


Leu 




10 










1 EL 
X ~J 




I ILL 


Pro 


lie 


rio 




TIq 

ne 


OX 1 1 
















Leu 


Qft-t- 

ber 


Pin 

bin 


Leu 


Asn 


Arg 


Drift 

rne 








A ft 
*i 0 








p i \t 
biy 


1 nr 


Leu 


D ft 

rne 


val 


Leu 


A 1 ft 

Ala 
















Val 


Asn 


Ser 


Arg 


Arg 


Lys 


Leu 




1 0 










oU 


Oft -~ 

oer 


Leu 


Ala 


Cys 


Ser 


Asp 


Leu 


on 
y u 










95 




Tyr 


Met 


Ala 


Gin 


Phe 


Gin 


Ala 










110 






Gly 


Phe 


Met 


Cys 


Lys 


Phe 


Val 








125 








Ser 


Gly 


Ser 


Leu 


Thr 


Leu 


Val 






140 










Val 


Met 


Arg 


Pro 


Val 


Leu 


Gly 




155 










1 60 


Thr 


Leu 


Ser 


Met 


Leu 


Leu 


lie 


1 1 \j 










1 IE 

1/3 




Pro 


Leu 


Leu 


p i • . 
biy 


Tift 

lie 


Tyr 


Asp 










1 yu 






pin 

blU 


Asp 


Oft V 

ber 


Ser 


pi.* 

blU 


pi.* 

blU 


C ft tft 

ber 
















p l . . 
blU 


Leu 


val 


val 


Thr 


pi.. 
blU 


Leu 






Tin 










Asp 


u ■; ft 
HIS 


Asp 


tt _ i 

val 


pi., 
biy 


Leu 


Tyr 




235 










240 


Leu 


Pro 


Cys 


lie 


1 1 — i 
Val 


Ser 


Phe 


o c r\ 

250 










255 




Gin 


Leu 


Trp 


Leu 


Arg 


Arg 


His 










270 






Gin 


Glu 


Pro 


Lys 


Glu 


Gly 


Gin 








285 








Leu 


Leu 


Met 


Pro 


Ser 


Thr 


Leu 
















Phe 


Ala 


Leu 


Asp 


Asn 


Thr 


Pro 




Jlo 










•2 ft /i 


Pro 


Gly 


Lys 


Lys 


Thr 


Thr 


Ala 


jjU 














Arg 


Lys 


Met 


val 


17 -i 1 

val 


17 1 

val 


IT -t 1 

val 










OOU 






Leu 


Arg 


Leu 


Pro 


Aia 


Trp 


IT ft 1 

val 
















Tyr 




pin 

olU 


Pro 


Tift 

lie 


Asp 


Trp 






380 










As n 


Leu 


rne 


C ft V 

ber 


Cys 


7A 1 =t 

Aia 


Leu 














a nn 
H uu 


bin 


Thr 


Tift 

lie 


Arg 


Thr 


Leu 


Thr 


a i n 
1 1 U 










A 1 C 

4 10 




Leu 


Gly 


Cys 


Pro 


Pro 


Gly 


Lys 










430 






Met 


Asp 


Lys 


Ser 


Gly 


Cys 


Cys 








445 








Trp 


Arg 


Cys 


His 


Pro 


Ser 


Arg 






460 










Asp 


Val 


Asp 


Gin 


Pro 


Asp 


Pro 




475 










480 
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Pro Ser Asp Gin Val Gin Pro Asp Pro Ser Ser Leu Arg Arg Phe Leu 

485 490 495 

Ser Tyr Lys Gin Glu Val Phe Thr lie Tyr Lys Gin Cys Gly Asp Ser 

500 505 510 

Ser Ser Ala Ser He Glu Ser Ser Ala 
515 520 



<210> 151 
<211> 8599 
<212> DNA 
<213> Drosophila 



<400> 151 

aggacacatt 

tatatatcat 

tcccgactct 

cggcatatag 

atgatacaga 

gtaggtggga 

gacatttttc 

ctcgcgggga 

ggattgccat 

attcgccgcc 

gtgacacact 

cccttgccgc 

atgctttgac 

acggaagcgg 

aattggtcac 

acgttgtcaa 

agagactcat 

tctcctggta 

caatttcatc 

cacactgggc 

tgcggcctac 

ttatgtctct 

attggtctgc 

ggagagatct 

aagtggggcg 

ccactgcggt 

gtgatggaaa 

tggtcgagag 

aagcggaaag 

gccattaatt 

gttagtttat 

caacaattgt 

cattgtaata 

tatatatttt 

tattggtaaa 

acgattaata 

aaagcaataa 

gcgttttcca 

gcatggcagt 

gcaagaaggt 

ttgcccggat 

gcgctcggaa 

aatcgactat 

aattacgcaa 

caaaaaatgg 

aaagcaaagt 



ggcgtactta 
gcggtgattt 
ctggcgactt 
tggatatagt 
tagacagacg 
aatgggattg 
accgcgccat 
gaacaagtgt 
ccaattatac 
atattggccc 
gcggtttcgg 
ttcctcggtg 
ttcattatgg 
aagacatccc 
gcgccatata 
gggactttgt 
atccttcata 
gcacccgccg 
gcccacatag 
gtcatcgggg 
atacaccaac 
ccttgtagcc 
taattacacc 
tctgtcagat 
gaagtgcgat 
caattaactc 
cggacaatcc 
tgtgtaggat 
ttatcccagc 
atcatgacgg 
cacttatcaa 
tgcatacatg 
aaatccagtt 
atacgatgtc 
acatgcacgc 
attttattaa 
tacaaataat 
ggccctgctc 
cgatagatac 
aagattccac 
aaggcacgct 
ggacaataaa 
aacaatggga 
gcataaataa 
gtgggcagcg 
tggcaaaaac 



gtctagacca 
gtggacttag 
tggacgggcc 
atgcgtggcg 
gcaggcgaca 
tggaaagcgg 
ccattcgcag 
tttgatttat 
tgctaacttt 
attgttgcgc 
tggcaggact 
gactccgtct 
cgtaacaaat 
cgaaagaaac 
gacaattcca 
gctgctgtac 
tcctttcttg 
tgaagaactc 
ggattgcgga 
ccaactgcct 
aggtgggagg 
ccgttatctg 
attcggtcta 
acaggtgagc 
aatttgttat 
tctcttaaca 
cgattaacgg 
tctgcggtaa 
catatggtta 
tggacagggg 
gtgtcaaaaa 
cttaaaaagg 
aacactcata 
gtttcacaaa 
attccacgca 
aagtttttcg 
aaacttctaa 
cgaggggcgc 
atgtgtgccc 
gtcctatcgc 
cctcggcgat 
caaagctccg 
agagcaccgc 
ggataatggg 
ccaaagagca 
caacaaaacc 



ctgggtgagc 
cataaaacct 
acgataaatc 
aaacttaata 
gtcgcggtgg 
aaaaatcttt 
acggatcgcc 
cgccagcaat 
cattatgtca 
cattgtctat 
catctccatc 
gataaatcgt 
cgtcggcatt 
tgcgaactgg 
tctcagtttt 
ttttttcgct 
cagctctgcc 
cagctgctcg 
gaccatcgag 
ggtcatcttc 
ctgagtcact 
ctcacctcgc 
atgccggcat 
gggtcaagag 
ggaacacccg 
atgcacagta 
gcggaacaat 
gagtcagggt 
ggaagctagt 
gcatcgtggt 
gcatgcaacg 
ttgggttttt 
caactatttt 
atttgaattt 
taactctcac 
attcgcctcg 
acgagcacta 
tttcccagca 
tgcatccgcg 
ctcgcatcga 
ccttatcctg 
gcataatgac 
acagcaacta 
ctttaagcag 
agtgcaaaac 
gaaaaaaaaa 



ttaatgtttt 
aactgaacgg 
aaggcgcaga 
agagatgcga 
tgcgagaggt 
aaagcaatgc 
aaagagagac 
ggcaatggca 
ctgtgctatt 
gcacagctct 
ctgccgcctg 
cagcgcggct 
ttgcattcct 
gcctttgtgg 
ccacatgttt 
tacactttaa 
atggatctaa 
catccgcgct 
gcggtgctca 
gtcatcaaca 
ggcttaacac 
tggccctgaa 
tgttccactg 
tgggtcaaca 
tctcccgttt 
attactggac 
gtgtaatgct 
tccgatgcag 
gcacatggaa 
tagggtgtgc 
aaatcaccta 
atgaaaatca 
catgggaacc 
tccaactgcc 
aaatattgaa 
acaagcgaaa 
acgcattttc 
gagcgccgtc 
ccgctactac 
tagggaactt 
ttgttcttgc 
gagaaatcac 
cacaatatgg 
cttgacagtc 
aaagcatcat 
acaacggggg 



gccgttcggt 
cccggcgaca 
tccgggagcc 
caacaaacaa 
gggtaggtgg 
cgctgcacag 
atcaacgtga 
gggcggacgg 
tcgggaccgg 
ctgtttagca 
ccttagtcgt 
tccctttcac 
cgctgcggaa 
ccacacgacg 
tgattgctca 
tagcttccga 
agactggctc 
actccggcca 
tcctggtcct 
accgccgcta 
cttgtacatc 
tgacctgaca 
ctggccatac 
ccaggtcaac 
gcggctgttg 
ataaatggcg 
agtattaacg 
aaggtgtgac 
aaaccattgt 
aacagaaact 
gaatccatca 
gagttgtaaa 
agactcttgt 
actttagggt 
acacatactg 
aagtttgccg 
tggctttttc 
atcctggttt 
cagcactcca 
ctattttggg 
actcgcgctc 
gcgccaataa 
tcggcgagcc 
attaaacagg 
ggagaatgaa 
ccaagtcgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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tagacattcg 
attttttacg 
acgcaggcag 
gcaaacgagc 
gactgtaaac 
gtccttgtcc 
tttagaatcc 
ttatttaacc 
acagttgtaa 
tggaaaacgg 
gaaaatcggc 
ctgccacgcc 
gaatgcgaaa 
tgggaacttt 
cttggacata 
gttcgccact 
atatttgctg 
ctgcaatcct 
gagatcgccc 
aaattaatca 
tttccatgac 
gacaacattt 
cgattgtggc 
tgcacccttt 
ttggactccc 
attagaaagt 
cttccggcta 
attggccata 
atcagaccgc 
gtggccatac 
cccaaaggtg 
ataaactata 
ctacagcaag 
ggtgctgatg 
gaccatgccc 
ccagcagctg 
cctctaccac 
ccacggccac 
catgggactg 
gaaccgggtg 
ggaggatgaa 
ggggattaag 
atcagtatgc 
ctattcaata 
aaactcactt 
aaatatttcc 
tgaaaatata 
gcagtatcta 
cccacagatt 
cgccatttca 
cgcctgcacg 
gctccgtaca 
gtgacattta 
ctggacggcg 
tttccgccgc 
gggaagatga 
cactgtccgt 
attccaaaca 
atatgggtat 
aagtttgcaa 



aaaagtgtat 
cgcaatgact 
gcaaatgaag 
tctcaaccaa 
aaacgaaaac 
caaatgaacg 
atttcccgca 
tttttgattt 
ttactaaact 
gaaagtacca 
gccatttcgg 
ccatttgtaa 
tactggggat 
tgactttcgt 
tctctttatt 
cacaatttgt 
actatcctta 
gcaaaaaaaa 
tgctttgttt 
cagccatggt 
tttttggcac 
tcccattcag 
agcacttttc 
accaagtgat 
ttgttgaatg 
tgaagaggga 
aataaacact 
ttggcctgaa 
acctggcgga 
tgagcttaac 
agctgcccga 
tgcgcaggtt 
ccatcctaca 
tactgctacg 
ctcacggcgg 
cagatgcacc 
ggccactcgc 
ccccaccatc 
gccggcatgc 
aactgaggaa 
agaggatgaa 
ttagcagtaa 
actcgcagaa 
ttggtaccag 
aaaaaagaac 
agaagccaat 
tagaatactc 
cacataattg 
gttcccatcc 
ctgggcttca 
ggatccaagg 
acgcggcgca 
atgttctatg 
ctgagcaata 
atgagtcgcc 
atctcctctt 
ttcagcagca 
ccacgtgcac 
ttcgcccaag 
aaacaggagc 



aaaatgtgga 
tcggcacagg 
cggcacctcc 
ccctatttcc 
taggaggcca 
ggactgtttc 
aaattcaagc 
acagtcttaa 
aactaaaaaa 
atagtggaaa 
ttttttcatg 
aaatgatggg 
gctgaaaaac 
ttttgcttac 
tatagtgaat 
attaaatgca 
caacttcagc 
aaaataatat 
gtttcgttcg 
ccgtttcaaa 
ttgggatggg 
gatgtcagaa 
cacttccaaa 
tttggcataa 
tcagacgaaa 
atttccgact 
taatatgaat 
atgcagggaa 
aatgtgctcc 
ctggattatc 
gattggcccg 
actacttcaa 
ggatactgtc 
gctccagctt 
ccgcccacca 
agcaccagca 
atcaccccag 
atggaccgcc 
cgaacatgac 
gccaggtggc 
gtgggcgggc 
atagaatttt 
cttaaagaag 
ctaattcaat 
tcgagttgtg 
attacatttt 
aactatattg 
aatcaacaat 
aagagaagaa 
tagtgatggt 
caagtgtcct 
tacataatga 
tccgtccaca 
gtctgtggaa 
agctgatgcc 
ctggcgctaa 
aatctgccat 
actttcattg 
tgcatcctgc 
tacatcctcc 



taaaaaaaag 
gcgacgacca 
cacggaaacc 
cattcccccc 
acagtcggct 
agctgaaatt 
taacaaacag 
tttctcgctg 
ggaaaatgcc 
ataaatttac 
ccatctggac 
cgcagtgtcg 
caatgctcgg 
tttcaaaagt 
ttaacagctt 
cttgaaaatt 
agtcgccggg 
ttacatttca 
ggaaaagttt 
gagacacgga 
gccattttca 
tgctgcattt 
gagcggaagt 
tcaaaagttt 
tgtgcagtga 
tatctgcacg 
tacccaacct 
tacgtgcctg 
actaattgtg 
agcctgacgg 
acatcaatgt 
caatacgggt 
cacgtgcgcc 
ccacatgagc 
cccccatccg 
acatcaccag 
ccaccccagt 
cgttatgggc 
aaacaagatt 
ctgctggcca 
tccacaacag 
ctcggcatcg 
tgttaacagc 
taaattccag 
caagttactt 
agcaaaatag 
ctaataaaca 
agtcacttta 
ctccagcgga 
cactccgtgg 
gtacaaagga 
ggtcacgcgt 
gctgccgccg 
cccgttcatg 
gaacaaagtg 
tttactgcat 
aagccctgat 
acctccccga 
agctcgcaaa 
cgaaagcccc 



ggagcaaagt 
cagcaaagtc 
gcagccagga 
ttcgtaattt 
aagaacaccg 
ggcgatcaat 
attttagctt 
tttgcttggg 
atcaggccaa 
ataacaattt 
gagtgggcga 
aaaattaata 
cggaactcgg 
gccataaatt 
gactcaaaac 
caattatttt 
ttgtgctcta 
aaaattgtac 
taaatgtgat 
caaaagttga 
ttttttcagg 
agtaactcag 
atcgctttcc 
gtaattctat 
agcgagtgga 
ccttagtgtg 
tcattcattt 
aagcccttcg 
tattcatttt 
tgtttggctt 
gctctcgaac 
ctaatggcct 
ctctactttc 
cgcttccggc 
cacccacatc 
caggcgggaa 
cacccaaacc 
catctcagca 
accaaaaagg 
aaaggatgaa 
tttgctcagg 
ccaaacaaat 
ttttagtatt 
aactcaggta 
tacttacttt 
aaagcagcag 
ggaatagaaa 
attcaatcgc 
tccacctcgc 
acgattcagg 
ttattgtgtg 
ggctcatcag 
ttcctcgatt 
tactggctgc 
agtatcgggt 
ctctgatctg 
tgattgattc 
acgccaatcc 
ttttcctcct 
cataaatccg 



tggcctagag 
cttttccagg 
ccaaagagca 
atggctcccg 
aagtccgaat 
aaatgtatag 
acatatatga 
gattttgcaa 
ataaaacaat 
tgtatgctcc 
aattggcgca 
aatggatgct 
taatggtaat 
caaagccagt 
tcgtcttgtt 
gttatctagg 
cgaaaaactc 
ccaactgttt 
agcgtaaatt 
gcttcccttt 
tggacaggtg 
ccagtcagtg 
ctctgtggtc 
ttttggcctc 
attttagcaa 
atataagcca 
tcaattcagt 
ccagtcataa 
gcagggctgc 
tctggtgctg 
acgtcaaacc 
gcgagccctt 
ccacgacgat 
taaacgatcc 
ccaccgccgc 
tgcactcgca 
accccaacca 
tggccatgag 
tgagccagct 
gatggatgaa 
cgggcgattt 
tgttgggtct 
tgtgaacggg 
aactaaacta 
ccgtgcataa 
aagttgcatt 
tttcgtttgg 
cgcaaacatt 
gctcgatggc 
aaatagtcac 
accgctttgt 
ttttgatttt 
tcctggtcac 
tgaactccga 
atccaacagt 
gagcgatgtt 
gcttatccct 
ccaagcccaa 
gcacagcgca 
taaccgaagg 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4 8 60 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
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tgtaagctcc agaatgctgg cagactgacc cgggcaaagt aaacaatcaa ccgacgcagc 6420 
aacaacaata gcaggtcctg caaagtgatt ttgaattatg aaagagcaaa tctaagacgg 6480 
catttacatc ctcgtagccc cacaaacccc atcccatagg tcctgattcc attttggtaa 6540 
acgcagtcaa gttgcatgat ttacttccgg agagccgggg tcagtctgtg gaatatttat 6600 
ggcccaacct tgtgctctgc cagaataaat gcgcatttaa aatccttatt aatttttgtg 6660 
cgagtcggaa tttatgatgc ctccctgccg tttcagtgct tcccccacga ggatacgccc 6720 
gagcacaaat caggctgttg tcacatcaat gctaacgatt tcgaaatcac aacgcttecg 6780 
attcccccgg agccgccatc atcccgtcct ccgggcggag caggaggagg aggtggaggt 6840 
ccatccagtg gaggtcccag ctctggcggc ggagtggcca gttccattgg cggctccgtc 6900 
cttggaatct gcggccgctc gcgcacaaac agcctcagcc gcagcgcctc ccagtacatc 6960 
cgagggacaa tgggcggagg agtccacaat caccaggcca cggaaacctt ctccaccacg 7020 
aggcctgaca ttgagggcct ctccgagaag tactggggcg agatcctgga gagaaccgtc 7080 
agctcgggca acctgaatgc catgcagaag cacctgccgc cccacctgcc ctacgcccac 7140 
cacctggtcc accagatcca ccagccgcag cagcaccacc aacagcagac cacctccttc 7200 
agcaaggcca gcgatctgca gctgaacctg aacctcagcc aagcagccac ggctgccgag 7260 
ctgggcaagt tctccaactc ggagccgaag ctgtgcgagc acctctttca cgacgtaaac 7320 
tgcgccaaga gcaaggccgc caacggggat ggggagtcct ccggcaagct ggcctgcggt 7380 
cgcagcattc cggacatcta gtacgccgag gacgtgatcc tggccaagaa ccaactgctg 7440 
gccaggcagg ccaagtgctc gcaccatccg ctgcaccagc aggccaagac gcgtctccat 7500 
tcccacacgg gcagcctgcg cttcagccgg attcgcgctg cctgccacag tccgccgccg 7560 
gagaagtccg cctttccaat ggccgagtat cgcaagtagg atcacagagg gcactgggat 7620 
caaggatgcc gggatacggt actccaggat gtccaggata tggtataacc atgcacagaa 7680 
tagggagtga gagcgagcga gtatctataa gattccatag gtgtatgtgc gtgtgagtgt 7740 
ctatataata ccgcatgatc aaaccataat atgtgtgttt tctattaggc ttaagatcta 7800 
ggaggtcctt tggttgggtt ttcaatttaa agtctctgag attcgcaaaa tacccatggc 7860 
ctgtagaaag cgccctgaag cctaactaaa agaatatgaa agatagcacc tatcctcaga 7920 
tattccactc tttatataaa gtaatctgtg ttggttataa cctgaatgat tgctgtccat 7980 
ctgagaactt tcaataccat tagctagact ttctttacta tacatggatc tggggaatcg 8040 
gaattagccg agttaagcag aagttaacca tagtaagtgt tcattcaatg tttgtgatac 8100 
cctaagtttc ccagcccaac atacatacct accagtaaca tataatataa tcctagttgt 8160 
tgtgcgtgcg tcggctaaat taaacgtagg atatatggaa acagagatat cagactttcc 8220 
tatcagaacc cgctcgacat tcggaagccc ttcgccaaat tctccaccac ttagggacta 8280 . 
aggatctacg tagactctga tatcggcgga ccttaaagcg ttgtggttaa tcatagttgt 8340 
agttccatag ctcgagtcgt atacaccaaa ttactcactg ttcgacgaat caaaatcaaa 8400 
tcctaacttt gtgattgtca agaatttaag tcggttgatg tggacattgc tgttaaatta 84 60 
atttttattt gtagctaagt aattgactag ttaaagccaa ataaatttac cgtttaagta 8520 
agtctaactt tagttaaata aacgagcggt gtgtaacaaa tgtatttcat aggcagagga 8580 
ttcaaatcga aataaataa 8599 



<210> 152 
<211> 2121 
<212> DNA 
<213> Drosophila 

<400> 152 

atggatctaa agactggctc tctcctggta gcacccgccg tgaagaactc cagctgctcg 60 

catccgcgct actccggcca caatttcatc gcccacatag ggattgcgga gaccatcgag 120 

gcggtgctca tcctggtcct cacactgggc gtcatcgggg ccaactgcct ggtcatcttc 180 

gtcatcaaca accgccgcta tgcggcctac atacaccaac agccccgtta tctgctcacc 240 

tcgctggccc tgaatgacct gacaattggt ctgctaatta caccattcgg tctaatgccg 300 

gcattgttcc actgctggcc atacggagag atcttctgtc agatacaggc cctgctdcga 360 

ggggcgcttt cccagcagag cgccgtcatc ctggtttgca tggcagtcga tagatacatg 420 

tgtgccctgc atccgcgccg ctactaccag cactccagca agaagggctg cgtggccata 480 

ctgagcttaa cctggattat cagcctgacg gtgtttggct ttctggtgct gcccaaaggt 540 

tactacttca acaatacggg tctaatggcc tgcgagccct tctacagcaa gccatcctac 600 

aggatactgt ccacgtgcgc cctctacttt cccacgacga tggtgctgat gtactgctac 660 

ggctccagct tccacatgag ccgcttccgg ctaaacgatc cgaccatgcc cctcacggcg 720 

gccgcccacc acccccatcc gcacccacat cccaccgccg cccagcagct gcagatgcac 780 
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cagcaccagc aacatcacca gcaggcggga atgcactcgc acctctacca cggccactcg 840 

catcacccca gccaccccag tcacccaaac caccccaacc accacggcca cccccaccat 900 

catggaccgc ccgttatggg ccatctcagc atggccatga gcatgggact ggccggcatg 960 

ccgaacatga caaacaagat taccaaaaag gtgagccagc tgaaccggat tgttcccatc 1020 

caagagaaga actccagcgg atccacctcg cgctcgatgg ccgccatttc actgggcttc 1080 

atagtgatgg tcactccgtg gacgattcag gaaatagtca ccgcctgcac gggatccaag 1140 

gcaagtgtcc tgtacaaagg attattgtgt gaccgctttc tgccgccgtt cctcgatttc 1200 

ctggtcacct ggacggcgct gagcaatagt ctgtggaacc cgttcatgta ctggctgctg 1260 

aactccgatt tccgccgcat gagtcgccag ctgatgccga acaaatgctt cccccacgag 1320 

gatacgcccg agcacaaatc aggctgttgt cacatcaatg ctaacgattt cgaaatcaca 1380 

acgcttccga ttcccccgga gccgccatca tcccgtcctc cgggcggagc aggaggagga 1440 

ggtggaggtc catccagtgg aggtcccagc tctggcggcg gagtggccag ttccattggc 1500 

ggctccgtcc ttggaatctg cggccgctcg cgcacaaaca gcctcagccg cagcgcctcc 1560 

cagtacatcc gagggacaat gggcggagga gtccacaatc accaggccac ggaaaccttc 1620 

tccaccacga ggcctgacat tgagggcctc tccgagaagt actggggcga gatcctggag 1680 

agaaccgtca gctcgggcaa cctgaatgcc atgcagaagc acctgccgcc ccacctgccc 1740 

tacgcccacc acctggtcca ccagatccac cagccgcagc agcaccacca acagcagacc 1800 

acctccttca gcaaggccag cgatctgcag ctgaacctga acctcagcca agcagccacg 1860 

gctgccgagc tgggcaagtt ctccaactcg gagccgaagc tgtgcgagca cctctttcac 1920 

gactacgccg aggacgtgat cctggccaag aaccaactgc tggccaggca ggccaagtgc 1980 

tcgcaccatc cgctgcacca gcaggccaag acgcgtctcc attcccacac gggcagcctg 2040 

cgcttcagcc ggattcgcgc tgcctgccac agtccgccgc cggagaagtc cgcctttcca 2100 

atggccgagt atcgcaagta g 2121 



<210> 153 

<211> 706 

<212> PRT 

<213> Drosophila 



<400> 153 



Met 


Asp 


Leu 


Lys 


1 








Ser 


Ser 


Cys 


Ser 








20 


lie 


Gly 


He 


Ala 






35 




Leu 


Gly 


Val 


He 




50 






Arg 


Arg 


Tyr 


Ala 


65 








Ser 


Leu 


Ala 


Leu 


Gly 


Leu 


Met 


Pro 








100 


Cys 


Gin 


He 


Gin 






115 




Val 


lie 


Leu 


Val 




130 






Pro Arg 


Arg 


Tyr 


145 








Leu 


Ser 


Leu 


Thr 


Leu 


Pro 


Lys 


Gly 








180 


Pro 


Phe 


Tyr 


Ser 






195 




Tyr 


Phe 


Pro 


Thr 




210 






His 


Met 


Ser 


Arg 



225 



Thr Gly Ser Leu 
5 

His Pro Arg Tyr 

Glu Thr He Glu 
40 

Gly Ala Asn Cys 
55 

Ala Tyr He His 
70 

Asn Asp Leu Thr 
85 

Ala Leu Phe His 

Ala Leu Leu Arg 
120 

Cys Met Ala Val 
135 

Tyr Gin His Ser 
150 

Trp He He Ser 
165 

Tyr Tyr Phe Asn 

Lys Pro Ser Tyr 
200 

Thr Met Val Leu 
215 

Phe Arg Leu Asn 
230 



Leu Val Ala Pro 
10 

Ser Gly His Asn 
25 

Ala Val Leu He 

Leu Val He Phe 
60 

Gin Gin Pro Arg 
75 

He Gly Leu Leu 
90 

Cys Trp Pro Tyr 
105 

Gly Ala Leu Ser 

Asp Arg Tyr Met 
140 

Ser Lys Lys Gly 
155 

Leu Thr Val Phe 
170 

Asn Thr Gly Leu 
185 

Arg He Leu Ser 

Met Tyr Cys Tyr 
220 

Asp Pro Thr Met 
235 



Ala Val Lys Asn 
15 

Phe He Ala His 
30 

Leu Val Leu Thr 
45 

Val He Asn Asn 

Tyr Leu Leu Thr 
80 

He Thr Pro Phe 
95 

Gly Glu He Phe 
110 

Gin Gin Ser Ala 
125 

Cys Ala Leu His 

Cys Val Ala He 
160 

Gly Phe Leu Val 
175 

Met Ala Cys Glu 
190 

Thr Cys Ala Leu 
205 

Gly Ser Ser Phe 

Pro Leu Thr Ala 
240 
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Ala Ala His His 

Leu Gin Met His 
260 

Ser His Leu Tyr 
275 

Pro Asn His Pro 
290 

Val Met Gly His 
305 

Pro Asn Met Thr 

lie Val Pro He 
340 

Met Ala Ala He 
355 

He Gin Glu He 
370 

Tyr Lys Gly Leu 
385 

Leu Val Thr Trp 

Tyr Trp Leu Leu 
420 

Pro Asn Lys Cys 
435 

Cys Cys His He 
450 

Pro Pro Glu Pro 
4 65 

Gly Gly Gly Pro 

Ser Ser lie Gly 
500 

Asn Ser Leu Ser 
515 

Gly Gly Val His 
530 

Pro Asp lie Glu 
545 

Arg Thr Val Ser 

Pro His Leu Pro 
580 

Gin Gin His His 
595 

Leu Gin Leu Asn 
610 

Gly Lys Phe Ser 
625 

Asp Tyr Ala Glu 

Gin Ala Lys Cys 
660 

Leu His Ser His 
675 

Cys His Ser Pro 

690 
Arg Lys 
705 



Pro His Pro His 
245 

Gin His Gin Gin 

His Gly His Ser 
280 

Asn His His Gly 
295 

Leu Ser Met Ala 
310 

Asn Lys He Thr 
325 

Gin Glu Lys Asn 

Ser Leu Gly Phe 
360 

Val Thr Ala Cys 
375 

Leu Cys Asp Arg 
390 

Thr Ala Leu Ser 
405 

Asn Ser Asp Phe 

Phe Pro His Glu 
440 

Asn Ala Asn Asp 
455 

Pro Ser Ser Arg 
470 

Ser Ser Gly Gly 
485 

Gly Ser Val Leu 

Arg Ser Ala Ser 
520 

Asn His Gin Ala 
535 

Gly Leu Ser Glu 
550 

Ser Gly Asn Leu 
565 

Tyr Ala His His 

Gin Gin Gin Thr 
600 

Leu Asn Leu Ser 
615 

Asn Ser Glu Pro 
630 

Asp Val He Leu 
645 

Ser His His Pro 

Thr Gly Ser Leu 
680 

Pro Pro Glu Lys 
695 



Pro 


His 


Pro 


Thr 




250 






His 


His 


Gin 


Gin 


265 








His 


His 


Pro 


Ser 


His 


Pro 


His 


His 








300 


Met 


Ser 


Met 


Gly 






315 




Lys 


Lys 


Val 


Ser 




330 






Ser 


Ser 


Gly 


Ser 


345 








He 


Val 


Met 


Val 


Thr 


Gly 


Ser 


Lys 








380 


Phe 


Leu 


Pro 


Pro 






395 




Asn 


Ser 


Leu 


Trp 




410 






Arg 


Arg 


Met 


Ser 


425 








Asp 


Thr 


Pro 


Glu 


Phe 


Glu 


He 


Thr 








460 


Pro 


Pro 


Gly 


Gly 






475 




Pro 


Ser 


Ser 


Gly 




490 






Gly 


He 


Cys 


Gly 


505 








Gin 


Tyr 


He 


Arg 


Thr 


Glu 


Thr 


Phe 








540 


Lys 


Tyr 


Trp 


Gly 






555 




Asn 


Ala 


Met 


Gin 




570 






Leu 


Val 


His 


Gin 


585 








Thr 


Ser 


Phe 


Ser 


Gin 


Ala 


Ala 


Thr 








620 


Lys 


Leu 


Cys 


Glu 






635 




Ala 


Lys 


Asn 


Gin 




650 






Leu 


His 


Gin 


Gin 


665 








Arg 


Phe 


Ser 


Arg 


Ser 


Ala 


Phe 


Pro 



700 



Ala Ala Gin Gin 
255 

Ala Gly Met His 
270 

His Pro Ser His 
285 

His Gly Pro Pro 

Leu Ala Gly Met 
320 

Gin Leu Asn Arg 
335 

Thr Ser Arg Ser 
350 

Thr Pro Trp Thr 
365 

Ala Ser Val Leu 

Phe Leu Asp Phe 
400 

Asn Pro Phe Met 
415 

Arg Gin Leu Met 
430 

His Lys Ser Gly 
445 

Thr Leu Pro He 

Ala Gly Gly Gly 
4 80 

Gly Gly Val Ala 
495 

Arg Ser Arg Thr 
510 

Gly Thr Met Gly 
525 

Ser Thr Thr Arg 

Glu He Leu Glu 
560 

Lys His Leu Pro 
575 

He His Gin Pro 
590 

Lys Ala Ser Asp 
605 

Ala Ala Glu Leu 

His Leu Phe His 
640 

Leu Leu Ala Arg 
655 

Ala Lys Thr Arg 
670 

He Arg Ala Ala 
685 

Met Ala Glu Tyr 
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<210> 154 
<211> 3332 
<212> DNA 
<213> Drosophila 



<400> 154 

gataaagccc 

tgcgtattac 

ttagtattgc 

tttctggtcc 

ctcactttat 

gtgccgcttt 

cggaaagcga 

gagcagaatt 

ggacatgcag 

tgaaatgctg 

actgcatacc 

taatatataa 

atattaataa 

tgtttcattt 

ttcattgcag 

tattaaaata 

caagtggaaa 

aggaggagga 

aaatggggca 

ctgtgcccat 

aacgcccaca 

cgtggagtcc 

tgtgcgaccc 

ggagtacgac 

cacgatgatc 

ctatgtggtg 

ggctatcgcg 

gggcaactgg 

ctccttcacc 

ccatcccata 

cgcctggatg 

gtcgagcaac 

cgactccacc 

cctggtgttc 

taaggagaag 

tgacatttcg 

aaatattatt 

attaattgtt 

tggttttttg 

gatttcacag 

taaaaggttt 

attattatag 

ggcagatcat 

cgctcctcaa 

cagctactcc 

gaagagcttc 

gctggagaac 

tggcaatgga 

gagaaacaac 

ggatgcagtg 

aaatccggcc 

ccacacggac 



aattacagaa 
ataataatga 
ggactcatcc 
tggcacgaac 
gaaaaagtta 
actgtctgct 
atcatttttt 
tcatatagtt 
gactaaggac 
atagcatgcg 
aactaaactg 
tttcataaat 
cattatttaa 
ggttgctgaa 
aaaccgcagc 
tttgccattt 
ttagcaggca 
ggaggaggac 
gctgcaagcc 
tcgggcatcc 
gagcagctgg 
gagtcgtatc 
cacctggacc 
gatgatggcc 
ctgtacgcac 
atgaggttct 
gatgagtgct 
cccttcggca 
tcctcgatct 
tcctcgcctc 
acatccgtgc 
ggcaatgtgt 
ttcattttgt 
tactgcctgg 
aagcgctctc 
attagttgtg 
attacattat 
ttaaattgat 
atttctccac 
gtaactaatt 
taaaaccata 
ctaaaacaaa 
ttgtagattc 
cgctgtgcat 
aactcggcea 
atgaaggcct 
agtttcttcc 
aaaggtggcg 
aatgctccga 
acctgtctcc 
accgtgctgg 
ttataagacc 



cctaatacat 
ataatttaag 
catccaactg 
aataaattga 
attgcgtgta 
tttttcttca 
cgcacgctga 
gattgcattg 
ttcgaagcga 
ctcaatattg 
aacacagtga 
ttgtcacacc 
tataattata 
aaccagtaac 
gtgcaacatg 
agctggttta 
gctactgtag 
gtttgggcgg 
acagaaacaa 
tgttatttgt 
cagtggcgcc 
catccataaa 
acaggaatcg 
cggactgttc 
tggtctgcat 
ccaagatgca 
tcctgatcgg 
actatatgtg 
tcctgttgat 
gctaccgaac 
tgctgatgct 
cctgcaacat 
actcgctggt 
tgatcaggaa 
acaggaaggt 
gaactaacta 
aaatatatat 
atatagaaag 
aggtcataag 
gtatacatta 
agctatttgc 
ttacgagatt 
gctttttatt 
ctcgtctgga 
tgaacccaat 
gcacgtgtgc 
ccaagtttgg 
cccagcgtgg 
tggccactac 
agccgccagt 
tggtcaatgc 
gggctccctc 



acacctaaaa 
tgatagttgt 
ttgactcatt 
gtttatatgt 
gttgttggct 
ttttccactc 
gcagtgcaaa 
tctgcaggaa 
atggtgtggg 
tcagtaaata 
gaataaacat 
aaaattcgga 
attatatata 
taactaatct 
tgacgtcaca 
gcatgcattg 
acgaatcgaa 
gaaagccatc 
cagcacaagg 
gctgacagct 
aaatggcacc 
tggcactcag 
accgacgcag 
gtacagctac 
cattggactc 
gaccgtaacc 
cattcccttc 
caaggcctac 
catgtcggcg 
gccctttgta 
gccggttatc 
cgagtggcca 
cttgggattc 
acttcacacc 
caccaagttg 
ctacctgacc 
ttttttttga 
aattttataa 
tgcgtacata 
cagtctaaaa 
ctactattat 
tgtggaaatt 
gactattttc 
gctggccgtc 
actgtacgcc 
tgcccgcaag 
caagggcagg 
ggcattaacc 
aacgacgacg 
acaccaggtg 
tgagaccaac 
gatgcccctg 



tgactaattg 
ttgaacttac 
tcaaattcta 
attacgcccc 
ttgctagatg 
gcctttcatt 
gaaagcaaac 
gatgttcgca 
ctggggcagg 
atacgcgctt 
aacagaaagt 
tttaaattaa 
aaaatatatt 
ttgtgggcta 
ggcataaagt 
ttgcattgac 
atggaaggtg 
atggaaggac 
accaatatcg 
atgacactaa 
acactgcatc 
aatgaaacaa 
cagaatggca 
aacttcatcc 
tttggcaaca 
aacatataca 
ctgctctaca 
atggtgagca 
gatcgctaca 
tccaagttgg 
ctttttgcca 
gacactcaga 
gccactccac 
gtgggaccga 
gtgctcacgg 
actcttgtgt 
cattttgaat 
tatttattca 
ttttgttggc 
acatttatct 
ccttttggat 
gacatgaact 
caggtggctt 
ttcctggcct 
tttttgagcg 
gatgtgaatg 
caatcggagc 
aagaagaagt 
acaaccacca 
ccagccgaga 
aactgtaaac 
gaaacggtgg 



acatccattg 
agaacacttc 
tttaattata 
ttcttgtggt 
gatttttatt 
tttcatccgc 
acaagcaaac 
cgatgatgca 
gatagtctca 
gttagagcca 
tgaatttgct 
aagtataatt 
tttatatgta 
tccaaatctc 
atggagagca 
tgccacataa 
gatggtggcg 
actcgacacc 
cgaccaacgg 
cgagcttaat 
aactggagtc 
tggtgaccag 
gtcactattt 
tgaagctcat 
ccctggtgat 
tactgaatct 
caatgcaggt 
cctcgatcac 
tagccgtttg 
tttcggcctt 
gcaccgtgca 
actcgcacac 
tgacttttat 
agcacaagtc 
taataatata 
tcagtttgat 
tcgactataa 
atacaattta 
ttccacactg 
ttgtgtactt 
tgatttttaa 
ggacttcttt 
tgatcagctc 
gcggatgcct 
ataacttcaa 
cccagttgca 
gtcttcttgg 
gcttggcgac 
caacgggcac 
tccaggtggg 
cgcccgtgct 
tcttcatcgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 
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gcggaggtga gaccatcagg ttgagctgga cctcgatacg gccatcgatt gccaggcaat 3180 

agcgaggcag ccggaatgct tgctataggt cgcctaaaat gatattaaca ttttatacca 3240 

atatggcaga gctttgcaaa aacgggtaaa cttaatccca tttttatatg atcgaatctc 3300 

agcaagaaga aatttttaaa aacccgaata aa 3332 



<210> 155 
<211> 1332 
<212> DNA 
<213> Drosophila 



<400> 155 

atggaaggtg 

atggaaggac 

accaatatcg 

atgacactaa 

acactgcatc 

aatgaaacaa 

cagaatggca 

aacttcatcc 

tttggcaaca 

aacatataca 

ctgctctaca 

atggtgagca 

gatcgctaca 

tccaagttgg 

ctttttgcca 

gacactcaga 

gccactccac 

gtgggaccga 

gtgctcacgg 

actcttgtgt 

cattttgaat 

tatttattca 

ttttgttggc 



gatggtggcg 
actcgacacc 
cgaccaacgg 
cgagcttaat 
aactggagtc 
tggtgaccag 
gtcactattt 
tgaagctcat 
ccctggtgat 
tactgaatct 
caatgcaggt 
cctcgatcac 
tagccgtttg 
tttcggcctt 
gcaccgtgca 
actcgcacac 
tgacttttat 
agcacaagtc 
taataatata 
tcagtttgat 
tcgactataa 
atacaattta 
tt 



aggaggagga 
aaatggggca 
ctgtgcccat 
aacgcccaca 
cgtggagtcc 
tgtgcgaccc 
ggagtacgac 
cacgatgatc 
ctatgtggtg 
ggctatcgcg 
gggcaactgg 
ctccttcacc 
ccatcccata 
cgcctggatg 
gtcgagcaac 
cgactccacc 
cctggtgttc 
taaggagaag 
tgacatttcg 
aaatattatt 
attaattgtt 
tggttttttg 



ggaggaggac 
gctgcaagcc 
tcgggcatcc 
gagcagctgg 
gagtcgtatc 
cacctggacc 
gatgatggcc 
ctgtacgcac 
atgaggttct 
gatgagtgct 
cccttcggca 
tcctcgatct 
tcctcgcctc 
acatccgtgc 
ggcaatgtgt 
ttcattttgt 
tactgcctgg 
aagcgctctc 
attagttgtg 
attacattat 
ttaaattgat 
atttctccac 



gtttgggcgg 
acagaaacaa 
tgttatttgt 
cagtggcgcc 
catccataaa 
acaggaatcg 
cggactgttc 
tggtctgcat 
ccaagatgca 
tcctgatcgg 
actatatgtg 
tcctgttgat 
gctaccgaac 
tgctgatgct 
cctgcaacat 
actcgctggt 
tgatcaggaa 
acaggaaggt 
gaactaacta 
aaatatatat 
atatagaaag 
aggtcataag 



gaaagccatc 
cagcacaagg 
gctgacagct 
aaatggcacc 
tggcactcag 
accgacgcag 
gtacagctac 
cattggactc 
gaccgtaacc 
cattcccttc 
caaggcctac 
catgtcggcg 
gccctttgta 
gccggttatc 
cgagtggcca 
cttgggattc 
acttcacacc 
caccaagttg 
ctacctgacc 
ttttttttga 
aattttataa 
tgcgtacata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1332 



<210> 156 

<211> 393 

<212> PRT 

<213> Drosophila 



<400> 156 



Met 


Glu 


Gly 


Gly 


Trp 


Trp 


Arg 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Arg 


Leu 


Gly 


1 








5 










10 










15 




Gly 


Lys 


Ala 


He 


Met 


Glu 


Gly 


His 


Ser 


Thr 


Pro 


Asn 


Gly 


Ala 


Ala 


Ala 








20 










25 










30 






Ser 


His 


Arg 


Asn 


Asn 


Ser 


Thr 


Arg 


Thr 


Asn 


He 


Ala 


Thr 


Asn 


Gly 


Cys 






35 










40 










45 








Ala 


His 


Ser 


Gly 


He 


Leu 


Leu 


Phe 


Val 


Leu 


Thr 


Ala 


Met 


Thr 


Leu 


Thr 




50 










55 










60 










Ser 


Leu 


He 


Thr 


Pro 


Thr 


Glu 


Gin 


Leu 


Ala 


Val 


Ala 


Pro 


Asn 


Gly 


Thr 


65 










70 










75 










80 


Thr 


Leu 


His 


Gin 


Leu 


Glu 


Ser 


Val 


Glu 


Ser 


Glu 


Ser 


Tyr 


Pro 


Ser 


He 










85 










90 










95 




Asn 


Gly 


Thr 


Gin 


Asn 


Glu 


Thr 


Met 


Val 


Thr 


Ser 


Val 


Arg 


Pro 


His 


Leu 








100 










105 










110 






Asp 


His 


Arg 


Asn 


Arg 


Pro 


Thr 


Gin 


Gin 


Asn 


Gly 


Ser 


His 


Tyr 


Leu 


Glu 






115 










120 










125 








Tyr 


Asp 


Asp 


Asp 


Gly 


Pro 


Asp 


Cys 


Ser 


Tyr 


Ser 


Tyr 


Asn 


Phe 


He 


Leu 




130 










135 










140 










Lys 


Leu 


He 


Thr 


Met 


He 


Leu 


Tyr 


Ala 


Leu 


Val 


Cys 


He 


He 


Gly 


Leu 
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1 H J 










150 










155 










160 


pho 
cllc 


Ply 


rto I 1 


Thr 


l»eu 


Ua 1 
v ax 


T 1 p 

IXC 


T\/T" 


Val 


Wo 1 
V cl 1 


Met 


Ar g 


Phe 


Ser 


Lys 


Met 




















17 0 
x t \J 










1 7 S 
i / j 




bin 


Thr 


Us 1 
v a J. 


Thr 


.Icn 
O i 1 


IXC 


iyr 


lie 


Leu 


Ben 
tt-D 1 1 


Leu 


Ala 


lie 


Ala 


Ben 


Glu 








180 










185 










190 






Cys 


Phe 


Leu 


Tip 

lie 


"oiy 


Tip 


tr L O 


Php 


Leu 


Le u 


i yr 


Th r 
1 111. 


Met- 
ric L 


«1 1 1 


V cl 1 


val y 
















z uu 










one: 
lUj 








Asn 


Trp 


Pro 


rne 




Asn 


Tyr 


net 


L-y s 


Lys 


Q 1 a 

Hia 


iyr 


Mia -t- 

ne L 


V cl 1 


OCl 


I 111 




210 










£ 1 0 




















Ser 


He 


inr 


ca- 
ber 


rne 


i nr 


Ser 


c ^-s >- 

ber 


lie 


rne 


Leu 


Leu 


Tie* 

lie 


we c 


C ^-i *- 

oer 


Hid 


225 










£ .3U 










6 J J 












Asp Arg 


Tyr 


lie 


Ala 


val 


Cys 


His 


Pro 


lie 


O V- 

ber 


ber 


Pro 


Arg 


Tyr 


Arg 




















OCA 










ncc 
t J J 




Thr 


Pro 


Phe 


val 


Ser 


Lys 


Leu 


val 


Ser 


Ala 


Phe 


Ala 


Trp 


Met 


I nr 


O V 

ber 








o el r\ 










O £ C 
ZOO 










o i n 






Val 


Leu 


Leu 


Met 


Leu 


Pro 


ir _ i 
val 


lie 


Leu 


ni, „ 

Pne 


Ala 


Ser 


Thr 


val 


bin 


ber 






O "7 C 
£.10 










o q n 
z o U 


















Ser 


Asn 


Gly 


Asn 


val 


Ser 


Cys 


Asn 


Tl- 
116 


blU 


Trp 


Pro 


Asp 


1 nr 


bin 


Asn 




290 










one 










n n 










Ser 


His 


Thr 


Asp 


Ser 


Thr 


pne 


lie 


Leu 


Tyr 


ber 


Leu 


val 


Leu 


biy 


rne 


305 




















Ol C 

jl J 










*3 o n 


Ala 


Thr 


Pro 


Leu 


Thr 


Phe 


Tl A 

lie 


Leu 


vai 


rne 


Tyr 


Cys 


Leu 


val 


lie 


Arg 










o o c 










Tin 










J Jo 




Lys 


Leu 


His 


Thr 


Val 


Gly 


Pro 


Lys 


His 


Lys 


Ser 


Lys 


Glu 


Lys 


Lys 


Arg 








340 










345 










350 






Ser 


His 


Arg 


Lys 


Val 


Thr 


Lys 


Leu 


Val 


Leu 


Thr 


Val 


He 


He 


Tyr 


Asp 






355 










360 










365 








He 


Ser 


He 


Ser 


Cys 


Gly 


Thr 


Asn 


Tyr 


Tyr 


Leu 


Thr 


Thr 


Leu 


Val 


Phe 




370 










375 










380 










Ser 


Leu 


He 


Asn 


lie 


He 


He 


Thr 


Leu 

















385 390 



<210> 157 
<211> 5453 
<212> DNA 
<213> Drosophila 



<400> 157 

gataaaagca 

gagttcttca 

acttctccga 

ttacaattca 

tggcgatcgt 

aacaactcac 

ccaggcataa 

atgccaatga 

tgatgtttgc 

cataaactgc 

accggtcgac 

gggcgcatgg 

gaaatcgaaa 

ecaatgetaa 

tcattggttg 

tttttttaga 

gttcgctttg 

tctcgactgc 

ctcatcagga 

ggagecaaca 

tgcctcgaca 



ttacattttc 
tacaaagttt 
aaagagcega 
tcacatagcc 
tgeaatgeca 
ttgttttctt 
tcccgacaat 
tatgtgtatt 
actccggtgc 
gttaacgacg 
gtaegtctea 
gtgcgggaaa 
tggaaagaga 
tatcgaaaga 
gttgggttta 
acgatataac 
cagcacattt 
ggaggctgta 
ccatccccat 
tcctcgcaca 
aeggaatege 



gagtctgeae 
tagctgetea 
caegtaaatt 
cgtagttcct 
tgcaattcaa 
ttctgcgtgc 
cggtgcgatg 
gtatgtatgt 
agegggacta 
ggcactctgc 
ttgtgcgata 
teggaaateg 
tcaattttca 
caaagctccg 
actegtacaa 
tcgatataat 
actatgeggt 
tttccgccca 
ctgcaccgac 
gcctggcgac 
egggaaegta 



gategtatea 
tttttaatct 
ggaaaagggt 
cttcctgctg 
tgcccaattc 
atgtggtgct 
tgggtgatac 
gcatacatat 
taaataagee 
caacgagaat 
agctttcaat 
gaaacgaaaa 
attgeageca 
tgttctggag 
agtttgatgg 
gctggtgcgt 
cgatggcaaa 
gcgcttcacc 
ggceggaaca 
eggacgegtg 
tgtcctcgac 



tcagttgttt 
gttccttgtg 
tagtgeaact 
cacagaaaga 
aaaaatattt 
cctggccatt 
gagtgcgaat 
ggegtegtag 
gggtatttat 
aagatccggt 
taaatctgac 
egaaaaegga 
atgeatgeat 
gtcgtgttcg 
gtttgggatt 
aaataacaac 
atgaccctcc 
cgcctgctgc 
tttgaatcaa 
ccactactgc 
ggtgtcgcca 



gcaggacctt 
cagagattac 
gttgttgcaa 
aattggtatg 
gaaatgtgta 
tgecattteg 
gegaatgega 
cgtgtgggcg 
taggggcaat 
aaacaaaggc 
ccggataacg 
ageggaaate 
gtccatgggt 
ccatgcagac 
gttttttttt 
gatgttcatt 
tgtcgaacat 
gccagtccgg 
aatccatget 
acgatttcga 
gggtggccca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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attggccctg gagcccaccg tcatggacgc actgcccgat tcgga'cacgg aac'aggttct 1320 

cggtaatctc aacagcagcg cgccctggaa tctaacgctt gcgtcggcag cggccaccaa 1380 

tttcgagaac tgctcggcgc tttttgtcaa ttacacccta ccacaaacag gtacctaaag 1440 

cctaataact tttaaatcat attgcagtat tttttttttt ttttttgttg ccgggatagg 1500 

actctactgc aattggacct gggacacatt gctctgctgg ccacccactc cggctggagt 1560 

ccttgcacgg atgaattgtc ctggcggctt tcatggcgta gatacgcgca gtaagtaact 1620 

gaaatacact ctaaaaaaaa agcagccgaa aaagaacatg ctgagtacaa ttagcataat 1680 

taatcgttga agctgtttag ctgtttagca gccacaagaa gctttatgac ttctgccggc 1740 

ttattaggag tcaaaagaca tgtcctacgg tgtttatccc atccacacgg catatctggc 1800 

ggtcacaatc atacggatca ttagctggga aacggttcaa gtggtgcacg gcaaacaatt 1860 

acttgattcc gtattcatcg aaaggatgtc aatatttaca tgcatgactg ggggcgtcca 1920 

atcaaccgca ctcccagtcg gggccgcccc tccttggctt ccatcgccgg gggctgtcca 1980 

ggtgtttcat gtgcaccctc gagctccgcc agttgaacag ggtttaaaac accagccaat 2040 

cgattgctta attacgatcc taaatgcttg ctctacccct ccgataaatt aaacgttctg 2100 

caatggggaa ttgctattta ataaccatgg gcgatataca ctgagcgaaa agattctaag 2160 

tagctgactt accacccaat attttacaaa cttaaaatgc tcttccatta tcactagtga 2220 

atgtattatc tgcgcttgtg gtgcaaaatc taggcatatc gaactatatc tgttaagaaa 2280 

taattattcg agtgcattgt tgagctacgg cctcatttct ccgggccgtg tctagatttt 2340 

tagccccaat attttcaact ggacgaggtc ggtcgcccat tcgattggca taattgcact 2400 

tgtcagtcag tgtgtcaaag ctcgattact gtcattaatc caacccaacc tccctctttt 24 60 

tcggttcgca gaattcgcca tccgaaagtg tgagctggat ggtcgatggg gcagcaggcc 2520 

aaatgccacg gaggtgaatc cgccgggatg gacggactac gggccgtgtt acaagccgga 2580 

gattatccgt ctcatgcagc agatgggcag caaggacttc gatgtgagtt gaattctttc 2640 

cacactacat cataatacct gtttcagtta tagttaagag cattgcttaa atgccaacca 2700 

aaatcccaaa tcaagggtat ggttttgaaa ttacctaata gggttccaag ttttttagac 2760 

accctaaaaa gtgatttcaa aggcctgatt acatttgttg atagtggttt atagtagtag 2820 

ttttaattcc atttcttcca actggtttgc gatatccctt aggcctacat agacattgcc 2880 

aggaggactc gaaccctgga gatcgtgggc ctctgcctct ccctgttcgc ccttatagtt 2940 

tccctgctga tcttctgcac atttcgctcg ctgcgaaaca atcgcaccaa gatccacaag 3000 

aatcttttcg tcgccatggt gctgcaggtg atcattcgcc tgaccttgta tctcgaccaa 3060 

ttccggcggg gaaacaagga ggcggccacc aacacgagtc tctctgtcat tgagaacacg 3120 

gtaggcttaa tattcccgca taaggataag gtataaggct tatgttcgat attatatgtt 3180 

atgttctaca gccctatttg tgcgaagcat cctatgtact tctggagtac gctcgtaccg 3240 

ccatgttcat gtggatgttc atcgagggcc tttacctgca caacatggtc accgtggccg 3300 

ttttccaggg cagctttccc ctcaagttct tctcgcgact cggctggtgt gtgcccattc 3360 

tgatgaccac cgtgtgggcg agatgcacgg tcatgtatat ggacacctcg ctgggcgaat 3420 

gcttgtggaa ctataatctc acgccctact actggatcct cgaggggcca cgactagcgg 3480 

tcatactggt gggtaatatc aactgttgat cgctttaatc acctattcac ccaactgaaa 3540 

acatccgcat gcagctaaac ttctgtttcc tggtgaacat tatccgagtg ctggtaatga 3600 

agctgcgtca atcgcaggcc agcgatatag aacagactcg caaggcagtt agagcggcta 3660 

tagtcctact accacttttg ggtataacca atctcctgca ccagctggct cctctgaaaa 3720 

cggccacgaa cttcgcggtc tggtcgtatg gcacccactt tctcacctcg tttcagggat 3780 

tttttatagc gctaatttac tgctttctaa atggcgaggt aagacatcaa gtcactcagt 3840 

tgtccatttg cctaaaaccc tcactatttc caaccaaaat atatattttt taggttcgtg 3900 

ccgtgctact aaagagtctg gccacccagc tgtcggtgcg aggtcatccg gaatgggcgc 3960 

cgaaaagggc atctatgtac tcgggtgctt ataacacggc gccggatacg gatgcagtgc 4020 

agcctgcagg agatccatcg gccactggaa agcggtaagc atccgctttg tgcacacccc 4080 

accttg.aaac tccttacatc aatttaaaac tcccagaata tcaccgccga ataaaaggct 4140 

gaatggaaga aagccgagca gtgccagcat tgtgatgatt cacgagcctc aacagcgcca 4200 

gcgactgatg ccccggctgc aaaacaaggc gcgggaaaag ggcaaggacc gggtggagaa 4260 

gacggatgcg gaagcggagc cggatccgac catctcccac attcacagca aggaggcggg 4320 

cagcgcgaga tcgcgaactc gcggctccaa gtggataatg ggcatctgct tccggggtca 4380 

aaaggtacta agagtaccgt cagcgtcatc cgtgccaccc gagtcagttg tatttgagtt 4440 

gtcagagcag tagaccaaaa cctagcctaa ctaatagatg tagccatagc cgtagccaaa 4500 

tagccgtaga agtagccaac gccatgtagc ccaccaccac agtagccagc acccacacaa 4560 

tcggaccgga aatgatccca aagataaccc agaacccgac tattcctgca tctcccatcg 4 620 

acccaccaag caagccagct ggaaacagca catatgtaaa ataccccaac aaaatatcgc 4680 

agttcttata cccaagatca ttttgaacag aaagtaagct cattgttttg taattcaaat 4740 

taaaccaata gacatttaat ttaaatagtt attttcgctt gccttcggtt cttgctccac 4800 

tgtacttatg taaatacata cataccttta tttgtgtaag ccatatccaa accctttcca 4860 
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aaatctagaa 
tacaagtttt 
tggacacgca 
caaaacttca 
tatttaagtg 
cattagaact 
aatcaactgt 
acttgtattt 
tcacactgcc 
gctttgttgt 



atctcctatc 
acaaaatatt 
aatacattca 
gcggcatgtt 
aattaacaca 
tattacataa 
tgaacttgct 
acattttctt 
agctacaatc 
gcaacaaaaa 



ctttcgaaat 
tttgatgaca 
aaaacactcg 
cttttacttc 
aacttgcggt 
cataattgaa 
gttcatcgga 
taccactttg 
gatcacgttc 
ctttgccaca 



cggctaatcc 
ataaatttcc 
cacacggttc 
tactattcat 
aactgcgcag 
aagtgacatt 
ttccgttatt 
attagcttac 
actgcaactg 
atcgatattt 



tacgaaaatt 
aatagcgtca 
ttactcgatg 
atttaaatat 
tgtggccagg 
gtttaaagga 
aagttttaag 
actttccgtt 
ccttgaattt 
aactaatggg 



caaatttccc 
ccttctgttc 
aatttttcag 
tagattgaaa 
ctttgtgagc 
actcaccttt 
atttgggttt 
cctgagctgg 
tgctagctgc 
gtc 



4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5453 



<210> 158 
<211> 1902 
<212> DNA 
<213> Drosophila 

<400> 158 

atgaccctcc tgtcgaacat tctcgactgc ggaggctgta tttccgccca gcgcttcacc 60 

cgcctgctgc gccagtccgg ctcatcagga ccatccccat ctgcaccgac ggccggaaca 120 

tttgaatcaa aatccatgct ggagccaaca tcctcgcaca gcctggcgac cggacgcgtg 180 

ccactactgc acgatttcga tgcctcgaca acggaatcgc cgggaacgta tgtcctcgac 24 0 

ggtgtcgcca gggtggccca attggccctg gagcccaccg tcatggacgc actgcccgat 300 

tcggacacgg aacaggttct cggactctac tgcaattgga cctgggacac attgctctgc 360 

tggccaccca ctccggctgg agtccttgca cggatgaatt gtcctggcgg ctttcatggc 420 

gtagatacgc gcaaattcgc catccgaaag tgtgagctgg atggtcgatg gggcagcagg 4 80 

ccaaatgcca cggaggtgaa tccgccggga tggacggact acgggccgtg ttacaagccg 54 0 

gagattatcc gtctcatgca gcagatgggc agcaaggact tcgatgccta catagacatt 600 

gccaggagga ctcgaaccct ggagatcgtg ggcctctgcc tctccctgtt cgcccttata 660 

gtttccctgc tgatcttctg cacatttcgc tcgctgcgaa acaatcgcac caagatccac 720 

aagaatcttt tcgtcgccat ggtgctgcag gtgatcattc gcctgacctt gtatctcgac 780 

caattccggc ggggaaacaa ggaggcggcc accaacacga gtctctctgt cattgagaac 840 

acgccctatt tgtgcgaagc atcctatgta cttctggagt acgctcgtac cgccatgttc 900 

atgtggatgt tcatcgaggg cctttacctg cacaacatgg tcaccgtggc cgttttccag 960 

ggcagctttc ccctcaagtt cttctcgcga ctcggctggt gtgtgcccat tctgatgacc 1020 

accgtgtggg cgagatgcac ggtcatgtat atggacacct cgctgggcga atgcttgtgg 1080 

aactataatc tcacgcccta ctactggatc ctcgaggggc cacgactagc ggtcatactg 1140 

ctaaacttct gtttcctggt gaacattatc cgagtgctgg taatgaagct gcgtcaatcg 1200 

caggccagcg atatagaaca gactcgcaag gcagttagag cggctatagt cctactacca 1260 

cttttgggta taaccaatct cctgcaccag ctggctcctc tgaaaacggc cacgaacttc 1320 

gcggtctggt cgtatggcac ccactttctc acctcgtttc agggattttt tatagcgcta 1380 

atttactgct ttctaaatgg cgaggttcgt gccgtgctac taaagagtct ggccacccag 1440 

ctgtcggtgc gaggtcatcc ggaatgggcg ccgaaaaggg catctatgta ctcgggtgct 1500 

tataacacgg cgccggatac ggatgcagtg cagcctgcag gagatccatc ggccactgga 1560 

aagcgaatat caccgccgaa taaaaggctg aatggaagaa agccgagcag tgccagcatt 1620 

gtgatgattc acgagcctca acagcgccag cgactgatgc cccggctgca aaacaaggcg 1680 

cgggaaaagg gcaaggaccg ggtggagaag acggatgcgg aagcggagcc ggatccgacc 1740 

atctcccaca ttcacagcaa ggaggcgggc agcgcgagat cgcgaactcg cggctccaag 1800 

tggataatgg gcatctgctt ccggggtcaa aaggtactaa gagtaccgtc agcgtcatcc 1860 

gtgccacccg agtcagttgt atttgagttg tcagagcagt ag 1902 

<210> 159 
<211> 633 
<212> PRT 
<213> Drosophila 

<400> 159 

Met Thr Leu Leu Ser Asn lie Leu Asp Cys Gly Gly Cys lie Ser Ala 

15 10 15 
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Gin 


Arg 


Phe 


Thr 


Arg 


Leu 


Leu 


Arg 


Gin 


Ser 


Gly 


Ser 


Ser 


Gly 


Pro 


Ser 








20 










25 










30 






Pro 


Ser 


Ala 


Pro 


Thr 


Ala 


Gly 


Thr 


Phe 


Glu 


Ser 


Lys 


Ser 


Met 


Leu 


Glu 






35 










40 










45 








Pro 


Thr 


Ser 


Ser 


His 


Ser 


Leu 


Ala 


Thr 


Gly 


Arg 


Val 


Pro 


Leu 


Leu 


His 




50 










55 










60 










Asp 


Phe 


Asp 


Ala 


Ser 


Thr 


Thr 


Glu 


Ser 


Pro 


Gly 


Thr 


Tyr 


Val 


Leu 


Asp 


65 










70 










75 










80 


Gly 


Val 


Ala 


Arg 


Val 


Ala 


Gin 


Leu 


Ala 


Leu 


Glu 


Pro 


Thr 


Val 


Met 


Asp 










85 










90 










95 




Ala 


Leu 


Pro 


Asp 


Ser 


Asp 


Thr 


Glu 


Gin 


Val 


Leu 


Gly 


Leu 


Tyr 


Cys 


Asn 








100 










105 










110 






Trp 


Thr 


Trp 


Asp 


Thr 


Leu 


Leu 


Cys 


Trp 


Pro 


Pro 


Thr 


Pro 


Ala 


Gly 


Val 






115 










120 










125 








Leu 


Ala 


Arg 


Met 


Asn 


Cys 


Pro 


Gly 


Gly 


Phe 


His 


Gly 


Val 


Asp 


Thr 


Arg 




130 










135 










140 










Lys 


Phe 


Ala 


He 


Arg 


Lys 


Cys 


Glu 


Leu 


Asp 


Gly 


Arg 


Trp 


Gly 


Ser 


Arg 


145 










150 










155 










160 


Pro 


Asn 


Ala 


Thr 


Glu 


Val 


Asn 


Pro 


Pro 


Gly 


Trp 


Thr 


Asp 


Tyr 


Gly 


Pro 










165 










170 










175 




Cys 


Tyr 


Lys 


Pro 


Glu 


He 


He 


Arg 


Leu 


Met 


Gin 


Gin 


Met 


Gly 


Ser 


Lys 








180 










185 










190 






Asp 


Phe 


Asp 


Ala 


Tyr 


He 


Asp 


He 


Ala 


Arg 


Arg 


Thr 


Arg 


Thr 


Leu 


Glu 






195 










200 










205 








lie 


Val 


Gly 


Leu 


Cys 


Leu 


Ser 


Leu 


Phe 


Ala 


Leu 


He 


Val 


Ser 


Leu 


Leu 




210 










215 










220 










He 


Phe 


Cys 


Thr 


Phe 


Arg 


Ser 


Leu 


Arg 


Asn 


Asn 


Arg 


Thr 


Lys 


He 


His 


225 










230 










235 










240 


Lys 


Asn 


Leu 


Phe 


Val 


Ala 


Met 


Val 


Leu 


Gin 


Val 


He 


He 


Arg 


Leu 


Thr 








245 










250 










255 




Leu 


Tyr 


Leu 


Asp 


Gin 


Phe 


Arg 


Arg 


Gly 


Asn 


Lys 


Glu 


Ala 


Ala 


Thr 


Asn 








260 










265 










270 






Thr 


Ser 


Leu 


Ser 


Val 


He 


Glu 


Asn 


Thr 


Pro 


Tyr 


Leu 


Cys 


Glu 


Ala 


Ser 






275 










280 










285 








Tyr 


Val 


Leu 


Leu 


Glu 


Tyr 


Ala 


Arg 


Thr 


Ala 


Met 


Phe 


Met 


Trp 


Met 


Phe 


290 










295 










300 










He 


Glu 


Gly 


Leu 


Tyr 


Leu 


His 


Asn 


Met 


Val 


Thr 


Val 


Ala 


Val 


Phe 


Gin 


305 








310 










315 










320 


Gly 


Ser 


Phe 


Pro 


Leu 


Lys 


Phe 


Phe 


Ser 


Arg 


Leu 


Gly 


Trp 


Cys 


Val 


Pro 








325 










330 










335 




He 


Leu 


Met 


Thr 


Thr 


Val 


Trp 


Ala 


Arg 


Cys 


Thr 


Val 


Met 


Tyr 


Met 


Asp 








340 










345 










350 






Thr 


Ser 


Leu 


Gly 


Glu 


Cys 


Leu 


Trp 


Asn 


Tyr 


Asn 


Leu 


Thr 


Pro 


Tyr 


Tyr 






355 










360 










365 








Trp 


He 


Leu 


Glu 


Gly 


Pro 


Arg 


Leu 


Ala 


Val 


He 


Leu 


Leu 


Asn 


Phe 


Cys 


370 










375 










380 










Phe 


Leu 


Val 


Asn 


He 


He 


Arg 


Val 


Leu 


Val 


Met 


Lys 


Leu 


Arg 


Gin 


Ser 


385 










390 










395 










400 


Gin 


Ala 


Ser 


Asp 


He 


Glu 


Gin 


Thr 


Arg 


Lys 


Ala 


Val 


Arg 


Ala 


Ala 


He 










405 










410 










415 




Val 


Leu 


Leu 


Pro 


Leu 


Leu 


Gly 


He 


Thr 


Asn 


Leu 


Leu 


His 


Gin 


Leu 


Ala 








420 










425 










430 






Pro 


Leu 


Lys 


Thr 


Ala 


Thr 


Asn 


Phe 


Ala 


Val 


Trp 


Ser 


Tyr 


Gly 


Thr 


His 






435 










440 










445 








Phe 


Leu 


Thr 


Ser 


Phe 


Gin 


Gly 


Phe 


Phe 


He 


Ala 


Leu 


He 


Tyr 


Cys 


Phe 




450 










455 










460 










Leu 


Asn 


Gly 


Glu 


val 


Arg 


Ala 


Val 


Leu 


Leu 


Lys 


Ser 


Leu 


Ala 


Thr 


Gin 


465 










470 










475 










480 


Leu 


Ser 


Val 


Arg 


Gly 


His 


Pro 


Glu 


Trp 


Ala 


Pro 


Lys 


Arg 


Ala 


Ser 


Met 










485 










490 










495 
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T W T* 

lyr 


Co v- 


y 


Ala 


i yr 


As n 






nu 


Hen 


Thr 


Asp 


Al a 
rtx d 


Vol 


PI n 
bill 


clO 








500 










505 










510 






a 1 a 
HI a 


ft! VJ 

bi y 


Hen 




JCI 


nld 


l nr 


P 1 \r 

bi y 


Lys 




lie 


C O r- 


Pro 


D »- /~v 
irro 


Asn 


Lys 






Jl J 










jZU 










S? R 








Arg 


LeU 


As n 


b l y 


Arg 


Lys 


JT J. O 


OCX. 


Ot3i 


ri J. ct 




Tip 


val 


net 


lie 


nis 




530 










JJJ 










540 










P 1 n 


Pro 


bin 


bin 


Arg 


PI T"» 

bin 


Arg 


L€U 




Pro 


Arg 


Leu 


bin 


Asn 


Lys 


Aia 


R/ R 




















r r s 

3 0 Z> 










c en 
D ou 


Arg 




Lys 


bi y 


Lys 


Asp 


Arg 


Vdi 


Pin 


Lys 


i nr 


Asp 


nla 


p i i-i 

blU 


Ala 


pin 
blU 










jOj 










D / U 










CI tl 

D / 3 




Pro 


Asp 


Pro 


Thr 


lie 


Ser 


His 


lie 


His 


Ser 


Lys 


blU 


Ala 


Gly 


Ser 


Ala 








580 










585 










590 






Arg 


Ser 


Arg 


Thr 


Arg 


Gly 


Ser 


Lys 


Trp 


He 


Met 


Gly 


He 


Cys 


Phe 


Arg 






595 










600 










605 








Gly 


Gin 


Lys 


Val 


Leu 


Arg 


Val 


Pro 


Ser 


Ala 


Ser 


Ser 


Val 


Pro 


Pro 


Glu 




610 










615 










620 










Ser 


Val 


Val 


Phe 


Glu 


Leu 


Ser 


Glu 


Gin 
















625 










630 























<210> 160 
<211> 4385 
<212> DNA 
<213> Drosophila 



<400> 160 

gcttacaaaa 

gaatggaaaa 

acattgtccg 

tgcttgcagt 

eggttgetae 

etcegtattt 

ctcacaaaat 

acactaactc 

taacatgtaa 

atetgettet 

tttcttacca 

cctttgacct 

tttggggaat 

tttattcacc 

ectgeatatt 

ageggggege 

acatcccacc 

geegcattat 

tggacttagg 

tgtaatttac 

cctacggccg 

ggattaaact 

aacattttga 

tctctacata 

aaactggttt 

etttcaeggt 

tatcatagtg 

ggctgtcgcc 

cctcagtgta 

gtttcactcg 

ggcagtttgg 

gcgaactact 

ttggctgtac 

gaegattgee 



ccaaaaacca 
cagaggattt 
gcgatttatt 
aagaaagtaa 
taaaagcaaa 
tctcatttgt 
atgttctatt 
tgtttgtatt 
gecagaattt 
gccggcccac 
tttcggacaa 
gccctgaact 
ttcgaagctt 
taacgcctga 
tgagtgttgg 
actgagttgg 
accgactacc 
tgcggcaccg 
cctgggcaca 
ataatgattt 
tggtggcaaa 
gaacaaatac 
aaaatatatt 
agctctaatt 
taaatattta 
tacttctcgc 
ctgacccgac 
gatctggctg 
aggctgccgc 
gtatttcccc 
eggtacatag 
ctgatcacca 
ctagtcacag 
teagtgecat 



gattttgtgc 
tatcatcact 
geegtctaga 
cttttaatta 
agtgaagata 
ctaatggagt 
cccaagaaat 
aaagacaaat 
ttgttgcatg 
tagtegtatg 
atggtaatga 
tggacatcgt 
aaattctggt 
tgtttatttg 
aatttccgaa 
cagccggaca 
atetggcett 
gcatcgatga 
tagcccctgg 
gccagccggg 
aaggctaacg 
tegcataaat 
gagttcaatc 
actttaagaa 
tcccaatttt 
tgattgtctg 
gggagatgcg 
tgatgctgga 
gagaggagca 
aggtgetgea 
eggtaagcta 
tagccacggc 
ccattgccaa 
tgagtcaata 



ttcaaatcac 
tgaaggctga 
gacaaaaaac 
gaggttactg 
tatcctttga 
tatgtcacca 
tcttctctgt 
gcataagaat 
ttacacgttt 
cgtattattg 
catttcattg 
tetttgeatg 
aaaattcaaa 
caggtagcag 
aagggaattt 
ccgcggtcaa 
cattgacacc 
tttccacaca 
gtatgtctgc 
gggcgcataa 
aagtgtgcct 
ccataaagca 
taaccaacca 
tttaatcata 
ttttgtcttt 
tatcctggga 
ctcccccaca 
atacataccc 
gctcagctac 
caccatctcc 
teegcaaagg 
ctatgttgtc 
gttcctagag 
cattctggac 



aaacataacg 
caaaacattt 
actceggaaa 
cagctatgea 
atctcagcac 
agaaatgegt 
aagtaagtgt 
gecttaaatt 
tcccacaaat 
attaeggaac 
ttgattcgtt 
tggcatgcaa 
ccacaacact 
gtagcacttt 
aaatgaaacg 
teaaaeggag 
atggtcacga 
aagtgagtag 
atattaatta 
tgaaatcatg 
aatgtggatg 
gatgttaaag 
tgttgtttat 
cccaattttg 
ctccccaaca 
accattgega 
aatgecatae 
tatacggtgc 
agctgggcgt 
atttggctaa 
aategcatet 
tgtgtcctgg 
actttggatg 
tacaategge 



aaagaaatat 
ggcacaattt 
attgtcactt 
getgetactg 
aaattgeatt 
gagtgttaca 
aaaagtgtgt 
cgtctacgtg 
tgecaaaate 
gaggecttgt 
tgaaatcgta 
tgccattgta 
gaattcgttt 
cgccacgcag 
ccagaggagc 
tecttgeagg 
acatgtcgea 
aacgaggacg 
gttttttaag 
tgtgtaattg 
ggcccctttc 
gtcggagatg 
teggtttett 
tgctcaagtg 
gctacaaata 
ataccctaaa 
teaegggtet 
acgactatat 
gcttcatcaa 
eggtgacget 
ggtgtggaat 
tagtgtcacc 
ccaatggcaa 
aggatgaggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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gaccatgcag 
ggccaatgga 
aagcactgga 
gtatcacagc 
cagtgtcctg 
cggtgctctg 
tatgcagcca 
gctggagaag 
cttcctggtc 
tgccttcttt 
tcctccgctg 
gttggtgttt 
ataattgcac 
gtacattttg 
atactattaa 
aaatatttta 
ggtataaaat 
tggaacccga 
ggcgcttatt 
gagcacgttc 
gcacgacggc 
acggcagcgg 
gacgacccaa 
agccggcgga 
tccgcctgcc 
tttgcagccc 
tgggagtggc 
gaaggctaga 
caacaggcgc 
caaggcggtt 
cattgacgcc 
atccaagtac 
atgcgctgat 
ctgagacttt 
gcctgcaaca 
aatagttgtg 
attaattaag 
ttgaataaat 
gacactttgt 
tgcat 



gtcatgtcga 
actgcagtta 
gtaaccacat 
gcactggcgc 
atcaagctga 
ttggaggcca 
attgtcaatg 
gagaagcaga 
accgagtttc 
ctgcaatgtt 
tctgtaaccc 
aggtgggccc 
cacaaatttg 
tttatctcgt 
attgagagat 
aacatttaat 
ctaatcgccc 
catttatggc 
aattcgagca 
gcgctcctct 
gaagggaggg 
ttgctgcaca 
gtgacaaatg 
gcagctgcat 
accgatgctg 
agtccaaggg 
accaagtgca 
gaaacggagc 
agcagtgttc 
ttggtcagtg 
ctctggctgt 
ccactaatat 
tccattattt 
gctttgctgg 
ttttgtgacc 
taagtgctaa 
atatattgat 
gttttttcga 
tttaattaaa 



gtacaacgcc 
gcttgctaag 
cctcgtcgtt 
tgcgtgatcg 
taccctgctt 
aaaggaggag 
gaaaggtggt 
ccgatcgcac 
cacagggcat 
acctaaagct 
gtaattgctt 
agtttggcac 
aggcttcaat 
acacccaaaa 
ttttttcttt 
atgtagtttt 
ttgaacttta 
atttcgattt 
tcaacttcat 
tccgtccgcg 
tgggcggaag 
cggatgccgt 
tgtagcagga 
cggtggctct 
ctgtttccac 
ggactgcgat 
tctggcccac 
aatcctccga 
ttctaatggt 
aacagccgcc 
gagacgaccc 
ttaactaccg 
tcccgccagt 
aaaatgagtg 
acatttgata 
gcgcgtctct 
gaaataattc 
agtaaacttg 
cggaatttat 



agatgtttcc 
tctaaccaca 
gggtgagcgc 
gcagttcagg 
cgcactgacc 
gaagatcctg 
gattccgacg 
cacgaggatg 
tatgggtctg 
gagtaagtat 
tattggctat 
attcatcaat 
gaaggcctgg 
taaagtagtc 
aatttgagta 
tagctagcat 
tgaaagccct 
tttcgtttca 
cctgtactgt 
ctggctggac 
tggcggcctg 
tagcaagagc 
gagcagcggc 
ggccctggca 
taa'cgacatc 
atccagtggc 
gacggactgg 
acaggacata 
attactaagc 
gagtccagcg 
ctcacttcca 
tgtacagacc 
gttgtcttgg 
gattttcctt 
agtgatatac 
agatataaac 
ggaagctaag 
attcaattca 
attgaacttt 



tgggcgatac 
gtgatacccc 
aatgtgactg 
aatgcgacct 
attctgtctg 
gcctgtcatg 
caacccaaga 
cttctggcgg 
ctgaatgtgc 
tttaggatta 
gataattgac 
catacgcacc 
cattgacgtg 
ttacatttaa 
aaagtataaa 
atgttttttc 
gcccatagat 
ggtgacctta 
tcgatgagcc 
aaatggctgc 
ggcggctacg 
atggccatcg 
cgggcggcga 
gccactgatg 
agcctggtcg 
cagcatcgaa 
ctgagaaagc 
gagctgggca 
agctccgatg 
gacgaggatg 
ttttgtacgc 
aattattgag 
cgaatcgaga 
gaaaggactc 
ttgactaagt 
gaattagtca 
agattaaatt 
cttattgtaa 
gataaacgaa 



caagtgattc 
taaccacatt 
tctataagct 
tccttatata 
tgcggctcat 
cagccaacga 
gctgtaaact 
tactgctgct 
tcctgggcga 
agacacceac 
tgaatttatg 
gttggctgcc 
ttcaataagt 
aacttcgcag 
attgaaaaaa 
aagtgtagga 
ggcgtgttgg 
tggacatctt 
gccagttccg 
cgctgtcgca 
gcggatatgg 
atctcgggct 
tgtcagccgc 
ttgatggatg 
agaagctaca 
ggcggcgcag 
tgcgtaaCca 
agagctccat 
aggttaaagc 
tggaagacgc 
cctgtaaatt 
acaaattgtt 
gcccctttcg 
cctggaagca 
ttcccccctt 
tagtctaact 
tctttttata 
gtgttattgc 
tgcttttgaa 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4385 



<210> 161 
<211> 1389 
<212> DNA 
<213> Drosophila 



<400> 161 

atggtcacga 

aactacaaat 

aataccctaa 

ctcacgggtc 

cacgactata 

tgcttcatca 

acggtgacgc 

tggtgtggaa 

gtagtgtcac 

gccaatggca 

caggatgagg 

ccaataacca 

agcgcactgg 



acatgtcgca 
actttcacgg 
atatcatagt 
tggctgtcgc 
tcctcagtgt 
agtttcactc 
tggcagtttg 
tgcgaactac 
cttggctgta 
agacgattgc 
tgaccatgca 
catcctcgtc 
cgctgcgtga 



gccgcattat 
ttacttctcg 
gctgacccga 
cgatctggct 
aaggctgccg 
ggtatttccc 
gcggtacata 
tctgatcacc 
cctagtcaca 
ctcagtgcca 
ggtcatgtcg 
gttgggtgag 
tcggcagttc 



tgcggcaccg 
ctgattgtct 
cgggagatgc 
gtgatgctgg 
cgagaggagc 
caggtgctgc 
gcggtaagct 
atagccacgg 
gccattgcca 
ttgagtcaat 
agtacaacgc 
cgcaatgtga 
aggaatgcga 



gcatcgatga 
gtatcctggg 
gctcccccac 
aatacatacc 
agctcagcta 
acaccatctc 
atccgcaaag 
cctatgttgt 
agttcctaga 
acattctgga 
cagatgtttc 
ctgtctataa 
ccttccttat 



tttccacaca 
aaccattgcg 
aaatgccata 
ctatacggtg 
cagctgggcg 
catttggcta 
gaatcgcatc 
ctgtgtdrctg 
gactttggat 
ctacaatcgg 
ctgggcgata 
gctgtatcac 
atacagtgtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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ctgatcaagc 
ctgttggagg 
ccaattgtca 
aaggagaagc 
gtcaccgagt 
tttctgcaat 
agcatcaact 
ctcttccgtc 
agggtgggcg 
cacacggatg 
aatgtgtag 



tgataccctg 
ccaaaaggag 
atggaaaggt 
agaccgatcg 
ttccacaggg 
gttacctaaa 
tcatcctgta 
cgcgctggct 
gaagtggcgg 
ccgttagcaa 



cttcgcactg 
gaggaagatc 
ggtgattccg 
caccacgagg 
cattatgggt 
gctgagtgac 
ctgttcgatg 
ggacaaatgg 
cctgggcggc 
gagcatggcc 



accattctgt 
ctggcctgtc 
acgcaaccca 
atgcttctgg 
ctgctgaatg 
cttatggaca 
agccgccagt 
ctgc'cgctgt 
tacggcggat 
atcgatctcg 



ctgtgcggct 
atgcagccaa 
agagctgtaa 
cggtactgct 
tgctcctggg 
tcttggcgct 
tccggagcac 
cgcagcacga 
atggacggca 
ggctgacgac 



catcggtgct 
cgatatgcag 
actgctggag 
gctcttcctg 
cgatgccttc 
tattaattcg 
gttcgcgctc 
cggcgaaggg 
gcggttgctg 
ccaagtgaca 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1389 



<210> 162 
<211> 462 
<212> PRT 
<213> Drosophila 



<400> 162 



Met 


Val 


Thr 


Asn 


Met 


Ser 


Gin 


Pro 


His 


Tyr 


Cys 


Gly 


Thr 


Gly 


He 


Asp 


1 








5 










10 










15 




Asp 


Phe 


His 


Thr 


Asn 


Tyr 


Lys 


Tyr 


Phe 


His 


Gly 


Tyr 


Phe 


Ser 


Leu 


He 








20 










25 










30 






Val 


Cys 


He 


Leu 


Gly 


Thr 


He 


Ala 


Asn 


Thr 


Leu 


Asn 


He 


He 


Val 


Leu 






35 










40 










45 








Thr 


Arg 


Arg 


Glu 


Met 


Arg 


Ser 


Pro 


Thr 


Asn 


Ala 


He 


Leu 


Thr 


Gly 


Leu 




50 










55 










60 










Ala 


Val 


Ala 


Asp 


Leu 


Ala 


Val 


Met 


Leu 


Glu 


Tyr 


He 


Pro 


Tyr 


Thr 


Val 


65 










70 










75 










80 


His 


Asp 


Tyr 


He 


Leu 


Ser 


Val 


Arg 


Leu 


Pro 


Arg 


Glu 


Glu 


Gin 


Leu 


Ser 










85 










90 










95 




Tyr 


Ser 


Trp 


Ala 


Cys 


Phe 


He 


Lys 


Phe 


His 


Ser 


Val 


Phe 


Pro 


Gin 


Val 








100 










105 










110 






Leu 


His 


Thr 


He 


Ser 


He 


Trp 


Leu 


Thr 


Val 


Thr 


Leu 


Ala 


Val 


Trp 


Arg 






115 










120 










125 








Tyr 


lie 


Ala 


Val 


Ser 


Tyr 


Pro 


Gin 


Arg 


Asn 


Arg 


He 


Trp 


Cys 


Gly 


Met 




130 










135 










140 










Arg 


Thr 


Thr 


Leu 


He 


Thr 


He 


Ala 


Thr 


Ala 


Tyr 


Val 


Val 


Cys 


Val 


Leu 


145 










150 










155 










160 


Val 


Val 


Ser 


Pro 


Trp 


Leu 


Tyr 


Leu 


Val 


Thr 


Ala 


He 


Ala 


Lys 


Phe 


Leu 










165 










170 










175 




Glu 


Thr 


Leu 


Asp 


Ala 


Asn 


Gly 


Lys 


Thr 


He 


Ala 


Ser 


Val 


Pro 


Leu 


Ser 








180 










185 










190 






Gin 


Tyr 


He 


Leu 


Asp 


Tyr 


Asn 


Arg 


Gin 


Asp 


Glu 


Val 


Thr 


Met 


Gin 


Val 






195 










200 










205 








Met 


Ser 


Ser 


Thr 


Thr 


Pro 


Asp 


Val 


Ser 


Trp 


Ala 


He 


Pro 


He 


Thr 


Thr 




210 










215 










220 










Ser 


Ser 


Ser 


Leu 


Gly 


Glu 


Arg 


Asn 


Val 


Thr 


Val 


Tyr 


Lys 


Leu 


Tyr 


His 


225 










230 










235 










240 


Ser 


Ala 


Leu 


Ala 


Leu 


Arg 


Asp 


Arg 


Gin 


Phe 


Arg 


Asn 


Ala 


Thr 


Phe 


Leu 










245 










250 










255 




lie 


Tyr 


Ser 


Val 


Leu 


He 


Lys 


Leu 


He 


Pro 


Cys 


Phe 


Ala 


Leu 


Thr 


He 








260 










265 










270 






Leu 


Ser 


Val 


Arg 


Leu 


He 


Gly 


Ala 


Leu 


Leu 


Glu 


Ala 


Lys 


Arg 


Arg 


Arg 






275 










280 










285 








Lys 


He 


Leu 


Ala 


Cys 


His 


Ala 


Ala 


Asn 


Asp 


Met 


Gin 


Pro 


He 


Val 


Asn 




290 










295 










300 










Gly 


Lys 


Val 


Val 


He 


Pro 


Thr 


Gin 


Pro 


Lys 


Ser 


Cys 


Lys 


Leu 


Leu 


Glu 


305 










310 










315 










320 


Lys 


Glu 


Lys 


Gin 


Thr 


Asp 


Arg 


Thr 


Thr 


Arg 


Met 


Leu 


Leu 


Ala 


Val 


Leu 










325 










330 










335 





206 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 



PCT/US01/09341 



Leu 


Leu 


Phe 


Leu 


Val 


Thr 


Glu 


Phe 


Pro 


Gin 


Gly 


lie 


Met 


Gly 


Leu 


Leu 








340 










34 5 
















Asn 


Val 


Leu 


Leu 


Gly 


Asp 


Ala 


Phe 


Phe 


Leu 


Gin 


Cys 


Tyr 


Leu 


Lys 


Leu 






355 










360 










ODD 








Ser 


Asp 


Leu 


Met 


Asp 


lie 


Leu 


Ala 


Leu 


lie 


Asn 


Ser 


Ser 


lie 


Asn 


Phe 




370 










375 




















lie 


Leu 


Tyr 


Cys 


Ser 


Met 


Ser 


Arg 


Gin 


Phe 


Arg 


Ser 


Thr 


Phe 


Ala 


Leu 


385 










•5 n A 

390 










one 










/inn 


Leu 


Phe 


Arg 


Pro 


Arg 


Trp 


Leu 


Asp 


Lys 


Trp 


Leu 


Pro 


Leu 


Ser 


Gin 


His 










405 










410 










415 




Asp 


Gly 


Glu 


Gly 


Arg 


Val 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Gly 


Gly 


Tyr 


Gly 








420 










425 










430 






Gly 


Tyr 


Gly 


Arg 


Gin 


Arg 


Leu 


Leu 


His 


Thr 


Asp 


Ala 


Val 


Ser 


Lys 


Ser 






435 










440 










445 








Met 


Ala 


lie 


Asp 


Leu 


Gly 


Leu 


Thr 


Thr 


Gin 


Val 


Thr 


Asn 


Val 







450 455 460 



<210> 163 
<211> 4214 
<212> DNA 
<213> Drosophila 



<400> 163 

ataattctta 

ttattgaaaa 

gtaaatagcg 

agctcgtaac 

tgcttggata 

cgatatcatc 

acttttatct 

actaaaacgc 

gaagattaac 

ccattttcga 

tgcgactaat 

ccttcgaaga 

gtagagacta 

ttcttattaa 

attagaaatg 

ggcactatta 

aataccagag 

gcttgtaggg 

taagatgaac 

gcattaataa 

taaaatatta 

tgacgtgcaa 

aaattccact 

cgactatttc 

aactgcagga 

ctgcgtcttg 

gtagatggaa 

gggtccgtag 

aaattagact 

ccatgcctgg 

gaagaccaat 

cagccttttg 

agagcatgat 

tccagttgaa 

catgggattc 

gtagtagatg 



tgaaaaatat 
aatatgaaat 
ggcatcaatt 
cgcgaattat 
aaaaaatcgt 
gataggtgtt 
ggtgcatacc 
ttgaaacttg 
aagcccacaa 
tcgatatggc 
ccaggcgggc 
atgatttacc 
catgtggtat 
tccaagtgaa 
tggttcttct 
tttatatact 
ttcacagatg 
agagtctgtg 
accaacctga 
tactcagatc 
tagcttctat 
attcttgcca 
ccagaagtca 
tcggatccag 
atataaaaga 
acgattttca 
aagtacgcct 
tagtagatca 
ttgttgtact 
cttgaagtgc 
caccgccaga 
tttaaatttg 
gcagaagtag 
gtagacaaaa 
ttcaaggcca 
attaactggc 



aaaaatgggc 
ttacatttta 
gtgcatagtt 
atatgtatgt 
ccgagtcgcc 
tccctttcaa 
cgatttgact 
agaaattagg 
actgcacaaa 
aacaacattt 
ccccaagtta 
gttgatcata 
taagatgaaa 
ttataatatt 
taaagatata 
ctcccgtgtg 
agcaataatc 
aaggaaaata 
agattttaga 
tgtctgactt 
tcgaatattt 
aggagcagaa 
atgatgaaga 
accgcaaaca 
taaataggtc 
gttgagtgcc 
tgagacagca 
aaatacccca 
ttcttattca 
ttgggcagct 
actgccagag 
agcagaaagt 
gccagaaatc 
ttactcaacc 
agtagaccaa 
catagtggct 



tgattaaaat 
aaaagtgtta 
tgaaggtaag 
acatatacga 
ggatatccat 
aagtcttcgt 
gcttggacgg 
gaagtagacg 
acttgcaagg 
aagccatgcc 
acttggccct 
ccgggaagcc 
aaactaaatg 
tattttggca 
aatgtacatc 
cgacagacta 
tcagttcctg 
tcaatgttaa 
tcgcctttct 
gattccctgt 
caacttacac 
taaaaccagc 
aatgctctct 
cgcccactat 
tataataaaa 
caatatgtac 
tacaatgctg 
ggtgcgaact 
acctaccatc 
tggagtcctg 
caagggacaa 
caagactgca 
cttttgggat 
tatgtttctg 
cagggagtaa 
tttacgggca 



gattggtatt 
ttcgcgttat 
ttgcccaata 
taaaaataaa 
attatcgctt 
ccgatttggt 
agtgggatgt 
ggtagcgtca 
ccattttatt 
cactctaatg 
cctcatcgag 
ttttttggtc 
cgtacatcaa 
ccttgtaaat 
ataatatgct 
aatgattcat 
ttaaagtgat 
aagaagataa 
catcgaactt 
aaaaaaaaaa 
attaatggcc 
tatggccagg 
cactctggcc 
gtaagcgcta 
acagctttta 
tgagtatctg 
aatatcagga 
gaaaatgcat 
gaaccagcag 
ggctaaggag 
tcgacggacg 
aatggccaga 
gcatagaaca 
caagccacat 
ccaacgatca 
tctctcacag 



atggaaatta 
tactcagttt 
aatggcgcaa 
gtgtggcctt 
tcagaaaatt 
ccgtttgact 
gtgaatgaac 
gtttgtttga 
atttgtctat 
cgtcaataaa 
tcaagcgctt 
ttcaggggag 
atcaaaagta 
gtgacaagtg 
agagtactga 
aatgaatatg 
gtcactgtcg 
attaaagcaa 
gtatacgcga 
aatgggcaat 
caccag^att 
cccagaatgc 
attatataca 
aaggcataga 
ggaaatctta 
gtggcagttc 
ggagtgcaat 
agataattcg 
taatcttctc 
accaaaaacc 
gagctcctga 
aagacgaagg 
aaataagtta 
gcgagagctt 
tggacagcaa 
atccaagtag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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ggagagtatg 
agaatctctt 
taccccgttt 
tccaacgggc 
ccattctcga 
cggcaaccca 
ctgttagcgt 
ttccagtgct 
agcagacaga 
ttgaaggagt 
ccatccttca 
tttatactgg 
atcaatggcg 
gtcggcctga 
tcgccgttcg 
aatggaaacc 
caacgacacc 
ctcctagcta 
atgctggcca 
cacaagcagc 
tagaaaccgg 
agcatataat 
atattagatt 
aaaatccaat 
tttaaatatt 
ctggcttgat 
aacagatccg 
tacaattata 
atggtacatt 
atctgtatgt 
caaatttgat 
atatggctct 
aaaaaatgaa 
ggatagggac 
acttcgcgaa 



aaaatgttta 
cgaaagtatt 
gaaagcgttc 
taaagcaata 
agaggtccca 
gctctgtgcg 
aggtgagctc 
tggcatcaaa 
cgcagccgcg 
actccgccat 
tccgcagtcc 
cgatctccac 
agaccatggc 
cgtcacacgc 
ctttgagcaa 
caaaattgac 
gacaacacca 
gtcgtcttgc 
agaataaaaa 
tggcaaaaag 
cgtggaatct 
ccattcgagc 
agattatttc 
ttttaaccga 
taggtgcatt 
atcgtttacg 
ctgaggaaaa 
attcttgtat 
tatgcaaaga 
tacgtatttt 
ctctcagctg 
gcataaattt 
agagaattgg 
tttctcttat 
ttta 



taatgatggc 
caaacaacac 
gcagttgagc 
ctcatccgtt 
ttgccagaag 
caccacaaaa 
cacgtgagtg 
gacgaggttc 
cagatgtttc 
cagatgcggt 
atccgtaatg 
tgactttgcg 
taactgatcc 
tgttcagctc 
tgcacgcaca 
aacgaatgtg 
aaaagaacaa 
ctctctctaa 
acagactaac 
agcgattggg 
caaccagctc 
catttccaga 
catccatttt 
caatccaaat 
tccaggctat 
gtcttcaaga 
aacgacgttc 
tgtgttttac 
gggtatacaa 
tgctggatgt 
ggttcggaga 
cttatacaac 
gcggggtggg 
agtagtgtct 



tataatactg 
gactcacaaa 
ggagtgagct 
gaccacagac 
ttatcgtgct 
cgatcgcgta 
gactcgcgga 
tccggacaac 
ttggcccggt 
gggaccacta 
ttgaccgtgt 
cccacggaga 
acgcttgttg 
gataattaag 
ctcgccgagt 
acgcgggtct 
ccataaacat 
acgacaaacc 
tgattagccg 
aagactcaca 
ggaacttgca 
cttatcttga 
ctataattta 
ggcccaaacg 
acaaagtttg 
atattcgaaa 
attttatctt 
attagttcag 
tcaacaccaa 
ttaccttgga 
accttacaat 
taatcaggtt 
gtgggtaagg 
ttcataaagt 



cctagaaggg 
tggcgtaaat 
cccactgctc 
gattgtcccg 
ttttgtccac 
tgcgtacttg 
cttggtgtgg 
agaaactgat 
actccactcc 
ctccagcgta 
gggcgtaggc 
gccaaataat 
gatttttggt 
taaacaaata 
cagaaatctg 
aaaaacggct 
aaaagtttgg 
ggcaaccgca 
ttcccaaaac 
acccgtcccg 
catgcatacc 
gatagatata 
ttgcctttgc 
aatttaccac 
tctacatgta 
agattcgttg 
tggctttatt 
agataatgca 
tctcgttttt 
cttaggctaa 
tttgcttgca 
gttcgtggtc 
gttttcgtgg 
tctatccgta 



aacaagagtt 
ccaaacccgg 
cggattgtgc 
ccgcagagtt 
gaatttgttc 
atccacagtc 
cattgtgcag 
gcagggcttc 
atcgatcacc 
tgagtaggag 
acagggatgg 
taaaagaatt 
cgattcgttc 
tgagaagcga 
caccaacaac 
gcgattgcga 
tggagagtct 
acggagctcc 
cagacgcgac 
cccctcctcc 
ttaagttgta 
aaagttaaga 
gaccgtaaag 
agctttagtt 
ggtgtgtgac 
atttttgaca 
aagcttgaaa 
gatatggaga 
aacaaggcac 
tggagaatta 
taaacagtag 
acatggcatt 
tagttctggg 
tgtggatgct 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
■4214 



<210> 164 
<211> 1859 
<212> DNA 
<213> Drosophila 



<400> 164 

tcgtttagag 

atggttgttc 

gcgtcacatt 

cgagtgtgcg 

tatcgagctg 

gcgtggatca 

tgggcgcaaa 

tcaacattac 

tcccaccgca 

ccaagaaaca 

cggagaacct 

agtccactca 

atcgttttgt 

ataacttctg 

ggtcaacgga 

ctccgctcaa 

gtattatagc 

cccgtaaaag 



agaggcaaga 
tttttggtgt 
cgttgtcaat 
tgcattgctc 
aacagcgtgt 
gttagccatg 
gtcagtggag 
ggatggactg 
tctgatggcg 
tctgcgcggc 
cgtctttgat 
cgtggagctc 
ggtgcgcaca 
gcaatgggac 
tgagtattgc 
ctgcgaacgc 
catcattata 
ccactatggc 



cgactagcta 
tgtcggtgtc 
tttgggtttc 
aaagcgaacg 
gacgtcaggc 
gtctcgccat 
atcgccagta 
cggatgaagg 
gagtactcct 
tgcgtctgtc 
gccaagcact 
acctacgcta 
gagctgggtt 
ctcttcgaga 
tttagcccgt 
tttcaaacgg 
aacattttca 
cagttaatca 



ggagagactc 
gttgtcgcaa 
cattgttgtt 
gcgatcgctt 
cgacgaacga 
tgataattct 
taaaccatcc 
atggctccta 
tcaaggtgat 
tgctgaagcc 
ggaactgcac 
acaggactgt 
gccggaacaa 
atggaactct 
tggagcacaa 
ggtaccgggt 
tactctccct 
tctactactt 



tccaccaaac 
tcgcagccgt 
ggtgcagatt 
ctcatatttg 
atcgaccaaa 
tttaattatt 
ctgtgcctac 
ctcatacgct 
cgatggagtg 
ctgcatcagt 
aatgccacac 
ggatcaagta 
attcgtggac 
gcggcgggac 
tccggagcag 
ttggatttac 
acttggatct 
gctgtccatg 



ttttatgttt 
ttttagaccc 
tctgactcgg 
tttacttaat 
aatccaacaa 
tggctctccg 
gcccacacgg 
ggagtagtgg 
gagtaccggg 
ttctgttgtc 
caagtccgcg 
cgcatacgcg 
aaaaagcacg 
aatcgtctgt 
tgggagctca 
gccatttgca 
gtgagagatg 
atcgttggtt 
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actccctgtt ggtctacttg gccttgaaga atcccatgaa gctctcgcat gtggcttgca 1140 

gaaacatagg atttctggcc tacttctgca tcatgctctc cttcgtcttt ctggccattt 1200 

gcagtcttga ctttctgctc aaatttaaac aaaaggctgt caggagctcc gtccgtcgat 1260 

tgtcccttgc tctggcagtt ctggcggtga ttggtcttcg gtttttggtc tccttagccc 1320 

aggactccaa gctgcccaag cacttcaagc caggcatggg agaagattac tgctggttcg 1380 

atgttcgcac ctggggtatt ttgatctact actacggacc cattgcactc ctcctgatat 1440 

tcagcattgt atgctgtctc aaggcgtact tttccatcta cgaactgcca ccagatactc 1500 

agtacatatt gggcactcaa ctgaaaatcg tcaagacgca tttctatgcc tttagcgctt 1560 

acatagtggg cgtgtttgcg gtctggatcc gagaaatagt cgtgtatata atggccagag 1620 

tgagagagca tttcttcatc attgacttct ggagtggaat ttgcattctg ggcctggcca 1680 

tagctggttt tattctgctc cttggcaaga atttgcacgt caaatcctgg tgggccatta 1740 

atgtggaatc aagtcagaca gatctgagta ttattaatgc tcgcgtatac aagttcgatg 1800 

agaaaggcga tctaaaatct tcagactctc cctacaagcc gacagtgaca tcactttaa 1859 



<210> 165 
<211> 513 
<212> PRT 
<213> Drosophila 



<400> 165 
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<210> 166 

<211> 5208 

<212> DNA 

<213> Drosophila 



<400> 166 

egagecattt 

atttccatcc 

accgacaatc 

gcatttccag 

ttaeggtett 

gaaaaaacga 

tgtattgtgt 

aaagagggta 

atttttgetg 

agctgggttc 

aatttcttat 

attgggcggg 

cttatagtag 

agatatcgac 

aatatgaata 

tccgatcgaa 

ttcaatgetc 

tctgaggatc 

gcgacagatt 

tgctgccatc 

gtccattgcg 

tttgagtgct 

taaacaggtt 

ggttagaggt 

actacttctc 

ggteggaetc 



ccagacttat 
attttctata 
caaatggccc 
gctatacaaa 
caagaatatt 
cgttcatttt 
tttacattag 
tacaatcaac 
gatgtttacc 
ggagaacctt 
acaactaatc 
gtggggtggg 
tgtctttcat 
tttaaatgtt 
agtgatcttt 
atgaatataa 
tgcctatggc 
cacgctcgag 
ggccacgctg 
gcgaacactc 
tgggcagcgt 
tggattttaa 
accttcacca 
tatcagggtt 
catttccaga 
gggtcggtgg 



cttgagatag 
atttattgee 
aaacgaattt 
gtttgtctac 
cgaaaagatt 
atctttggct 
ttcagagata 
accaatctcg 
ttggacttag 
acaattttgc 
aggttgttcg 
taagggtttt 
aaagttctat 
gattacagaa 
atctacagaa 
atgegattag 
ggcaactgcg 
ctgatcacta 
ctcgacgtgg 
gagcatctgt 
ggaatagaca 
gttagggaaa 
agacacaaaa 
atctaatggt 
ggattacctg 
cattgctacc 



atataaaagt 
tttgcgaccg 
accacagctt 
atgtaggtgt 
cgttgatttt 
ttattaagct 
atgeagatat 
tttttaacaa 
gctaatggag 
ttgeataaac 
tggtcacatg 
cgtggtagtt 
ccgtatgtgg 
taatctcatg 
aggaaegcag 
tccttcctga 
agtgctgtga 
gcggcttctc 
tcttggtgga 
gtttgtgtga 
atggcggtaa 
agcagacaca 
gtggatggtc 
ttcttccaac 
tgaccatttt 
ggccttgtca 



taagaatatt 
taaagaaaat 
tagtttttaa 
gtgacctggc 
tgacaaacag 
tgaaatacaa 
ggagaatggt 
ggcacatctg 
aattacaaat 
agtaga.tatg 
gcattaaaaa 
ctgggggata 
atgetactte 
ataggaatat 
tgttcgcaat 
tctgcacgca 
teageggaag 
caccgatggc 
gtactggctc 
agtggtggag 
atctgattag 
ttaagegate 
attatgatta 
tggggttaac 
gattagatgg 
acagecttaa 



agattagatt 
ccaattttta 
atatttaggt 
ttgatatcgt 
ateegctgag 
ttataattct 
acatttatgc 
tatgttacgt 
ttgatctctc 
getctgeata 
atgaaagaga 
gggactttct 
gcgaatttat 
gtagtcctaa 
aatgcgagga 
tcccgaggta 
atggtggttt 
ttctcgtgca 
tgacgttggt 
tggttatgac 
aattggttcg 
ctacaggata 
gttttcgagg 
tacaatgtca 
etatcttegg 
ggtgcagctt 
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1200 
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tcttggcagc 
attcgagcgc 
tgttgagaca 
atcagggcaa 
tcaccgagct 
ttagtctgca 
ctgacaacga 
agaactcgaa 
agatttaagg 
atttaagcgc 
ttactgaaat 
ttggcaaatg 
cgctgcttca 
cgcacacgta 
gacacgacac 
acaggacata 
gaccttcttc 
atccgagata 
ctccgtgagc 
gttggaccaa 
tgacactcac 
ccgggccatc 
gttggcaaca 
catatccagc 
ggtaagttgc 
cagcgaataa 
gaaattgtgc 
gaaatatgct 
ttgcacgtag 
ctttccgtgc 
gaagggaatg 
gtttacaaga 
ctatgatcgt 
ttgacgtatg 
cggagcatgg 
agagaaaaca 
accaatatcg 
cggttgtccg 
gggcagcaga 
acctctatgc 
ggaactattg 
ttcactgtat 
ctctcggcca 
cacaggacgc 
ttcggcataa 
gttattattc 
acagttagtc 
aagacctatt 
ggcaatagag 
acaacaggtt 
agccaacgtg 
tccgcgggac 
tggattcgtg 
cgaactccac 
tccataaatt 
agcctttggg 
gaagggtaac 
atgcgaacgg 
atgaatactg 
tagcataatt 



acattcgcgt 
ct tttctcca 
catgcacata 
ttaggaatgg 
aggcggctgg 
ggcagtgtct 
aaataagtac 
gtgcatatgt 
aacgtggaat 
gggttagcaa 
cgacgaaatc 
cgttgcggat 
tcgagaacga 
tcgttctgag 
acaagaggga 
acgtggagtt 
ttcattacaa 
taaaagagct 
caggtgatgc 
gcacgataaa 
ttgtccttct 
tcgcgctgca 
acgatcgcca 
cagcagtata 
tgggcgcaat 
aacaggaagc 
cacaagtcaa 
gtgtatccaa 
cacagcgagg 
tggttgcgga 
gagaacatca 
agtaaaggtg 
cctgcatgtc 
tcggaaccat 
atccgttctc 
tcttggggca 
tagactcgtt 
agtggtctat 
tattcaggca 
gctccgttag 
tgccgtagta 
cgtagaaatt 
taacgtaagt 
ccagcaatgt 
tgaaggttct 
tttaacgttt 
atcagtcgca 
tgccgattag 
caacatctag 
gtcgaaaaga 
tcccatgggg 
atcaatcata 
ttaaattgat 
tccaaaccaa 
cattaccagt 
gaccaactaa 
ctgcggcatt 
gtggagaagg 
gcagtgggtg 
aatcgctgag 



cagaagtgat 
ggagtcttca 
tggacacaca 
tacttctgtt 
agattcagtg 
gttggtgttg 
aacatataag 
attcccataa 
tagacaggaa 
aggccacaac 
gatttagact 
caccaccagc 
agtcggtccc 
aatagtcggg 
gtcacgagtc 
aagaatagag 
acaacatgaa 
tagaccatcc 
ccattatgag 
aacgtcttta 
cggttcgtag 
cgccgtgaat 
gaatgggacc 
caccatcacc 
aggtgaaggc 
gtttcttctc 
agctaatgac 
acactttgca 
agtaacccac 
gctcgggaat 
tggctagtgc 
atcgtttcac 
acaattggcc 
gcagaactcg 
aaagagtaga 
gggtctgaag 
gcggaaggtt 
tatgcatccg 
ggacttgaac 
cgagctgtgt 
atcatagctt 
acagggcaca 
gccgaaggga 
aactccgcag 
gaaattcgta 
attcttttaa 
acacgcagca 
ttagtcgttt 
ggacccatta 
attaaggagg 
aggaaccagg 
aggtcaacca 
agtaactgcc 
tcccgatctg 
gcgaaaagtt 
ttcattgatc 
gtcaggagtg 
acaaacacat 
gggaagggag 
cacatgcact 



ggaagtcgag 
atgaatagcg 
tggtggatgc 
tagtagctac 
ctgtgttggc 
agcttgcttt 
gtttaaaata 
tcttcaatgg 
tcatataacc 
aaacgagcaa 
tcgtgtgaat 
ttgcggatct 
atcgttacgc 
gttacagata 
gtgtgaatgg 
caacacttac 
aatgagaaag 
cttgtcgctg 
aaagaggcgt 
ggggaaggtg 
gttcttggtg 
cttaatcgcc 
gtagaaatag 
aatacctggc 
taggaaaact 
ctggaatcgg 
acttagccac 
tgtaacacca 
ggagagccct 
caataggtaa 
aaggaatttc 
ttaccatacg 
ggatgtatgg 
ttcttcctga 
taatcatccc 
ctttggatag 
atattgagca 
tcaacttcag 
ttgcagacgc 
atgtagtcgt 
ccgttgggaa 
tttggatcgt 
atccccggaa 
tatagggatc 
aatctttatt 
cgtttaaata 
attaagcctt 
aatgtttatc 
gcggggacga 
cgaaacagga 
ggcggactac 
cgcagggagt 
attgtctgtc 
cccacttttc 
ataccgctcg 
cgctgattgt 
tgggccacta 
taaactccgc 
agccagagca 
tgggaaattt 



gtagtggtcg 
atcgatgatg 
aaacataagt 
aaagacgcga 
tgtcagtgtt 
taagcaatat 
atcacagatt 
gtcttccatc 
cgaaattttc 
ccgagctaaa 
gaataccact 
ggttggaggc 
tctcagtctg 
tgatggtgga 
acaaacgccc 
ctgtttgtga 
ccctgcatcg 
cccacgaagt 
aagaacaagc 
gataagaaat 
ctgttctcac 
gtcattatga 
atcatggctg 
tttaggttcg 
agggcaatac 
ttaattcctc 
atgtaggcac 
gtggcatcaa 
attaaatagc 
acggctatgg 
ggtcggatta 
aattgataat 
tgtagaacat 
ttagcagttt 
aattgttggt 
caaactgcgt 
taggcggatc 
agttaaagac 
atcccctaac 
acgtcccgac 
agagcctatg 
ctatggtgtg 
tcaaccggaa 
ctccaggctg 
aaatattatt 
ttcagtggaa 
agcaagtgaa 
taagtgagtt 
gtggtgaatg 
tagcgtcaca 
ccacgacctg 
gcagactgag 
aatagaggtg 
aagcgctgct 
ctatcttatc 
ttgcctttgg 
aattagtccg 
acagttggct 
cttatcttgc 
cagcggacca 



agctcaggcc 
atcaaattga 
ttgtgtttag 
ccggactgag 
cggtttagag 
cgaaagcagt 
gcgcgctgtc 
ttagtgcaag 
aatattcttg 
ggagaccttc 
ggtggaccca 
gtccataaac 
ccgcagactg 
cagagtcaga 
agtgggcggg 
tcaagtgctt 
cattggccag 
acgagataag 
cgaatctata 
actgaaagga 
tcgcaattat 
acatgatggt 
cccagttgct 
cgggcaaatt 
cccaagagta 
tcgtccctcg 
ccataaaaaa 
cttgatagag 
agaccaagct 
taagcatcat 
ataaaatgct 
ctttaccgtg 
gtccgattcg 
atcgtccaca 
ctctggctgc 
aaacatgttg 
tggccacgtt 
ctgccgccac 
gtgttcccga 
caattciigcg 
gcccgtcaga 
gtagtggtct 
cacgaccaag 
atgaattttc 
acaaatattt 
tatagttatt 
gtcactaatt 
cactaatttg 
atctatgcgg 
agcaattagc 
agcgccattc 
caaacattta 
acagtctccc 
aacgagcccg 
ttgtatcatt 
gggagaagca 
aatttatggg 
aatggggaaa 
ggtgggaaat 
aattcaatcg 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
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tatatatctg cgcactttaa atgtgcgaat gagtgctgcg gatgtttg 5208 



<210> 167 
<211> 1914 
<212> DNA 
<213> Drosophila 



<400> 167 

atgccgaaga 

gtcctgtgct 

gccgagagag 

acagtgaatc 

atagttcccg 

atagaggttc 

tgctgcccgt 

gacaaccgaa 

atattggtca 

ttttctctgc 

atgctccgtg 

tacgtcaacg 

gatcatagca 

cgcaaccagc 

tcgctgtgct 

ggatacacag 

gacttgtggc 

ttcctgtttt 

tgcgcccagc 

tactgctggc 

gcgatcgttg 

cagcgcgaga 

aaggacaaat 

gagcttatct 

gatctggcca 

gtcaagcact 

gtaacgatgg 

atccgcaagc 

ccattgtcta 

aaacacagat 

aagaccacgt 

ccgctagtga 



aaattcatca 
tggtcgtgtt 
accactacca 
tgacgggcca 
cagaattggt 
gggaacacgt 
ggcggcaggt 
cgtggccaga 
acatgtttac 
agccagagac 
tggacgataa 
aatcggacat 
cggtaaagat 
acggaaagag 
acgtgcaact 
catatttctt 
acaatttccg 
attcgctgta 
aacttaccaa 
tggatatgag 
ttgccaacac 
tggcccggat 
tcggcttgtt 
cgtacttcgt 
atgcgatgca 
tgatcacaaa 
gaccgacttc 
tggtggtgat 
ttccacgctg 
gctcgagtgt 
cgagcagcgt 
tcagctcgag 



gcctggagga 
ccggttgatt 
caccatagac 
taggctcttt 
cgggacgtac 
taggggatgc 
ctttaactct 
tccgcctatg 
gcagtttgct 
caacaattgg 
actgctaatc 
gttctacacc 
tatcaattcg 
cttggtctgc 
ctatcaagtt 
ttttatgggt 
agggacgaga 
ctcttggggt 
tttgcccgcg 
caactgggca 
catcatgttc 
aattgcgagt 
cttacgcctc 
gggcagcgac 
gggctttctc 
cagaacgata 
gttctcgatg 
ccgcaacgca 
cccacgcaat 
tcgcgatggc 
ggccaatctg 
cgtggatcct 



tccctatact 
ccggggattc 
gatccaaatg 
cccaacggaa 
gactacatac 
gtctgcaagt 
gaagttgacg 
ctcaatataa 
atccaaagct 
gatgattatc 
aggaagaacg 
atacatccgg 
tatgccgttt 
tatttaatag 
gatgccactg 
gcctacatgt 
ggaattaacc 
attgccctag 
aacctaaagc 
gccatgatct 
ataatgacgg 
gagaacagca 
tttctcataa 
aagggatggt 
attttcatgt 
cgtgtgcgca 
aagcagcggt 
tttgccaatg 
ggacgtcata 
agcaaccaac 
tcgctgcacg 
cagaaaacca 



gcggagttac 
ccttcggcac 
tgccctgtaa 
gctatgatta 
acagctcgct 
tcaagtcctg 
gatgcataat 
ccttccgcaa 
tcagaccctg 
tactctttga 
agttctgcat 
ccaattgtga 
acctattgat 
ggctctccgt 
gtgttacatg 
ggctaagtgt 
gattccagga 
ttttcctagc 
caggtattgg 
atttctacgg 
cgattaagat 
ccaagaacct 
tgggcatcac 
ctaagctctt 
tgtttgtaat 
gtctgcggca 
ttatggacgc 
ggtccaccag 
accactccac 
gtcagagcca 
agaagccatc 
ccatcttccg 



attgctgggc 
ttacgttatg 
tttctacgat 
ctacggcaca 
aacggagcgc 
cctgaatatc 
agaccactcg 
cgagtctacg 
ccccaagatg 
gaacggatcc 
ggttccgaca 
catgcaggac 
tcccgagctc 
gggttactcc 
caaagtgttt 
cattagcttt 
gaagaaacgc 
cttcacctat 
tgatggtgta 
tcccattctg 
tcacggcgtg 
acgaaccgag 
ctggctcacg 
ttatatctcg 
gaagaagaag 
gactgagagc 
ctccaaccag 
tgatttaccg 
cacttcacac 
gtactccacc 
ggtggagaag 
ctga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1914 



<210> 168 
<211> 637 
<212> PRT 
<213> Drosophila 



<400> 168 



Met 


Pro 


Lys 


Lys 


He 


His 


Gin 


Pro 


Gly 


Gly 


Ser 


Leu 


Tyr 


Cys 


Gly 


Val 


1 








5 










10 










15 




Thr 


Leu 


Leu 


Gly 


Val 


Leu 


Cys 


Leu 


Val 


Val 


Phe 


Arg 


Leu 


He 


Pro 


Gly 








20 










25 










30 






lie 


Pro 


Phe 


Gly 


Thr 


Tyr 


Val 


Met 


Ala 


Glu 


Arg 


Asp 


His 


Tyr 


His 


Thr 






35 










40 










45 








lie 


Asp 


Asp 


Pro 


Asn 


Val 


Pro 


Cys 


Asn 


Phe 


Tyr 


Asp 


Thr 


Val 


Asn 


Leu 




50 










55 










60 










Thr 


Gly 


His 


Arg 


Leu 


Phe 


Pro 


Asn 


Gly 


Ser 


Tyr 


Asp 


Tyr 


Tyr 


Gly 


Thr 


65 










70 










75 










80 


He 


Val 


Pro 


Ala 


Glu 


Leu 


Val 


Gly 


Thr 


Tyr 


Asp 


Tyr 


He 


His 


Ser 


Ser 
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85 










90 














Leu 


Thr 


Glu 


Arg 


He 


Glu 


Val 


Arg 


Glu 


His 


Vai 


Arg 


Gly 


Cys 


Val 


Cys 








100 






* 




105 










110 






Lys 


Phe 


Lys 


Ser 


Cys 


Leu 


Asn 


He 


Cys 


Cys 


Pro 


Trp 


Arg 


Gin 


Val 


Phe 






115 










120 










125 








Asn 


Ser 


Glu 


Val 


Asp 


Gly 


Cys 


He 


He 


Asp 


His 


Ser 


Asp 


Asn 


Arg 


Thr 




130 










135 










14 0 










Trp 


Pro 


Asp 


Pro 


Pro 


Met 


Leu 


Asn 


He 


Thr 


Phe 


Arg 


Asn 


Glu 


Ser 


Thr 


145 










150 










155 










160 


lie 


Leu 


Val 


Asn 


Met 


Phe 


Thr 


Gin 


Phe 


Ala 


He 


Gin 


Ser 


Phe 


Arg 


Pro 










165 










170 










175 




Cys 


Pro 


Lys 


Met 


Phe 


Ser 


Leu 


Gin 


Pro 


Glu 


Thr 


Asn 


Asn 


Trp 


Asp 


Asp 






180 










185 










190 






Tyr 


Leu 


Leu 


Phe 


Glu 


Asn 


Gly 


Ser 


Met 


Leu 


Arg 


Val 


Asp 


Asp 


Lys 


Leu 






195 










200 










205 








Leu 


He 


Arg 


Lys 


Asn 


Glu 


Phe 


Cys 


Met 


Val 


Pro 


Thr 


Tyr 


Val 


Asn 


Glu 




210 










215 










220 










Ser 


Asp 


Met 


Phe 


Tyr 


Thr 


He 


His 


Pro 


Ala 


Asn 


Cys 


Asp 


Met 


Gin 


Asp 


225 










230 










235 










240 


Asp 


His 


Ser 


Thr 


Val 


Lys 


He 


He 


Asn 


Ser 


Tyr 


Ala 


Val 


Tyr 


Leu 


Leu 










245 










250 










255 




He 


Pro 


Glu 


Leu 


Arg 


Asn 


Gin 


His 


Gly 


Lys 


Ser 


Leu 


Val 


Cys 


Tyr 


Leu 








260 










265 










270 






He 


Gly 


Leu 


Ser 


Val 


Gly 


Tyr 


Ser 


Ser 


Leu 


Cys 


Tyr 


Val 


Gin 


Leu 


Tyr 






275 










280 










285 








Gin 


Val 


Asp 


Ala 


Thr 


Gly 


Val 


Thr 


Cys 


Lys 


Val 


Phe 


Gly 


Tyr 


Thr 


Ala 




290 










295 










300 










Tyr 


Phe 


Phe 


Phe 


Met 


Gly 


Ala 


Tyr 


Met 


Trp 


Leu 


Ser 


Val 


He 


Ser 


Phe 


305 










310 










315 










320 


Asp 


Leu 


Trp 


His 


Asn 


Phe 


Arg 


Gly 


Thr 


Arg 


Gly 


He 


Asn 


Arg 


Phe 


Gin 










325 










330 










335 




Glu 


Lys 


Lys 


Arg 


Phe 


Leu 


Phe 


Tyr 


Ser 


Leu 


Tyr 


Ser 


Trp 


Gly 


He 


Ala 








340 










345 










350 






Leu 


Val 


Phe 


Leu 


Ala 


Phe 


Thr 


Tyr 


Cys 


Ala 


Gin 


Gin 


Leu 


Thr 


Asn 


Leu 






355 










360 










365 








Pro 


Ala 


Asn 


Leu 


Lys 


Pro 


Gly 


He 


Gly Asp Gly Val 


Tyr 


Cys 


Trp 


Leu 




370 










375 










380 










Asp 


Met 


Ser 


Asn 


Trp 


Ala 


Ala 


Met 


He 


Tyr 


Phe 


Tyr 


Gly 


Pro 


He 


Leu 


385 










390 










395 










400 


Ala 


He 


Val 


Val 


Ala 


Asn 


Thr 


He 


Met 


Phe 


He 


Met 


Thr 


Ala 


He 


Lys 










405 










410 










415 




He 


His 


Gly 


Val 


Gin 


Arg 


Glu 


Met 


Ala 


Arg 


He 


He 


Ala 


Ser 


Glu 


Asn 








420 










425 










430 






Ser 


Thr 


Lys 


Asn 


Leu 


Arg 


Thr 


Glu 


Lys 


Asp 


Lys 


Phe 


Gly 


Leu 


Phe 


Leu 






435 










4 40 










4 4 5 








Arg 


Leu 


Phe 


Leu 


He 


Met 


Gly 


He 


Thr 


Trp 


Leu 


Thr 


Glu 


Leu 


He 


Ser 


450 










455 










4 60 










Tyr 


Phe 


Val 


Gly 


Ser 


Asp 


Lys 


Gly 


Trp 


Ser 


Lys 


Leu 


Phe 


Tyr 


He 


Ser 


465 










470 










475 










4 80 


Asp 


Leu 


Ala 


Asn 


Ala 


Met 


Gin 


Gly 


Phe 


Leu 


He 


Phe 


Met 


Leu 


Phe 


Val 








485 










490 










4 95 




Met 


Lys 


Lys 


Lys 


Val 


Lys 


His 


Leu 


He 


Thr 


Asn 


Arg 


Thr 


He 


Arg 


Val 








500 










505 
















Arg 


Ser 


Leu 


Arg 


Gin 


Thr 


Glu 


Ser 


Val 


Thr 


Met 


Gly 


Pro 


Thr 


Ser 


Phe 




515 










520 










525 








Ser 


Met 


Lys 


Gin 


Arg 


Phe 


Met 


Asp 


Ala 


Ser 


Asn 


Gin 


He 


Arg 


Lys 


Leu 




530 










535 










540 










Val 


Val 


He 


Arg 


Asn 


Ala 


Phe 


Ala 


Asn 


Gly 


Ser 


Thr 


Ser 


Asp 


Leu 


Pro 


545 










550 










555 










560 


Pro 


Leu 


Ser 


He 


Pro 


Arg 


Cys 


Pro 


Arg 


Asn 


Gly Arg 


His 


Asn 


His 


Ser 



2.13 



SUBSTITUTE SHEET (RULE 26) 



WO 01/070980 PCT/US01/09341 



565 

Thr Thr Ser His Lys 
580 

Gin Arg Gin Ser Gin 
595 

Asn Leu Ser Leu His 
610 

Ser Ser Ser Val Asp 
625 



570 

His Arg Cys Ser Ser Val 
585 

Tyr Ser Thr Lys Thr Thr 
600 

Glu Lys Pro Ser Val Glu 
615 

Pro Gin Lys Thr Thr lie 
630 635 



575 

Arg Asp Gly Ser Asn 
590 

Ser Ser Ser Val Ala 
605 

Lys Pro Leu Val lie 
620 

Phe Arg 



<210> 169 
<211> 3957 
<212> DNA 
<213> Drosophila 



<400> 169 

ccagacatgc 

tcatctgaac 

tactaccgaa 

cttcaacatc 

atttactgcc 

ccttacccca 

gctccgtatt 

ggggaaatcc 

ctgattctgg 

aaagtgcaag 

ggtggcgccc 

tcgtgccagc 

gttatgtggg 

atggcatttt 

atgaaatgag 

ccaggacacg 

catcggtgtc 

gcagtggtgg 

ggactccgcg 

ctatggcaac 

gctgacgctg 

caactccacg 

gctgctggcc 

cctcaatctc 

acactacgta 

aaccccacgc 

cccttgaccg 

acttgcacgt 

ttctgtttgc 

acaccacgcc 

tcatgtcctt 

tactcaacca 

cccatggaga 

tgacgggctg 

cctcgaacgg 

tacagcttca 

gctcctaccg 

cagcaaaatc 

taatcatacg 

tggtataagt 

cgcagcgcca 

cccaaacctc 

tgatcttcat 



ttcctcgatg 
cgaatgacag 
ctgatactta 
tcgttaacgc 
atgccatgca 
tgcactctat 
gaattgattt 
ccgaaaaaca 
caggaattgg 
gttgcgtggc 
ggaaggaaga 
attgcataag 
tgtttgtgtg 
gttactttgc 
gcgaccaggc 
gcaactcgca 
ggcaagggat 
gagaactcgt 
gagctgcatc 
tttagtctag 
ctcctgctcg 
ctcgttctga 
gtctgcattg 
gcgatgaacc 
caggtgagtg 
tattcaggtc 
ctacgccaca 
ctccctggcg 
ctacaaaatc 
aaacccggtc 
catcatcttc 
ctacggcgtc 
gctgccgctg 
ceccaacgcc 
gaacggcacg 
cacactgcag 
ctctagcaat 
gttgtaaaca 
ccccgttgac 
taactaactt 
gcgcatcaac 
cacgctgcgg 
tgcctgctgg 



gggcttaacg 
tggaggtcgc 
aatcaaggca 
tctgcccaat 
agagatatga 
catgaacacc 
cggttgcccc 
ttcgaaccat 
agggcatagg 
cattcatttc 
gcatcagtaa 
tgtcagtgtc 
gtagtagtgt 
taccttcctt 
aagcgcctcg 
caggcacagg 
actggcgcgg 
atcggcgcca 
tcgccctcca 
gcaatccgta 
ctgttagcta 
ccctctgctc 
ccgatctcct 
ggaggacgcg 
ggtcaatcaa 
atgcccgttt 
gtaaagcatc 
ctgctatcct 
atcgccaagt 
agcatcagct 
cacaccatcg 
cgacgcaagc 
cccattccca 
cagcaagcgg 
ggcaccggtg 
ccgcgccagc 
cccatatcac 
ggatgtatgc 
tttggtcaga 
gcaaacccac 
cgagcaggca 
tcgcgaaggc 
gcaccgcacg 



tcagcggagc 
tagaaaaggg 
gttaaaaatt 
tctacagtca 
atcaccgcat 
gcacacactt 
aaagtgccac 
tcgtccctgg 
gcggaccttg 
gcagccgatg 
tctctaagtg 
aacggggcgg 
ccttgttggc 
tgtcgcccgt 
tcatcgaaga 
aacaggtgca 
gatcgaaagc 
acatccagag 
ggagcccaac 
tgacgtggac 
tggcctggtg 
cgcctcttcg 
ggtcaccggc 
atcgctgccc 
acaagcttaa 
cggccagcac 
cgcgattggc 
ggctcgcctc 
cgatggtggt 
gcacctccga 
ccgtgttcgt 
tctgcgccct 
tcctgcgacg 
cctgtggtgg 
gcggtcccat 
caggctctgc 
ccaggtaagt 
ttgatgtagt 
atatatttca 
ttagggccat 
ggggaccagc 
acctggctaa 
tcttctgcat 



agcgatgtca 
gttccatttc 
taatatgaaa 
tccgatttta 
gggaatagcc 
ttaatctgcg 
attccaaatc 
cgagcacaga 
tggcggtttg 
ggtggaagtg 
cgccttgttc 
ggatgtggtt 
agacaaaaag 
cgcacggcag 
tcacccaaac 
ggcacaacaa 
atgaacaggg 
cccaccgatg 
cagctgcccg 
tcagagcact 
gttttcggcg 
gtgcgtttgc 
atctccgctc 
cttgtgctgt 
gttagccaag 
tatttcgttt 
gcagcttcgc 
tgccgccatc 
caagggcggt 
cctgggcgcc 
gttgcctggc 
gtcgctaacg 
ccagacccac 
gggcaccacg 
ggccgtcagt 
cggatcggcc 
cataaatatt 
caaaagctat 
tcgaatggca 
gagggagatt 
gacacccggc 
catgctcatc 
cttctacaag 



gtggcagcca 
cactagttta 
ccggaaatgg 
atgaagcaaa 
cacaaaggac 
actaattgct 
tccgttgatt 
aacagcgtct 
tcgagtgcac 
ggcagcggtg 
gcattcgtac 
gtgcttggtg 
actgatgatg 
ctcctttagt 
agagtacgga 
ctggcaggag 
acaatctgca 
acctcggcct 
cggactacga 
ccatttcccc 
gcgtcgtggg 
gtaacccgct 
cggtcacgct 
gcaaggtcat 
taactaagtt 
ttcatgctct 
cagcggcgct 
agcactccct 
ggggcggcga 
aacgccatgt 
attggtgtgc 
gctcgagctg 
atggtcattg 
gcggacgaca 
ccgggcgaca 
ctcgaaccgg 
gtgaagttat 
tagtttatgt 
gtacgtcatt 
cgcgcccact 
atcccactgc 
gcctcggccg 
aacttcggca 
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120 

180 
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600 

660 

720 

780 

840 

900 

960 

1020 
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1200 

1260 
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1440 
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1560 
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1680 

1740 

1800 
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acaatcagca gtgcagccag acctctgttt acttcagcct tctgctaggt aagtctgaat 2640 

tggcaatcat tcccaatccg aatccgagtc cgaattggaa ctgactagtt gatcgcgttg 2700 

caggctactt ctactcggcc atcagtccgg tgatctactg ggcgctcaac cacaattcgc 2760 

ttcgacaatc tccctgcgcg cccatcatcc gcctgcgctc catgcagaac ttcctgaggt 2820 

cgcgcttccg gacgcacacc gccccgccgc ctccctcgtc caccaacgag gcggcactgg 2880 

gggcattcaa ccccaagctg atcaagctaa caccgaaaca gtatagagct caggcctctt 2940 

cgcattatct ctactagtgt gatactaagc taaacttgat atacatatat acacatttat 3000 

atacacgcat atatagtata ttttttacct acgaccccac cccccccctg acccgtcgat 3060 

gccctcttag taccctctag tatcgaactc cgaacccgaa ctttgaccca accgaacgaa 3120 

actatctaga tgtttttcaa aagctctacg caacaattgc ataatacgaa agatacatgc 3180 

tacatataca catatataga gtacatacta ccttgcatag cagtgttagt cgacgcttat 3240 

ttacacgatg ttgtgcgcct gaagctatgc aataaggaca ccggagtcct gccgcaactc 3300 

ttgcctataa cgaggatagc actggtaacg aaagcaagaa aaaaaaaagg aaaaaaaata 3360 

caaaaataac gaaacccttc tagtgacttg ttcgaattgt tgtttagtct tttaagctga 3420 

aatattgtac tatgttcctt ctcatagata agcaataagt gaacgcttta agccaagtta 3480 

aacaaacgga aaacgaatca aactaaacta aaactcgatc taaaactttt aaacgattta 3540 

aacgtaaact cgaaagcata cctaacgaaa cattaataat tatatacata cgcgactaca 3600 

tacaaataac aactaacaag tgaaaactaa aaacaaaaaa tgaatgtaaa tgtgagcaaa 3660 

aaaacaaaag gataacattt agtattgtat acatattctt cgacttaatt ctttatacac 3720 

tttttctaac catttttaat atcgttcagt aaactgaaca cggtgtatag ggcaattcaa 3780 

gctgctggat atattcgcgt ttcagatgcc agagccaccc aaaggatcgg actctggatc 3840 

gggtttcatt tcactccatt cagtggcgac cgtttcctaa tttgccaaac ttttgtacat 3900 

gctccgctgc gagcgactct ctccttactc cacctggttt ttcgttcaat ccttgga 3957 

<210> 170 
<211> 1638 
<212> DNA 
<213> Drosophila 

<400> 170 

atgaacaggg acaatctgca gcagtggtgg gagaactcgt atcggcgcca acatccagag 60 
cccaccgatg acctcggcct ggactccgcg gagctgcatc tcgccctcca ggagcccaac 120 
cagctgcccg cggactacga ctatggcaac tttagtctag gcaatccgta tgacgtggac 180 
tcagagcact ccatttcccc gctgacgctg ctcctgctcg ctgttagcta tggcctggtg 240 
gttttcggcg gcgtcgtggg caactccacg ctcgttctga ccctctgctc cgcctcttcg 300 
gtgcgtttgc gtaacccgct gctgctggcc gtctgcattg ccgatctcct ggtcaccggc 360 
atctccgctc cggtcacgct cctcaatctc gcgatgaacc ggaggacgcg atcgctgccc 420 
cttgtgctgt gcaaggtcat acactacgta caggtcatgc ccgtttcggc cagcactatt 480 
tcgtttttca tgctctccct tgaccgctac gccacagtaa agcatccgcg attggcgcag 540 
cttcgccagc ggcgctactt gcacgtctcc ctggcgctgc tatcctggct cgcctctgcc 600 
gccatcagca ctccctttct gtttgcctac aaaatcatcg ccaagtcgat ggtggtcaag 660 
ggcggtgggg cggcgaacac cacgccaaac ccggtcagca tcagctgcac ctccgacctg 720 
ggcgccaacg ccatgttcat gtccttcatc atcttccaca ccatcgccgt gttcgtgttg 780 
cctggcattg gtgtgctact caaccactac ggcgtccgac gcaagctctg cgccctgtcg 840 
ctaacggctc gagctgccca tggagagctg ccgctgccca ttcccatcct gcgacgccag 900 
acccacatgg tcattgtgac gggctgcccc aacgcccagc aagcggcctg tggtgggggc 960 
accacggcgg acgacacctc gaacgggaac ggcacgggca ccggtggcgg tcccatggcc 1020 
gtcagtccgg gcgacataca gcttcacaca ctgcagccgc gccagccagg ctctgccgga 1080 
tcggccctcg aaccgggctc ctaccgctct agcaatccca tatcacccag ggccatgagg 1140 
gagattcgcg cccactcgca gcgccagcgc atcaaccgag caggcagggg accagcgaca 1200 
cccggcatcc cactgcccca aacctccacg ctgcggtcgc gaaggcacct ggctaacatg 1260 
ctcatcgcct cggccgtgat cttcattgcc tgctgggcac cgcacgtctt ctgcatcttc 1320 
tacaagaact tcggcaacaa tcagcagtgc agccagacct ctgtttactt cagccttctg 1380 
ctaggctact tctactcggc catcagtccg gtgatctact gggcgctcaa ccacaattcg 1440 
cttcgacaat ctccctgcgc gcccatcatc cgcctgcgct ccatgcagaa cttcctgagg 1500 
tcgcgcttcc ggacgcacac cgccccgccg cctccctcgt ccaccaacga ggcggcactg 1560 
ggggcattca accccaagct gatcaagcta acaccgaaac agtatagagc tcaggcctct 1620 
tcgcattatc tctactag 1638 

<210> 171 
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<211> 545 
<212> PRT 
<213> Drosophila 



<400> 171 
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435 440 445 

Gin Cys Ser Gin Thr Ser Val Tyr Phe Ser Leu Leu Leu Gly Tyr Phe 

4 50 4 55 4 60 

Tyr Ser Ala lie Ser Pro Val He Tyr Trp Ala Leu Asn His Asn Ser 
465 470 475 480 

Leu Arg Gin Ser Pro Cys Ala Pro He He Arg Leu Arg Ser Met Gin 

485 490 495 

Asn Phe Leu Arg Ser Arg Phe Arg Thr His Thr Ala Pro Pro Pro Pro 

500 505 510 

Ser Ser Thr Asn Glu Ala Ala Leu Gly Ala Phe Asn Pro Lys Leu He 

515 520 525 

Lys Leu Thr Pro Lys Gin Tyr Arg Ala Gin Ala Ser Ser His Tyr Leu 
530 535 540 

Tyr 
545 



<210> 172 
<211> 7112 
<212> DNA 
<213> Drosophila 



<400> 172 

gaatgataac 

atctatgtat 

gcagtttctt 

gtattttttg 

tcctgtcacc 

ggctccagga 

caatacgaag 

attgtaactt 

aagcaaagcg 

ttatttaaat 

tttttgatgc 

ttagctatgt 

ttctggaaaa 

tcgttaatct 

atatcttcta 

tattttggtg 

cttggcgttt 

tctgatagaa 

tttcatcatg 

actaacttca 

taagtgaagc 

ttaacacaaa 

attaaattta 

tatgtatgca 

aacatttacg 

caggcttttc 

tgtgtttcct 

tggagtgctg 

aaaaaagtat 

ttttagtcgc 

tgtttaatat 

ctatctcatt 

aatgatttat 

attttctaat 

tggcataatt 

atgagatgag 

ggctgccagt 



taattcaata 
ctatgactat 
ttccttcgta 
gcagctctgg 
ctgcccggtg 
acaaatgcgt 
tgcaaactca 
tggtaaagtc 
aaaactttag 
attttggctt 
actacaaatg 
acatataatt 
cactcattta 
ttaaagtgta 
attccgcatt 
gtcacattat 
gccgacttct 
aggtgagtga 
cacgagcgga 
ctgtaaagtt 
atttaagtga 
agttaaagta 
tttatttatc 
tgtgcctcga 
ccggctgtca 
aggcttttat 
taactttccc 
cttgaaaatt 
tttgacaaat 
agtgtcagaa 
gtgtttaatt 
agaattcttg 
ttgcattccc 
atgctttttg 
ttagctaaat 
attacgaaaa 
gacaatttaa 



aaactacaga 
gtatctcgaa 
tcgagtttca 
ctggtttttt 
taattaaaaa 
tacattgtgg 
gccgtaacaa 
aatgttttct 
tgcgtacttt 
ttctatccat 
ctgttgattt 
acgaatattt 
tttaacacta 
tttgtgacca 
gttttttttt 
cacgacgact 
gtgttggact 
caattggcaa 
gctgcaaaat 
gccaatagtt 
agcatttata 
gcgtttcagc 
agctgcagaa 
aaacaaagcc 
cataaaggaa 
ccccacgctc 
aaaaacacgt 
caaacaaaaa 
gagtttggct 
ataaataatg 
aaaagctatg 
aaatgaccat 
gatgttaaat 
tggccaaaga 
tatgcgcaca 
atcataattt 
tttaatgatt 



tttcaataac 
tcctccgctg 
gagcacggct 
tcattttcct 
agtttttctc 
accatccgat 
taagattttt 
ttttccaagt 
tattacaaca 
tgtttttcag 
taataaacga 
ttaataaagg 
gcaaattagc 
cagcacaatt 
ttttgtgaat 
gcgctcaata 
cttttgtgtg 
aaacaacatt 
tctcggaaag 
tttaatgtcc 
gaaaagtttt 
tagtacagat 
tgtggctgaa 
ccaaaaaaaa 
actaaaactc 
cgctttcact 
cgcagtgacg 
gaagcctttt 
tacagtttct 
cttcattaat 
ttctatacat 
ttatttatgc 
cctgaaattg 
actaaattat 
acttgacgcc 
atgcatttat 
ataaacatca 



taaagcatac 
aaactctaaa 
tgtagtgttg 
tttttttttg 
gacatttgct 
tgtaaaataa 
gcaataatat 
tggccaataa 
ttaacttcat 
ttaatatctt 
tttatagctt 
gcatacttaa 
gcacaaataa 
aaacggtggt 
ttttcaggta 
accaactttt 
atgcagaatc 
atttcagtat 
catatacgaa 
gttttcatac 
tagttatagt 
agtagaatgt 
acggtatctg 
gtttcaactt 
cgcaggcgtc 
ccaatcatgt 
tgttcactta 
tcagatttgc 
ataagttatt 
ttaaatttgt 
ttagaaaaaa 
act taataat 
gtttgcatat 
tgagatgacc 
caccagaaat 
aatttatatg 
agtcgtttta 



atgtatatgt 
ctattcccat 
cacgtttttc 
tactcccatt 
tgccatgtga 
attttaagtg 
ttaactttta 
aaagagcata 
gctatatatt 
aaacaaacaa 
acatggatta 
taaattgaaa 
tgtgcaaagt 
ccgaaattct 
acctccttgt 
tcctcgccaa 
tgtccatcta 
taaattaacg 
gaatacgtga 
gcaaaaaatt 
aggttaaaaa 
tatgtggtcg 
agtgcatacg 
tcaaccgtga 
actttcactt 
ttatttgcat 
aaataaagcg 
agggcaaatt 
taatttaaaa 
gaaatgcatg 
aagtaattac 
aaacagacat 
tttccatttc 
catctgtgtt 
catatgcaaa 
tgcggcatac 
agaaagtttt 
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tagttcatta 
aaatggggaa 
attatgttta 
attgctaata 
aaaaggaaaa 
tggcgggcgt 
tgccaagtat 
ttcagctggg 
tacacggcct 
caccccatta 
aaacctcata 
tgtgccctgc 
gtattgtatt 
ctaatgaaac 
cgactcgaaa 
ctcgaagggt 
catcacagcc 
agacataaag 
aatttaacaa 
gttttttttt 
gcccaagttt 
ggagatatgt 
tgtgcttctc 
gcttgaagtt 
caatccaaaa 
aaatgtattt 
ctaatgagat 
tcttcgtctg 
ctccattttt 
tcgccccttg 
ggtgctgtac 
tgtggtgcag 
tatgcacaac 
ccagctgcaa 
tgggggcggt 
caaacagagc 
ccaggagaac 
ctaacaatag 
gctttaactt 
atcgctatcg 
tcgtaaaaac 
gctcggagca 
accgatgtga 
actgcaagtg 
cggacagcag 
ctcgacagca 
cagctcactt 
ttcggaaaag 
ttgaattttt 
tatgttttcc 
acacctttat 
atatacacag 
tcgtttgatc 
accacacaca 
tatcaaaatc 
acctgttaat 
aagacgagct 
ggcggtggca 
gtcctacgcg 
ctctgcaatc 



tgttaaggaa 
tctttttaag 
ttatagtatt 
acgaggcaca 
agctttgcgc 
gtcataacat 
atgctcttta 
tctttggcga 
ccattttcat 
cctgcaaaca 
ttcacggcca 
ttttttcaat 
ttgagacatt 
ttattagacc 
ggcttaataa 
gaagtggagg 
agacaaaaaa 
acaacgaaat 
attgtttgcg 
ttgcagatgg 
gttttcagca 
gtcctggacc 
tacgttatgc 
aataattatg 
gtgtttccta 
ttttgaattg 
agaatataaa 
ttaaaaaagt 
caatttcttc 
ttccgcaatg 
agcaaaatcg 
catcagcatc 
agcatgtacc 
tcggcggctt 
cctggtccgt 
agtaaatatg 
actgcatcag 
caataaaccg 
cgattgagtc 
gagcactaat 
tttacccgta 
ggtgtcaatt 
cacctggagt 
cgaggccaaa 
gttcaattgg 
gctttgccac 
tgtcaataaa 
ttgtgcaact 
taatattcct 
gttttaaaag 
aaaactttta 
ataacaacct 
aataaaactc 
atcccacacc 
gatatgataa 
ccgcacaggt 
tctaccacca 
gcggcggagc 
cccggcgcgg 
tgccgtatca 



tgatttgtcc 
aaacgcatta 
attgggcaca 
aatgcggttt 
ctcataaaat 
tttatgtggc 
aagagcgaaa 
gttcctctgc 
cctggttgtc 
gatcctaact 
gtaataggca 
tcccattttg 
ctcgcaactt 
aagcccggag 
gccgtgggca 
gagcgcagcc 
tacgtttcgg 
gggctatcca 
tgtttggcag 
tcattgtcac 
agaccattaa 
gtgagatgtt 
cgctcttggt 
tttaaccaat 
gtcacgttac 
cttattaaag 
taattttggg 
gattaatgga 
tgagaaattt 
aaatttgaat 
ccatcgcctt 
agcagccgca 
atcaccatcc 
cgtcggcagg 
cactcgcttc 
agaagcgcgg 
aaaaatcgcc 
aaagtggagt 
cgcacgtgct 
ttattagata 
gctgttactc 
cacattaagg 
gtggagtggt 
gttatggcaa 
tcgggtgcac 
acaaactggc 
agccggcgga 
gccgcagatt 
cgaagtatca 
tcacaaaaat 
cataacttta 
aactatcttt 
tgtgctctcc 
acaaattagc 
tgttcaacaa 
gtccctggag 
cggccacgcc 
cggggccgga 
cgttgtccgc 
tgcccgcaaa 



cttttaaaga 
tccaaaatct 
tttaaacgca 
tcgaatgtaa 
actaaagaca 
ttgtgctaaa 
cttttgccca 
cgcatgtacc 
atctgtatgg 
gcggctcgtc 
agtatgcatt 
atgggtcaaa 
taataacttc 
actcatgcaa 
taactttgtg 
aaagggttga 
taatgacgat 
ttaacatgca 
ccgagcagac 
agtttggatt 
gaacatccac 
caactccaag 
gatgacggta 
ccttggatag 
ctagcattga 
tgattaatta 
tacataataa 
agtgtagatt 
gataagaatt 
aatgaattta 
gtggcgcagt 
gcagccatcc 
gcatcatcat 
agtggtggga 
cggcggcagc 
cgtcagcatc 
ggttgagcac 
cgaacctttg 
gtgtggtaaa 
caccatttgt 
ccgcactttc 
ccatcgcagg 
ggagcgaagt 
aagtgttttt 
catggcgtat 
acacacatca 
taaaaatcga 
gaaaattggt 
aatgtcaggg 
ggaagtttgg 
gtttacgctg 
ctgttgaact 
ccctctacta 
tcgataattg 
ctgttgcggc 
gccgatcgtc 
catcaccagc 
gccacccaca 
atgctgataa 
atgtggcagt 



gagaaattga 
tggcgacaac 
tgagtgcgaa 
tttcagccga 
ccttaatgtc 
tttggctctg 
taaataatta 
agtttgtcca 
agcgctactt 
tgagggtaag 
ttaaatgagg 
agcggaggaa 
atttgaatac 
attaaccggc 
gccgggaaaa 
accacatagc 
tgcactaatt 
aattcagtgt 
aaaacaaaac 
acatcggcag 
acgcaggacg 
ttgctggaca 
agtgtaacaa 
gatgttcata 
gcagactata 
tatcaataac 
ataatctatg 
tccctttatt 
gagaagcttt 
acgaacccat 
tcgcgcggcc 
tgccaggaca 
caccaccacc 
gtgggtttgg 
agcaccacgt 
acggagagcc 
ggaggagtag 
ttccctaatt 
taatgttttt 
cggcctggcc 
gccgggccaa 
accatgccac 
gaccaccggc 
ttttttccat 
ttcttatttc 
ttgttcttga 
gaatttgtga 
aaacaccaaa 
gagcactggt 
taggaaaaac 
ttccttaact 
cttttctctc 
actccccaat 
caaggtaggt 
cagcgaccta 
ccattgtcag 
gggcgggcaa 
tgtcccactc 
tattcgtgct 
actggtgagt 



atcaccaggc 
cttcaggatt 
attacgtggc 
aatagtaata 
gcattagttt 
ccaccaatat 
tttatgtcct 
ctccctgagc 
tgccatcgta 
ttctaaccat 
tctatgtgta 
atctgtttat 
aatttgcaag 
agaacagaac 
cttttccgca 
ccggttgttt 
aaactacaag 
gttcattcgc 
gtcaatgcgt 
tttactccac 
ggcaggagga 
tgattaactt 
atgatacgta 
aattcatttt 
attaataatt 
aactatctct 
aaatgatttt 
tttttaacga 
ttgcataaaa 
ttgcattgca 
tcacgccgca 
ttggcatggg 
accagcatca 
gtggtggtgg 
ccttgtcccg 
aggtgagctg 
ccttaaaagc 
ggccagaacg 
tgaaaatcac 
aagttggcag 
gtggcacggt 
aagctcggcc 
acagggcgaa 
tatcggtggc 
ttatttcgag 
tcagcatcag 
cacgatgaca 
ggcaaggccc 
tcgatttcgg 
aatggcggga 
tatgaccccg 
tctctacctt 
tataacaaca 
tgtaatcgaa 
atgccattac 
tgcctgccgc 
cgcctcagtg 
gtccagcaac 
gacattcgcc 
agtcccttcg 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
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tccacccacc 
cccttaccgc 
tattgcaccg 
tgcgagtggg 
aagtcatcaa 
aaagcggtct 
ctcgcctttc 
ttccgcttcc 
cgggcaattt 
ggcggtggaa 
ttaaaaacgt 
aaatgtcctt 
aagtgtgaaa 
aacttttcat 
attttacatg 
aataactact 
tttgtaagaa 
aacgaaaatg 
gcggacaaat 
ccacttttcc 
ttgtttactt 
tattggtatt 



acccataatc 
aggccattca 
gcaatcggcg 
cagataagct 
tggacagagc 
tccgttttcc 
aaaagcggtc 
ggtggagcac 
ggtttttccg 
gtgtttttgt 
gcacttaagc 
taagcgaaag 
atctggagac 
acagattcta 
ccgatgtgca 
gaaataaata 
aatacttcgc 
tttttgaatt 
aaatataaat 
cgcagtgcca 
gtttgaactt 
gttaggtgga 



acaatccaat 
tcagcccaca 
ggaatttgca 
ctagatggct 
tggagctgca 
aatcccactc 
gccaagaaaa 
tttcgaatag 
gctggcgggt 
ttgctaaatc 
tgctgactgc 
tcgaaagaca 
attggcaatt 
ctaaaccaaa 
ataaatccaa 
tcgatctcta 
tgcagatttg 
tttgttcgag 
aatatctggc 
taattgagat 
tcagctttta 
attgaaagtg 



gcgaatcgga 
cggactgagt 
tagtcctccg 
tcctgctgga 
gccagctttg 
ccttcccaaa 
gtccgtttta 
gtggacggac 
gtggtgggcg 
gaggaggagt 
tgtaaattgg 
aacgattgat 
ttggcacgta 
cgtttttgag 
tttaatgttc 
gttctcatta 
caacgttgca 
cactcaattt 
caaatacaag 
ttatgtctag 
tttatttgaa 
tt 



atcccaatcc 
gtaaacaata 
ttggacggga 
atcgaggcta 
ccgctggcct 
gacgccagtt 
gccagccctt 
cgcttagttt 
tataacttga 
taattgctcc 
aaataaaata 
tgaagatggg 
agagaaacgt 
tcaattgtaa 
ataagttcat 
gtgcaaatat 
cttttggcaa 
gcacaattgt 
agaaatggat 
cccggaatgg 
gaattggttg 



ctatccctat 
ttttatggta 
aattaacatc 
gtaaacaatt 
aaatgatttg 
tcgctttctt 
tggctateag 
gctttgaatt 
taagtgagtg 
ctgggtagaa 
aaagtcacag 
aaaagtcatc 
ttttattttg 
attgtaaagc 
gtttttggtg 
tcagtgtaat 
acgcaacgaa 
aaaacgaatg 
ttccacatcg 
gcagcgagac 
cccccaggga 



5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7112 



<210> 173 
<211> 1437 
<212> DNA 
<213> Drosophila 



<400> 173 

atgcagaatc 

atgtaccagt 

tgtatggagc 

gctcgtctga 

aagtttgttt 

atatgtgtcc 

cttctctacg 

ttgtggcgca 

cagcagccat 

ccgcatcatc 

ggagtggtgg 

tccggcggca 

ggcgtcagca 

cccgcacttt 

gccatcgcag 

tggagcgaag 

gaggccgatc 

gcccatcacc 

ggagccaccc 

cgcatgctga 

aaaatgtggc 

tggtatattg 

acatctgcga 

caattaagtc 



tgtccatcta 
ttgtccactc 
gctactttgc 
ggatggtcat 
tcagcaagac 
tggaccgtga 
ttatgccgct 
gttcgcgcgg 
cctgccagga 
atcaccacca 
gagtgggttt 
gcagcaccac 
tcacggagag 
cgccgggcca 
gaccatgcca 
tgaccaccgg 
gtcccattgt 
agcgggcggg 
acatgtccca 
taatattcgt 
agtactggcc 
caccggcaat 
gtgggcagat 
atcaatggac 



tctgatagaa 
cctgagctac 
catcgtacac 
tgtcacagtt 
cattaagaac 
gatgttcaac 
cttggtgatg 
cctcacgccg 
cattggcatg 
ccaccagcat 
gggtggtggt 
gtccttgtcc 
ccagttggca 
agtggcacgg 
caagctcggc 
cacagggcga 
cagtgcctgc 
caacgcctca 
ctcgtccagc 
gctgacattc 
attcatcagc 
cggcgggaat 
aagctctaga 
agagctggag 



agctgggtct 
acggcctcca 
cccattacct 
tggattacat 
atccacacgc 
tccaagttgc 
acggtgctgt 
catgtggtgc 
ggtatgcaca 
caccagctgc 
ggtgggggcg 
cgcaaacaga 
gtcgtaaaaa 
tgctcggagc 
caccgatgtg 
aactgcaagt 
cgcaagacga 
gtgggcggtg 
aacgtcctac 
gccctctgca 
ccacacggac 
ttgcatagtc 
tggcttcctg 
ctgcagccag 



ttggcgagtt 
ttttcatcct 
gcaaacagat 
cggcagttta 
aggacgggca 
tggacatgat 
acagcaaaat 
agcatcagca 
acagcatgta 
aatcggcggc 
gtcctggtcc 
gcagtaaata 
ctttacccgt 
aggtgtcaat 
acacctggag 
gcgaggccaa 
gcttctacca 
gcagcggcgg 
gcgcccggcg 
atctgccgta 
tgagtgtaaa 
ctccgttgga 
ctggaatcga 
ctttgccgct 



cctctgccgc 
ggttgtcatc 
cctaactgcg 
ctccacgccc 
ggaggaggag 
taactttgtg 
cgccatcgcc 
tcagcagccg 
ccatcaccat 
ttcgtcggca 
gtcactcgct 
tgagaagcgc 
agctgttact 
tcacattaag 
tgtggagtgg 
agtgtccctg 
ccacggccac 
agccggggcc 
cggcgttgtc 
tcatgcccgc 
caatatttta 
cgggaaatta 
ggctagtaaa 
ggcctaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1437 



<210> 174 
<211> 478 
<212> PRT 
<213> Drosophila 
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<400> 174 
Met Gin Asn Leu 
1 

Phe Leu Cys Arg 
20 

Ser He Phe He 
35 

Val His Pro He 
50 

Met Val He Val 
65 

Lys Phe Val Phe 

Gin Glu Glu Glu 
100 

Leu Leu Asp Met 
115 

Val Met Thr Val 
130 

Ser Arg Gly Leu 
145 

Gin Gin Pro Ser 

Tyr His His His 
180 

Leu Gin Ser Ala 
195 

Gly Gly Gly Gly 
210 

Ser Thr Thr Ser 
225 

Gly Val Ser He 

Val Ala Val Thr 
260 

Glu Gin Val Ser 
275 

Leu Gly His Arg 
290 

Thr Thr Gly Thr 
305 

Glu Ala Asp Arg 

His His Gly His 
340 

Gly Gly Ser Gly 
355 

Ser Ser Asn Val 
370 

He Phe Val Leu 
385 

Lys Met Trp Gin 

Asn Asn He Leu 
420 

Ser Pro Pro Leu 
435 

Ser Arg Trp Leu 
450 



Ser 


He 


Tyr 


Leu 


5 








Met 


Tyr 


Gin 


Phe 


Leu 


Val 


Val 


He 








40 


Thr 


Cys 


Lys 


Gin 






OO 




Thr 


Val 


Trp 


He 




70 






Ser 


Lys 


Thr 


He 


85 








He 


Cys 


Val 


Leu 


He 


Asn 


Phe 


Val 








120 


Leu 


Tyr 


Ser 


Lys 






135 




Thr 


Pro 


His 


Val 




150 






Cys 


Gin 


Asp 


He 


165 








Pro 


His 


His 


His 


Ala 


Ser 


Ser 


Ala 








200 


Gly 


Gly 


Pro 


Gly 






215 




Leu 


Ser 


Arg 


Lys 




230 






Thr 


Glu 


Ser 


Gin 


245 








Pro 


Ala 


Leu 


Ser 


He 


His 


He 


Lys 








280 


Cys 


Asp 


Thr 


Trp 






295 




Gly 


Arg 


Asn 


Cys 




310 






Pro 


He 


Val 


Ser 


325 








Ala 


His 


His 


Gin 


Gly 


Ala 


Gly 


Ala 








jdO 


Leu 


Arg 


Ala 


Arg 






375 




Thr 


Phe 


Ala 


Leu 




390 






Tyr 


Trp 


Pro 


Phe 


405 








Trp 


Tyr 


He 


Ala 


Asp 


Gly 


Lys 


Leu 








440 


Pro 


Ala 


Gly 


He 






455 





He Glu Ser Trp 
10 

Val His Ser Leu 
25 

Cys Met Glu Arg 

He Leu Thr Ala 
60 

Thr Ser Ala Val 
75 

Lys Asn He His 
90 

Asp Arg Glu Met 
105 

Leu Leu Tyr Val 

He Ala He Ala 
140 

Val Gin His Gin 
155 

Gly Met Gly Met 
170 

His His His His 
185 

Gly Val Val Gly 

Pro Ser Leu Ala 
220 

Gin Ser Ser Lys 
235 

Leu Ala Val Val 
250 

Pro Gly Gin Val 
265 

Ala He Ala Gly 

Ser Val Glu Trp 
300 

Lys Cys Glu Ala 
315 

Ala Cys Arg Lys 
330 

Arg Ala Gly Asn 
345 

Gly Ala Thr His 

Arg Gly Val Val 
380 

Cys Asn Leu Pro 
395 

He Ser Pro His 
410 

Pro Ala He Gly 
425 

Thr Ser Ala Ser 

Glu Ala Ser Lys 
460 



Val Phe Gly Glu 
15 

Ser Tyr Thr Ala 
30 

Tyr Phe Ala He 
45 

Ala Arg Leu Arg 

Tyr Ser Thr Pro 
80 

Thr Gin Asp Gly 
95 

Phe Asn Ser Lys 
110 

Met Pro Leu Leu 
125 

Leu Trp Arg Ser 

His Gin Gin Pro 
160 

His Asn Ser Met 
175 

Gin His His Gin 
190 

Val Gly Leu Gly 
205 

Ser Gly Gly Ser 

Tyr Glu Lys Arg 
240 

Lys Thr Leu Pro 
255 

Ala Arg Cys Ser 
270 

Pro Cys His Lys 
285 

Trp Ser Glu Val 

Lys Val Ser Leu 
320 

Thr Ser Phe Tyr 
335 

Ala Ser Val Gly 
350 

Met Ser His Ser 
365 

Arg Met Leu He 

Tyr His Ala Arg 
400 

Gly Leu Ser Val 
415 

Gly Asn Leu His 
430 

Gly Gin He Ser 
445 

Gin Leu Ser His 
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Gin Trp Thr Glu Leu Glu Leu Gin Pro Ala Leu Pro Leu Ala 
465 470 475 



<210> 175 
<211> 17847 
<212> DNA 
<213> Drosophila 

<400> 175 

ccttcgactt gagcggacag ttgaggttct cgaactaatt gcaaaattgt caatgattta 60 

ttcagttggc aattaaggtg ccaactaacg gtgtggctgg cacggctata tcgatttgaa 120 

aatattcaat aatcccggag ccttgataag ctcatcaaac atgcacatgg ccatcagcag 180 

gggggggcaa gtttgtattt agagccaagt gtgtttggtg aaggggggtt agctgtatat 240 

ttcagggttt cccctttttt tgtcgagcgg tttggggggg ggttttgcga ggggacacaa 300 

aagtacaagt tgctatggat caagctcaag gcgtggcagt tgggcattag ctctatggtg 360 

gagcacggca accaagcgaa caaaggtcaa ctcgacaagt gagttaaatg acaaggacct 420 

caccttgcct tccttgggct tcggctttgg ctttggcccg tgcgtgtttg catgtttggc 480 

tcgatgaggc gacgcgaccc caaaaatgaa actcaacttt ggctttgact gcggctcggg 540 

attgtggcgt gcaagcatta tggcctttcg ctttcaaaac aatgccccca tcctgtttcc 600 

cggttggcct agtggcactt gctccttgtt gttggttttt cgcattgccc ttggccaatt 660 

gcgagtgaat tttgatgttt tgactggcca ttcctggggc catttgacaa ctctataaaa 720 

ccggtgtcct taagcccgcc ctcgtccttg cgcccaaccc ccaattaccg tccaactacc 780 

acccacctac accccgctac acccacccac aaagtaaacg taattgcccg gagagtttcg 840 

cgactttcga cttggccagc aaactgaaaa agcgaaactt caaaaaaccc gacttgactt 900 

ggcaactgga aaagtggaaa agcggaagca cgagctttcc caccgaaaag ctctcgtgct 960 

ctaagctttc gtcctggcga ggcgaaaacg taaatgtcag tcatggagtt ccagtcgtcg 1020 

gatacatgtg gaacacaacg aacacagcag gagcctctcg cgtctcatcc cccgaataca 1080 

atgcgaaaaa gtctgtacaa aaaatagaca aataaggagc aggagcgaaa acaccactct 1140 

gtgcataaga tataactccc gcagaactga atacctgcaa aaagaaatac gcgttaaatg 1200 

gaattacggt gcgggaattg ataaagtgaa gtgtgttaca gctagaaagc aagtgaaata 1260 

aaacgaatac attaataatt catgcccctc atagtcagaa gcttactagt ttgataagga 1320 

attcaacgta aatctgatat agtaatgtag aaaagttgta caaagtaata tacatagcat 1380 

atagcatata gcacagttac acagaacaca ccgcaaactc aaacaaagtt gacaaaaaaa 1440 

aagctcacgc ccctcacatc tcagtttttt ctcgaattat ttgtgaaagt gcaacaaaca 1500 

tatataaaaa ataaaaaaga aaagggagga gagagcacag caaaaaaagg cagtgacagc 1560 

aggacctgca aaggaattgc catcatttgg ggattacgga attatcggga tccgttctca 1620 

ttaaaatgga tttaaggcaa tgccgatgtg ctgagctgca tttgtgcatg ggattatcga 1680 

ttgtaatgcg cagtcatcaa attatgtttg aacaacgccg ccgcgacagc gataatagca 1740 

acaataacaa caacaacaac aaccaaatca acaacaaaca ctgatttggc ccctgacccc 1800 

ctttggcagc accccctcca tccaacaccc cctacgaccc tgcccaagca aaaggagcag 1860 

cgaaaaagaa gaagcgtctt atgaaacaat tgaagcaccg gccaaaggag caaaggagca 1920 

gacaaagaca gagacaaaaa catcggcaaa ggataaaacg gaggccgaga agaagaggaa 1980 

gcagtagtag caaagcagga aacaacaaca acaacaaccg caacatcgca tttttaaacg 2040 

gcaatacgca aagtaagtga agtgtaaagt gagcgagatg gcaatagaaa ggtgtagagt 2100 

gccgcgaaga gaggcggaga gaatacactt gcaaaaatgt gagcagctaa ttcccaggga 2160 

tgttaaaaaa aattaaaccc tcatggttca aagattctag gagatggaca agtgatgaat 2220 

ggtcagtgga aatatatgta tgttttaagc caagtgaaat caaaatgatt tgagaagtac 2280 

aaataaaaga aagaaaggaa agaatgatat atgatattta ttggttgtag tacaggaaaa 2340 

taatagttaa aatgatatcc cggattgatg gtcattacaa tgatcgtacc acaaaactct 2400 

ttctaactca aattaaattg ttaaaattat ttttcaatac aaatttgata tatttttgtt 2460 

ttttcgccga caagatgcat agatctttca tagaatctga aatctatatc ttttttatat 2520 

tgcttttgag aagagtaaac cgatcatcat ctacgatcac tcctttttgt tctgtgtagg 2580 

tatgaagaga gcaggtggga aggcagtgaa gtgaagtgaa gcatttgtcg catttgtgca 2640 

gaattcccga cgtaattaat gtttaaagtc tgtgtgtgtg tatgtgtgcg aaggtttgta 2700 

aactaggtca ctaggaagtt agccccttga cgaccatccc ctctcatcag cgcctcttca 2760 

gcggcaactc tacgtcaaag gtgtcaactc gccaaatctg caaccacacc ttagttaagt 2820 

gagccacccc gtagccacca aaattatcca cagcaccacc accttttatt cgtccttaac 2880 

acaccgccat gattaacgcg cctcagtgcg attaactggc aaacaaattt ggagtgacgc 2940 

tgcggcgatg tacatacata catatatgta tgcatgtttg tgctaaaccg acagcatttc 3000 
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cattcaataa cacacgcggc tcgtatgcaa acactgtaaa aaggttttat gctaaaaaag 3060 

acgccgtatt tgtgcatgaa aaactgtcat tcatcatcca accaacacaa ataaccggta 3120 

atggcatcgc cacattatgg ctttgtattt tttacatcct caaacacaaa aaaactttag 3180 

ttgatttctg acttgcatcg cattcatcac ccactaaagt cgagcaatca atcctcaaaa 3240 

tacgtatacg cactgttagt tcagttgatt aataggctaa ttatgaagga aactgaaagg 3300 

aaacactaaa tcaataaatc ctggcaaatt agcatcagta tatatttgtg ttagtgcaaa 3360 

catttaagtt ttctatcaat tttaaataca aatatgttca gaattaagat atatttcaaa 3420 

gggaactatg ctttgtttaa ttaagattta aggttttata actataaaag ccttcaaaat 3480 

gtgccatcta ttttatattc acttacaagc agatttcaat caagaccact ctttaatatg 3540 

atatattatg tctagaacaa ccgagaaggc actcttaaat taaatatttt agtaccgtaa 3600 

taaattatga aacgtgctga cggtaaaata tataatacaa ttcatacttt aagttatatt 3660 

ttttttttgc cctatatggt tttacgtatc ggtgtgtatg tgagtacctc gcacacgcag 3720 

gacatagaga taaaaaggct tctcccacat aaaagttcta ttctgttttt ttttttttat 3780 

agaaaaatat tgctggcagg agtttagaac attgccaaag gggatgcgga cagatgaagg 3840 

ggcgggggcg tggcgggtaa aagccataaa actcaggcgc cgaagcgcaa aatgcccttg 3900 

ccctgaacaa aaagaagaaa ttgaacagca acaaccaaca acaaaaacaa agaagaacaa 3960 

gagcaaataa aatacaaact gcttatttat ttatttgttt cgacaaatct caattgaaac 4020 

aaagcttaaa agcttcgaaa tcaaaactta aatagaaacc acacagacat ccatccagac 4080 

gaataaaacg aaaacgatag agcgcgagac aaagagacaa ttactattaa atccaagttc 4140 

aattcggtcc gcaaacaatt gaatttgttg cagatttgat ttggcttcat gtggattgcg 4200 

ataactattt tcctgggatt agtatgggat tggattcatt ggcccgaccg cagtttgctt 4260 

cctaatgaga tttgcttaat taaaatgaaa ttcaataact ggaattccaa tgaatttaaa 4320 

tgttagagaa aaactctaat tcggactttt gtggctagcc attatttgca gcaaggggga 4380 

cataaaaagc atttggcttt cgtttggatt agtcagcaag gtcaaatctc gaaactgacc 4440 

taagctaatg caaaatgact tttcctgtca atcacaatct atatttaatt caagcgaaat 4500 

tctctgccac atttcttgcg ctataaatcg cagtacccat tatagcatat caacaaccaa 4560 

attaagacga aagtacaata aaatgcaact aatttcagtt ttaataatgg atttgctggc 4620 

ggcttatttt tgtttcaaat ttggttattt tctttagaaa tgtgcgcata cattaagacg 4680 

aacacaaaaa ccagctttgg ctcaatctct taaaccttta aaacgttata tgtacgcaca 4740 

tatttatgta cgaacaaatg aaggaaaatc tcaagcctaa ttccgcatta agctgcaagg 4800 

aaacacataa cccaaaagta tgcaacataa ataaaattta aatcagcacg tttatgaagt 4860 

ggattatttg aactgaaatg cttattcaac gttattagat gagatttatg gccaggagag 4920 

catttgaatt ggcagattat tcgggccaac caagtaatgg ccactatatc aaacaaatca 4980 

gttatccttc aattgcaaaa tacacagtgc agacaaatag tttaaactgt cttagtggta 5040 

atcaaaatca actttaatta tcaactaatg tatgtgtatt ttacgaaaca aaagtgcatt 5100 

atgcatgaac gaatgatagc aatatttttc aatataaaag cttacgattg ttttcctttt 5160 

gttatcaatt atctatcaca ctccacagac atactcctag atcataataa taagttatat 5220 

aaatatagca tagattttga aaggttaaaa ggttttctcc ccaccactcc ctctaataat 5280 

attgtatata ataaaatata tatacaaatg tatttcataa tttcattgcc atccaattcc 5340 

gcatagtatc tggtataccc catctgcaga aagacaccga agtgtataaa ttgcttacgt 5400 

ttgcccatgt agtcataaaa attgctttgg cacataaaac gaagcttgac gctcgctttt 5460 

ggttttggtt ttggttttgg cctggtctag tctggtttgg tttcatttgg ccccttaccg 5520 

ttcttgtttc ctttggtttc catcgctggt gggcttgttt ttttgcgaga gtcctggccg 5580 

tgtccttctt atatttctat ttctatttgt atttgtattt gtatttgtat tccccgattt 5640 

gtattttgtt cgctcgccaa ttcggtaatc acatttcaag cgttttattc gtcctgattt 5700 

catacgattg ctgattcaag tttctgtttg tggtccttgc cttgcaggat atgacaagtg 5760 

atggtgctgt tacgttttgg atatttttgc tttgtttaat cgcctcgccg cacctgcaag 5820 

ggggcgtggc cgggaggccc gatgaactgc acatcggcgg catctttccg atagccggca 5880 

aaggaggatg gcagggcggc caggcgtgta tgcctgccgc aagactggcg ttggatgatg 594 0 

tcaacaagca gccaaatctg ctgccgggct tcaagctcat cctgcacagc aacgacagtg 6000 

aggtgagtag tcagtacact gaaaaaaaaa cgatatatac ctaaggatat tttgaatgta 6060 

atgcgatgca atagtcctac agtccaagtt cgatacgaaa atagtttttt atgtttcctc 6120 

ttaagctggc tttaataaga ataaacattt attgaagctt ttgaactttc ttataatttt 6180 

cataggtaac taatatatac aatatattac attaaatacc caataagtat ccaataaact 6240 

tcgaagcaac acagaacttg tttgcaacac gattaatatc actgcaaata atatcctgac 6300 

caaaaacccc tcctgatgca ctgagttctg tttaagcctt ctgttcacga agcattttcg 6360 

ctcagtgcac tagtaagttg gaaagcgata agccctactt atatgcagcc cctgctgcct 6420 

cctctgctgc catgccacat acccttgggt gccacttatt ttgtttcgtt tttgtttctg 6480 

ctcactgatg agtagtggag tgctggctga gcgctggcat tcattagcat atttctctag 6540 

gggatcaggc tggggctggg gctggagctg ggcactgtta acaaaggaca aacagtttta 6600 
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ttatgccgca 
tccgccgcat 
gattgagatt 
tcggttaatc 
aaaaatgtgt 
tcataaatgt 
cgaaaatctc 
aaacttcgaa 
agcagcacga 
gaacacagta 
attttgtagc 
ggaaaaaata 
atgctgtaac 
acggtcatgc 
accaagttgc 
caaaaccaaa 
aatttcgcat 
ttttgggcct 
ctacttgttt 
tgagttattt 
ctacgaacat 
ggaacagagc 
gtggtctggt 
gactgagcct 
tgctgcataa 
catccacact 
tggcaagcga 
tgacctactt 
acaaaaaaat 
gcttaagtcc 
tttgattttt 
tgccactttc 
tcaagcttta 
ctaaaaggtt 
caattattta 
taatcaaaaa 
attgttttgt 
actaataaat 
atcagtttta 
aaactattaa 
cccagatctg 
ggacatgaga 
accaagtaag 
gccgaacagg 
gaatgaatca 
gtatattcac 
tgaatgaatc 
gaaaaatatc 
agctacataa 
tgcatagttt 
ttaagcagct 
ttttcatttt 
agttttgtgc 
tcacaaagcc 
caggaaattc 
catcgcgttg 
gcaccgacgg 
cccatcccac 
tctgctctat 
cactgtagcc 



ttaaagggta 
gtccaacaat 
cgactggagc 
gggcttttat 
gtatatctcg 
atttacattt 
tcatttccca 
cttggttttt 
actgaacttc 
ttgatcatat 
taacttaaag 
gagcatgggc 
ttctgttggc 
atcataataa 
gcgctatcaa 
actatcttta 
tttaagggtc 
gcctattttt 
tgagttttga 
ttattcaagg 
atacgaacat 
cggagaggta 
ctggtctggt 
gagagccagg 
ttagttgcca 
tactcccaca 
caaggggcaa 
tgatgtgtga 
gtataaacaa 
aagatcacac 
taatggccaa 
tatcggctaa 
ggaataagcc 
tttgcttttt 
aactcacatt 
ttatataaat 
acaatgtaat 
catttagcta 
tgctgcttag 
attagagcgg 
aatagatgga 
aggaaatgaa 
aaatatgaaa 
gaaacgagtt 
tttatgtggc 
aacttcggca 
ttgtttaatt 
aaccaggtgg 
atattgctac 
aaaatttgat 
taatttcgta 
caattatatt 
attaaattca 
aattaatcat 
gctattaata 
gccatcaaac 
atgagagtat 
cattttcatt 
aataaaccgc 
gaggctgcca 



ttgctgcgcg 
atttggcaat 
acgttcgaaa 
gtgcgtgatt 
aacaaggtcg 
attcgtgtag 
gaaagtttct 
tgcaatcaaa 
tgcacaaagc 
tttttttatt 
agcgcggaat 
acgcaataaa 
ttttagagaa 
atggccagta 
tgccccttgg 
tacattttca 
tgcggtacag 
aagcgcgaac 
atgaggtcac 
ctgccaggag 
atacacgccc 
aaccaagtcc 
ctggttagtt 
cgcgttatgt 
tctccaggtc 
acgcaacgca 
aggtcgcacg 
aatgtcccaa 
ggcaggttgg 
agaacaggtg 
atataggtgc 
gtgccttcat 
ggaaattgat 
cttttttaat 
ttcaattgca 
aggcatctac 
aatattccta 
ttttttggtt 
ttgttctata 
acatcgtggc 
gtagtacgaa 
aggaaaaaaa 
attgtcgtca 
tgccatgtaa 
caatgtggat 
caagggaaag 
gataaatgct 
cacataaatt 
aaattcgtat 
tgattataat 
gtttgctatg 
ggcgaaaatt 
atttaaaagc 
atcaataatt 
actcctgaag 
gaaacctaca 
aagtattctt 
cacagtgtga 
aaaagctgat 
aaatgtggaa 



tgtgttccac 
gcctcggctt 
cgttaccaac 
cgtaattgat 
cccaataaat 
tgcaacgctg 
atgagatcaa 
agttttacgc 
ttttcttgaa 
gatccttata 
atgtttttag 
tcaacatttc 
ccagctatta 
accgatgtag 
cgcgtatctc 
tatttcgcac 
ttgttgcggt 
ctcattagtt 
tttgccccgg 
ggacagtcac 
atttacgagc 
tatccctggt 
cagttcagtt 
ttgtggctta 
ccacagccac 
ccaccccatc 
caatgtttgc 
agaattctgc 
ttatcctgta 
aatacttttg 
tgaaaaatcg 
gcgaaccact 
ttaacagcgt 
taaagtttaa 
atacagaaaa 
agaatctgcg 
aatgtgcttt 
aatattaaaa 
aaatcagtat 
atgcctcggt 
caggataata 
aaaccaatta 
gaatggggtt 
atgaatgaat 
ggatacacgt 
ggacacacct 
atgacatggg 
aggcaggact 
aataaatcac 
tttcaatttt 
tcaaatccag 
ggacgagatg 
catgccaatg 
atcttccaat 
ggtcgttgct 
tatccgccca 
ggcaacttgg 
gcccggtttg 
gctgttggca 
tctaattgtg 



aaattgaatt 
aatgactgca 
agcaattata 
aaagggtatt 
gtcgcccggt 
taagtttaat 
taagtaaaag 
attttatctt 
taaatcggta 
taaaattgtg 
ctggcatatt 
aatagtatcc 
aacaaatacc 
tttccgtttt 
taatacaata 
atattttttg 
ttccactact 
ttaagtgagc 
ccgagccgta 
tactactaca 
cagatacgga 
catatctgtt 
cagttcgctt 
ggcaaacagc 
cttctcctcc 
ccgcccacat 
ctttgccgct 
acaaaagcac 
ataagcgaaa 
tggcgaattt 
tattattgca 
gaaatggctt 
tgggataaaa 
ttaaatcaca 
ggtaaatata 
agttttaatt 
gtattataga 
gtactatagt 
gcctctgccc 
acccaaagcc 
cttatgcata 
gaatggttgt 
tgggccaaag 
gcgagaatga 
atgcccacgt 
tcgtgaatta 
gattaatgaa 
tggaatatgg 
gctaaatgca 
ctctgccagc 
cattccatcc 
acgaaaattc 
tcataaacgc 
gaagacagac 
tagcataatc 
tgtttatgaa 
ctaattccgc 
ggcgccagcg 
ggatgcagca 
gtaagtgaaa 



cgccgcacaa 
cataaattgc 
ttggctgagt 
tagcattaag 
gttcattaaa 
tatttcaatg 
tttttgctat 
taatgccagc 
taatcaaagt 
aaatttttgc 
tgttctaaat 
agttgtctgg 
cgccctccaa 
ccagttccca 
catataaaac 
tttttttgtg 
tcggcggctg 
gaaagcttaa 
gctcatcatt 
tacatatatc 
catatagata 
cctatctggc 
cgcttcgctg 
ttccgttaaa 
ggcacatcca 
ccacgtcact 
gccttttgct 
tgagagttga 
tccctagtcg 
cgaatcgaac 
tttgcaaagc 
aaaccggttg 
atatgtgctc 
acaaagcatt 
atggctaagg 
taaattgaat 
agaacattct 
actatgtact 
acgaaaaggg 
aaagcacaca 
aacagggact 
ttacacgcaa 
taaaggaaac 
ataactgaat 
atgtatgtat 
ttcattgtca 
aagcataatg 
aaatttaact 
acaattaaca 
aagtttttaa 
agttgattaa 
cttctgggcc 
cccgcatgaa 
acataaataa 
ataatcaata 
tatttatcga 
tttccccatc 
tgatgtacaa 
cggtctgcac 
tttgcatagc 



6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 
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gccagaatgg 
ccctccacag 
cactctattc 
gaagaaattc 
ggtaggtaat 
agaccgtaga 
aatcatttct 
tcattgtggg 
agcagctata 
tagataccga 
ttttttttta 
cacctgcact 
catgtggaat 
agtactacta 
acatcgactg 
ggagggatat 
caacaagacc 
ggacaaggag 
atcggtcagt 
cagggtgtgg 
atgatagacg 
tggttgaata 
acaatcatat 
cgactgaacc 
taacaagctt 
agcaagctcc 
acacagtaac 
aagaatctat 
agctattagt 
tggaaccatc 
caacgaggcg 
tgatcgatta 
acagtacact 
aggaggaaga 
tgccacagaa 
agttgaagga 
atgataaaac 
ttgaagaagt 
aacatatagg 
gctccaaaga 
aaaataaaga 
aagcttggac 
tgcaagaata 
tcaatacgga 
atgacttagt 
atcttaagaa 
atataggaca 
aaatcataat 
ttggaaccag 
tgcatcccat 
ataatggacc 
aatatcattt 
aatacaagct 
ttttgtttgt 
gaaaggtacc 
gaactaatcc 
acgctggtat 
gaacgtgtac 
aagttaatcg 
aagccaagat 



ctggaatata 
ctctgctacg 
cgcacccatc 
ggctggtccc 
cggtgctaac 
ggatctcgag 
atccgatcca 
actcttctat 
tggccgagct 
tatctcatat 
ggctggtacg 
gttgaacaga 
cagaacaatc 
ctctatccat 
aatcaggcgc 
caggaggcgc 
atggaacgat 
attgccgatg 
ttgtgaaact 
tggcattcag 
gcaagtacga 
ctgaacagtg 
aatcaaaatg 
gaataagaac 
aagtgagaaa 
taaggtagca 
ccaaaaagtc 
tggtgtgcga 
aaagaccgtg 
tagacgcaca 
agttcgtgag 
tgatcgatcg 
tcgacgtcgt 
gatcgattta 
tgccgaggat 
gctcaaggag 
cattgatgaa 
taaatataat 
tacaatcaca 
aagtgaagaa 
cgatgaacag 
ttttccgatc 
ccaaaaggtt 
tataagcaga 
agtataccat 
tcaacgaaag 
cagcgatgat 
accaccaatt 
cacagataag 
taagactgat 
ctacaagatg 
tcctggccga 
acgccgtcca 
cattatcttt 
catgataaag 
caaggcggta 
aatggctctt 
tgcgaatgaa 
gattattgga 
cgatgaggtt 



tacatacata 
gggcctcgag 
catcggccac 
gggtggccat 
taattacatt 
aatcgatgca 
acagacgccg 
gtggtggccg 
catgtgtggt 
gcatagatac 
aggacaactg 
tgcgaatagc 
agacaactat 
tgctaaagtt 
taatcgagga 
cactcgccta 
tgacaaccgg 
aaatctacgc 
aatttcataa 
ttctcagggc 
gaagttgggt 
gattggtggc 
tcggacgatg 
ggaaatccgt 
aatgatgaat 
ccaaaaccga 
aaagaaaatg 
gcaaatagaa 
cccggaaaat 
aaaagtcagc 
ccatttgatt 
tccgaggacg 
tttgtcagca 
gattcagttg 
gaacgacctg 
caacgcgaat 
gtgaagggtt 
gaaacagcaa 
aaaatcttta 
gacgagaata 
aaacttcccc 
gcagagacat 
aaaattggca 
aaatctaaga 
gatggtggaa 
aatcataaac 
cgagtaaggg 
gccgagatgc 
ccattttggc 
aggaagaaat 
ccaaccaaag 
acggaatggc 
agtcattggc 
tccatggcct 
atggcacaac 
tcctttgatc 
ctggaaaaat 
aagtttggct 
tttaagaccg 
gaatttacgg 



cttcgaaata 
tccggctctt 
ggtgcacaat 
tctgcagcag 
tccatattaa 
tggaggctgg 
tgcgcaattt 
ccaggagggt 
tctttattgg 
tttcacagag 
gtacgaggtg 
tgccgaagga 
atccggaatg 
tttgcttcaa 
gggttacgac 
cgatgcagtg 
gaagaaatct 
tgccatgaac 
tacttagtta 
gatcgtattg 
tactacgata 
aaggtaagca 
ataaaaatca 
ccaaagacga 
ccaaaaatcc 
aaaagttatc 
aagagccagg 
acagcagtgc 
ctcttataaa 
cagttggaaa 
cttttgatag 
tgccggaaaa 
ataatgaata 
gatcgtcaac 
ttatcctagc 
caactaacgt 
ccaagatcac 
gtaatgaaga 
gcattgcgca 
aggacgagaa 
cggaaccaga 
cacctgatat 
aaagaaacag 
ttaaaacatt 
gattgcgaga 
acaaaaccga 
aaattggagt 
atgtgcacaa 
caatcgatga 
taaacaaact 
aagatagacg 
gcaggctttt 
tctacacact 
ggttcgattt 
catttattag 
ctagaaacag 
acggcgacta 
atccaagcgg 
aaccctacat 
ccttaaaaag 



ctttacctac 
tcggatcgca 
ccaacgcgca 
gcggaggagg 
cttcgaaaac 
cgttgaaatc 
gcgacgccag 
gctctgcgaa 
taaatacaaa 
tcttcttgta 
aatctgaaag 
catctgacaa 
actgcagagg 
tgtgccttct 
attaaccacg 
tggagtgtgg 
ctgagggatt 
tccacacaat 
gttacttatt 
ctcttacaca 
ctcagttgga 
aatcagttag 
aaagaagaca 
cgttaataca 
atcatcatct 
aatttcggat 
attttctaaa 
aacaaaaaag 
agaatctaac 
aaaagtttac 
ggaaaagtac 
aaaactatct 
tcattctagt 
aatgcgaggt 
tgaaataatc 
atttcgtaac 
aggaaccagc 
tgatgacaat 
aagaatgaaa 
taaagatgta 
gatcgaattg 
ccctggagaa 
gaaatacaga 
tatcagcaag 
tataggacaa 
accagatgat 
gaatactaaa 
aaatggcaaa 
cggaactgat 
gattgtggat 
aacgtactac 
cttcaacaaa 
agttttttcc 
catcaaagat 
tttcactccc 
cacagaagtc 
tggacacaat 
cgaaccgtgt 
caattccgat 
gttattggaa 



atgccactgc 
aacgattccc 
tcaagctgat 
tctttatatc 
ttttatgaac 
gtaactagac 
gatgcacgca 
atgtacaaac 
ttatactact 
actaaactac 
cagagggcat 
cggaagcgct 
aatttcggtt 
ttttgcgcag 
atcgctatcc 
ctttggcttt 
ttacctatac 
ttctgggtgt 
tttttatttg 
gatcgaacag 
taacctatcc 
cgttgaaaaa 
aagcatgtaa 
agaggatcat 
gcaagtcttc 
gctggtaaag 
caatttgaaa 
gaaaatgaaa 
gatgaaaatg 
gaggaatctt 
ctgtctgata 
tgggaaccag 
cttgaaggtg 
agctatcgaa 
aatatgggaa 
aatcgtagag 
aaacaacatg 
gcacaggtaa 
gagaagaaaa 
aaaaaagatg 
gaggccaaaa 
aatcagatcc 
tgtataggca 
tctgcaggtg 
accactgaaa 
atacccgttg 
aagttaccaa 
ttaagggaca 
gtaatctaca 
ccaccgccgg 
aaaggttgtg 
atccatggaa 
gttttatata 
gacgcttcca 
atcggaccac 
atggaaaagt 
ccgcgttttg 
gtttttctta 
gagcttgtca 
aatacaacaa 



10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
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cagaggaggg 
ttttgattga 
aaatcgagta 
gaatcgtggc 
aaatgtgggc 
tgttattagc 
cattataaaa 
ttatttataa 
cgcaccgtgt 
gttgccttcg 
tggtagtgct 
gcatcccgtt 
cttactgggc 
ttaccttaaa 
atatgtcaag 
ttcagcaagg 
ggtttaaatt 
aaaaaagtgg 
gtgatattac 
ctcgaagatc 
gaaagtcaac 
ataatatatt 
tttggcctct 
cgttatgtgg 
ggcatggtca 
ttctgttgtt 
ctaaaagaag 
catcccaagg 
gaagaacgct 
gtatgctgaa 
gattaaattc 
gaacaaaagt 
tttcaccttc 
cttataaatc 
caaccaccgc 
gttgagaacc 
cccacccgtt 
gcgtatactg 
ctagtcatta 
ccatctttgt 
atgacaatcc 
ggcaacatta 
tggcccactt 
ctcgacatgc 
gaacattggg 
tattctattt 
aaaagataga 
gttttataaa 
tgcgacagcg 
gtgtcgcctc 
cacatgccac 
tggaactatg 
gttggtttag 
ctattcgtcg 
atttgcttat 
tcagtttctg 
gtttttgagc 
ctgcaccctt 
tatcttacag 
taggtgaatt 



tcatttgaat 
atttcatcca 
tattgcgaac 
tcttaagatc 
tcagaatata 
tacaaatgaa 
tacataaata 
catctataat 
ccttgccatt 
ccttgatcat 
cgtgcagaac 
tgcaatacga 
atcgacggac 
agttctcttc 
cgcgggcttg 
tctggcgtgt 
aaattaacta 
aaccttggaa 
tctcatggca 
cagtatctac 
gcaactccat 
tatattattt 
ttttggcgta 
gcatgagcat 
ttgcatcgca 
tcctaagcat 
tagcacattt 
ataaggccga 
atcagaaact 
ttatggtaga 
aactgttcgt 
taaggattgg 
tctaaaataa 
aaattttaat 
catgctcggc 
ttttatttat 
ccagtttgta 
tagcgccagg 
ggtaaagctt 
aactaaacca 
aatctcttgc 
gctggtgaat 
cgaacagcaa 
gtactgttag 
acgcagtttt 
aggtcggcat 
actacatttt 
cattgattta 
tctggtggag 
cgccgccgtt 
gcccgcagcc 
catcccaaat 
cttgacgttc 
gcattgacac 
gctaacgcta 
gaatgtctat 
ttatttatat 
ttttcacatt 
atactcatga 
atacaaaaga 



cgcacttgga 
gagccggcga 
gagggtaaaa 
aaaaatctca 
catcatagga 
aaccgcgaga 
taactttcta 
ccaggttcct 
atttgtgtgc 
ctttaatata 
taactaaggg 
tcatgttatt 
gctttgtcag 
atttttatta 
gttactatcc 
gcatcgtttt 
caactatata 
gctatacacc 
gatctttgat 
tactgatgat 
gtggttgggt 
tgcaggtctt 
cgagacgcgc 
ctataacgtg 
acaggacgcg 
gctgctgata 
gtaatgatct 
atcgaaatac 
tgttaccgaa 
ttatcttaat 
tgaacgttaa 
aatttccagc 
agtccattta 
ttgataatta 
tgcacaattt 
atggtatctt 
taattcagtt 
accatgcttc 
ctgaagaccg 
cgattggtta 
cgtcgttttg 
tccatttgga 
agtccataaa 
aagatcggtg 
gtggttttgc 
ttaagcaaat 
tattgtttaa 
ttaattttcc 
cggggcgacg 
gcaacaactt 
acactcgcaa 
gccccttagc 
ttaatcgtta 
aattgattag 
tgtaaactcg 
tgctttgtgt 
cttcagcact 
ttagcaagat 
gcatcgtaca 
tatgaaatgc 



taacatgccg 
taagaacgga 
agtcattttt 
aagcaaatga 
aagagggtta 
gggtggaaaa 
ataaaatgtg 
caagatcgca 
atgtgcacaa 
tggaataagc 
ttatataatt 
tggtgtcatc 
ccccgaggaa 
aagttaattt 
accggtttta 
acaacaaaag 
ttagttgcta 
atggtttcgg 
ccgctgcagc 
attaaaatac 
atgttgtgat 
gtatacggct 
tccattaaag 
gtcgtccttt 
tcctttgcct 
tttgtgccaa 
ctattcgcac 
aatcccgatt 
aacgaggaat 
aaaatcgtat 
tctaaaagta 
ggccccagtg 
gcgagctgca 
atttaccttg 
ttactagatt 
agggagtccc 
tcactgtcca 
aattgaatga 
atgaaaaatt 
aaattaacac 
cactgggtca 
ctgccactag 
aatagtacaa 
gcttactgaa 
tacctttgac 
ggattccaaa 
atgtttgttt 
gcttgcagaa 
ccaagggcac 
cgcagcccgc 
tcacacaagg 
cagttgcatg 
tgttctcgat 
gaacaacaat 
attttcgcac 
gcatacatat 
ttagcacttt 
tattcaaaac 
tgaaaaaact 
aaataatgat 



ttcagataaa 
atacaccgat 
cggcccaaac 
acgtgtccat 
tggccaagtt 
attactccga 
ttatatgata 
caattgtcac 
tatccagttg 
atagaaggta 
tctagagtaa 
atctgtctaa 
tatccaaagg 
gatataaaat 
cactagcata 
caaaaactga 
atcattttgt 
ggctattatc 
gttatctcga 
gtccagagct 
aacgatccat 
tcaaggggct 
tgaaacagat 
gcctgataac 
tcgttgctct 
aggtcagcca 
aggtcattga 
cagccatatc 
tgcaacgatt 
tttatttaat 
tttcggtctt 
aattccattc 
atatgaagta 
acaaacatga 
catcattatc 
cagctgttcc 
cttgaaagtc 
aaagttcgtg 
cgccattcgg 
ttccatctag 
cttgaaaagg 
gacacgtgtg 
aaaagcacga 
gtatctacca 
cgcattgctt 
aaagtgaatg 
gttttgctta 
ggaggaaaag 
agaactgaat 
ttccctcatc 
tgagtagagt 
tggtaatcga 
tttaactaga 
ataaatggag 
aaaaaaccaa 
gtatcttaag 
tttaataaat 
atactaatac 
tgtcaatttc 
gctaacggaa 



gataagaatg 
attgaagaga 
gatgtgaatc 
ataaattgca 
tcgttttttg 
accgactact 
tatgtatgta 
ccatgttcta 
tggcatattc 
actatttaat 
tacaatcctc 
tatctgtcat 
tgcgagttac 
tatccattag 
cggtgctatg 
cccaaaggtg 
ggaattgcag 
aatagattta 
aacattccca 
tgagcattgt 
tatattgtat 
aatcctggtg 
caacgattcg 
agctccggtg 
agctgtgata 
ctacactgaa 
ggttatacgt 
gaaagaggac 
aataacacag 
tttgtttaaa 
atcataattt 
agcatgccgg 
ttatcaacct 
gtgaaaccac 
acgatcaaaa 
agaatatttt 
atcaaagtga 
tggttgttct 
atcgccaaat 
tcgtctttga 
ttcttcttct 
tggtgaactc 
tttcattgta 
tttaaaaggg 
gatatgcata 
gagactattt 
tatttttatt 
attcgagtcc 
ggtgcaacag 
aactcatcag 
acctggtgca 
tactagttac 
aatcttagct 
ttcaaaggtt 
aggcaattta 
cacttttgca 
atcactgctt 
gtaatatatt 
ctttgcgtga 
ggatggattt 



13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
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aaacaaacac gtacttaaag aactttgtat caactttaag ctgagctaaa taagtgaatt 17460 

ggtagcagca attttgtaga ataatgaaat atgtattgta ccttaaacgt gtttagcatt 17520 

aatattgaaa acgaagtaat tttaacttaa aagaaaacaa aacaaaaatt cgacactagc 17580 

atctaaacta agcgaattgt taggctttaa agttgcaaca cagaaaccaa aaaaaattct 17640 

agcaaatcaa gagaattcat tacgaaaaga tcgtgttaaa gttagtaatt tttaaatgca 17700 

ggtagagaag acaaattgtt gattttgtgt ttagtaaata ctgttttttt agttaaacga 17760 

attatccttt cgaacctcat ttgctcagcg ctgtctccta gagctctcta gttaattata 17820 

tctgcgtaga tgaaaattcc ccaaaag 17847 



<210> 176 
<211> 5167 
<212> DNA 
<213> Drosophila 

<400> 176 

tcatggagtt ccagtcgtcg gatacatgtg gaacacaacg aacacagcag gagcctctcg 60 

cgtctcatcc cccgaataca atgcgaaaaa aactgaatac ctgcaaaaag aaatacgcgt 120 

taaatggaat tacgcacccc ctccatccaa caccccctac gaccctgccc aagcaaaagg 180 

agcagcgaaa aagaagaagc gtcttatgaa acaattgaag caccggccaa aggagcaaag 240 

gagcagacaa agacagagac aaaaacatcg gcaaaggata aaacggaggc cgagaagaag 300 

aggaagcagt agtagcaaag caggaaacaa caacaacaac aaccgcaaca tcgcattttt 360 

aaacggcaat acgcaaaaaa aatattgctg gcaggagttt agaacattgc caaaggggat 420 

gcggacagat gaaggggcgg gggcgtggcg ggatatgaca agtgatggtg ctgttacgtt 480 

ttggatattt ttgctttgtt taatcgcctc gccgcacctg caagggggcg tggccgggag 540 

gcccgatgaa ctgcacatcg gcggcatctt tccgatagcc ggcaaaggag gatggcaggg 600 

cggccaggcg tgtatgcctg ccgcaagact ggcgttggat gatgtcaaca agcagccaaa 660 

tctgctgccg ggcttcaagc tcatcctgca cagcaacgac agtgagtgtg agcccggttt 720 

gggcgccagc gtgatgtaca atctgctcta taataaaccg caaaagctga tgctgttggc 780 

aggatgcagc acggtctgca ccactgtagc cgaggctgcc aaaatgtgga atctaattgt 840 

gctctgctac ggggcctcga gtccggctct ttcggatcgc aaacgattcc ccactctatt 900 

ccgcacccat ccatcggcca cggtgcacaa tccaacgcgc atcaagctga tgaagaaatt 960 

cggctggtcc cgggtggcca ttctgcagca ggcggaggag gtctttatat cgaccgtaga 1020 

ggatctcgag aatcgatgca tggaggctgg cgttgaaatc gtaactagac aatcatttct 1080 

atccgatcca acagacgccg tgcgcaattt gcgacgccag gatgcacgca tcattgtggg 1140 

actcttctat gtggtggccg ccaggagggt gctctgcgaa atgtacaaac agcagctata 1200 

tggccgagct catgtgtggt tctttattgg ctggtacgag gacaactggt acgaggtgaa 1260 

tctgaaagca gagggcatca cctgcactgt tgaacagatg cgaatagctg ccgaaggaca 1320 

tctgacaacg gaagcgctca tgtggaatca gaacaatcag acaactatat ccggaatgac 1380 

tgcagaggaa tttcggttag tactactact ctatccattg ctaaagtttt tgcttcaatg 1440 

tgccttcttt ttgcgcagac atcgactgaa tcaggcgcta atcgaggagg gttacgacat 1500 

taaccacgat cgctatccgg agggatatca ggaggcgcca ctcgcctacg atgcagtgtg 1560 

gagtgtggct ttggctttca acaagaccat ggaacgattg acaaccggga agaaatctct 1620 

gagggatttt acctatacgg acaaggagat tgccgatgaa atctacgctg ccatgaactc 1680 

cacacaattt ctgggtgtat cgggtgtggt ggcattcagt tctcagggcg atcgtattgc 1740 

tcttacacag atcgaacaga tgatagacgg caagtacgag aagttgggtt actacgatac 1800 

tcagttggat aacctatcct ggttgaatac tgaacagtgg attggtggca agaagacaaa 1860 

gcatgtaacg actgaaccga ataagaacgg aaatccgtcc aaagacgacg ttaatacaag 1920 

aggatcatta acaagcttaa gtgagaaaaa tgatgaatcc aaaaatccat catcatctgc 1980 

aagtcttcag caagctccta aggtagcacc aaaaccgaaa aagttatcaa tttcggatgc 2040 

tggtaaagac acagtaaccc aaaaagtcaa agaaaatgaa gagccaggat tttctaaaca 2100 

atttgaaaaa gaatctattg gtgtgcgagc aaatagaaac agcagtgcaa caaaaaagga 2160 

aaatgaaaag ctattagtaa agaccgtgcc cggaaaatct cttataaaag aatctaacga 2220 

tgaaaatgtg gaaccatcta gacgcacaaa aagtcagcca gttggaaaaa aagtttacga 2280 

ggaatcttca acgaggcgag ttcgtgagcc atttgattct tttgataggg aaaagtacct 2340 

gtctgatatg atcgattatg atcgatcgtc cgaggacgtg ccggaaaaaa aactatcttg 2400 

ggaaccagac agtacacttc gacgtcgttt tgtcagcaat aatgaatatc attctagtct 2460 

tgaaggtgag gaggaagaga tcgatttaga ttcagttgga tcgtcaacaa tgcgaggtag 2520 

ctatcgaatg ccacagaatg ccgaggatga acgacctgtt atcctagctg aaataatcaa 2580 
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tatgggaaag ttgaaggagc tcaaggagca acgcgaatca actaacgtat ttcgtaacaa 2640 
tcgtagagat gataaaacca ttgatgaagt gaagggttcc aagatcacag gaaccagcaa 2700 
acaacatgtt gaagaagtta aatataatga aacagcaagt aatgaagatg atgacaatgc 2760 
acaggtaaaa catataggta caatcacaaa aatctttagc attgcgcaaa gaatgaaaga 2820 
gaagaaaagc tccaaagaaa gtgaagaaga cgagaataag gacgagaata aagatgtaaa 2880 
aaaagatgaa aataaagacg atgaacagaa acttcccccg gaaccagaga tcgaattgga 2940 
ggccaaaaaa gcttggactt ttccgatcgc agagacatca cctgatatcc ctggagaaaa 3000 
tcagatcctg caagaatacc aaaaggttaa aattggcaaa agaaacagga aatacagatg 3060 
tataggcatc aatacggata taagcagaaa atctaagatt aaaacattta tcagcaagtc 3120 
tgcaggtgat gacttagtag tataccatga tggtggaaga ttgcgagata taggacaaac 3180 
cactgaaaat cttaagaatc aacgaaagaa tcataaacac aaaaccgaac cagatgatat 3240 
acccgttgat ataggacaca gcgatgatcg agtaagggaa attggagtga atactaaaaa 3300 
gttaccaaaa atcataatac caccaattgc cgagatgcat gtgcacaaaa atggcaaatt 3360 
aagggacatt ggaaccagca cagataagcc attttggcca atcgatgacg gaactgatgt 3420 
aatctacatg catcccatta agactgatag gaagaaatta aacaaactga ttgtggatcc 3480 
accgccggat aatggaccct acaagatgcc aaccaaagaa gatagacgaa cgtactacaa 3540 
aggttgtgaa tatcattttc ctggccgaac ggaatggcgc aggcttttct tcaacaaaat 3600 
ccatggaaaa tacaagctac gccgtccaag tcattggctc tacacactag ttttttccgt 3660 
tttatatatt ttgtttgtca ttatcttttc catggcctgg ttcgatttca tcaaagatga 3720 
cgcttccaga aaggtaccca tgataaagat ggcacaacca tttattagtt tcactcccat 3780 
cggaccacga actaatccca aggcggtatc ctttgatcct agaaacagca cagaagtcat 3840 
ggaaaagtac gctggtataa tggctcttct ggaaaaatac ggcgactatg gacacaatcc 3900 
gcgttttgga acgtgtactg cgaatgaaaa gtttggctat ccaagcggcg aaccgtgtgt 3960 
ttttcttaaa gttaatcgga ttattggatt taagaccgaa ccctacatca attccgatga 4020 
gcttgtcaaa gccaagatcg atgaggttga atttacggcc ttaaaaaggt tattggaaaa 4080 
tacaacaaca gaggagggtc atttgaatcg cacttggata acatgccgtt cagataaaga 4140 
taagaatgtt ttgattgaat ttcatccaga gccggcgata agaacggaat acaccgatat 4200 
tgaagagaaa atcgagtata ttgcgaacga gggtaaaaag tcatttttcg gcccaaacga 4260 
tgtgaatcga atcgtggctc ttaagatcaa aaatctcaaa gcaaatgaac gtgtccatat 4320 
aaattgcaaa atagtaatac aatcctcgca tcccgtttgc aatacgatca tgttatttgg 4380 
tgtcatcatc tgtctaatat ctgtcatctt actgggcatc gacggacgct ttgtcagccc 4440 
cgaggaatat ccaaagatct ttgatccgct gcagcgttat ctcgaaacat tcccactcga 4500 
agatccagta tctactactg atgatattaa aatacgtcca gagcttgagc attgtgaaag 4560 
tcaacgcaac tccatgtggt tgggtcttgt atacggcttc aaggggctaa tcctggtgtt 4620 
tggcctcttt ttggcgtacg agacgcgctc cattaaagtg aaacagatca acgattcgcg 4680 
ttatgtgggc atgagcatct ataacgtggt cgtcctttgc ctgataacag ctccggtggg 4740 
catggtcatt gcatcgcaac aggacgcgtc ctttgccttc gttgctctag ctgtgatatt 4800 
ctgttgtttc ctaagcatgc tgctgatatt tgtgccaaag gtcattgagg ttatacgtca 4860 
tcccaaggat aaggccgaat cgaaatacaa tcccgattca gccatatcga aagaggacga 4 920 
agaacgctat cagaaacttg ttaccgaaaa cgaggaattg caacgattaa taacacagaa 4980 
ggaggaaaag attcgagtcc tgcgacagcg tctggtggag cggggcgacg ccaagggcac 5040 
agaactgaat ggtgcaacag gtgtcgcctc cgccgccgtt gcaacaactt cgcagcccgc 5100 
ttccctcatc aactcatcag cacatgccac gcccgcagcc acactcgcaa tcacacaagg 5160 
tgagtag 5167 



<210> 177 
<211> 1713 
<212> PRT 
<213> Drosophila 

<400> 177 

Met Trp Asn Thr Thr Asn Thr Ala Gly Ala Ser Arg Val Ser Ser Pro 

15 10 15 

Glu Tyr Asn Ala Lys Lys Thr Glu Tyr Leu Gin Lys Glu lie Arg Val 

20 25 30 

Lys Trp Asn Tyr Ala Pro Pro Pro Ser Asn Thr Pro Tyr Asp Pro Ala 

35 40 45 

Gin Ala Lys Gly Ala Ala Lys Lys Lys Lys Arg Leu Met Lys Gin Leu 
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50 










55 




Lys 


His 


Arg 


Pro 


Lys 


Glu 


Gin 


Arg 


65 










70 






His 


Arg 


Gin 


Arg 


He 
85 


Lys 


Arg 


Arg 


Ser 


Lys 


Ala 


Gly 


Asn 


Asn 


Asn 


Asn 








100 










Asn 


Gly 


Asn 


Thr 


Gin 


Lys 


Lys 


Tyr 






115 










120 


Pro 


Lys 


Gly 


Met 


Arg 


Thr 


Asp 


Glu 




130 










135 




Thr 


Ser 


Asp 


Gly 


Ala 


Val 


Thr 


Phe 


145 










150 






Ala 


Ser 


Pro 


His 


Leu 


Gin 


Gly 


Gly 










165 








His 


He 


Gly 


Gly 


lie 


Phe 


Pro 


He 








180 










Gly 


Gin 


Ala 


Cys 


Met 


Pro 


Ala 


Ala 






195 










200 


Lys 


Gin 


Pro 


Asn 


Leu 


Leu 


Pro 


Gly 




210 










215 




Asp 


Ser 


Glu 


Cys 


Glu 


Pro 


Gly 


Leu 


225 










230 






Leu 


Tyr 


Asn 


Lys 


Pro 


Gin 


Lys 


Leu 










245 








Val 


Cys 


Thr 


Thr 


Val 


Ala 


Glu 


Ala 








260 










Leu 


Cys 


Tyr 


Gly Ala 


Ser 


Ser 


Pro 






275 










280 


Pro 


Thr 


Leu 


Phe 


Arg 


Thr 


His 


Pro 




290 










295 




Arg 


He 


Lys 


Leu 


Met 


Lys 


Lys 


Phe 


305 










310 






Gin 


Gin 


Ala 


Glu 


Glu 


Val 


Phe 


He 










325 








Arg 


Cys 


Met 


Glu 


Ala 


Gly 


Val 


Glu 








340 










Ser 


Asp 


Pro 


Thr 


Asp 


Ala 


Val 


Arg 






355 










360 


He 


He 


Val 


Gly 


Leu 


Phe 


Tyr 


Val 




370 










375 




Glu 


Met 


Tyr 


Lys 


Gin 


Gin 


Leu 


Tyr 


385 










390 






He 


Gly 


Trp 


Tyr 


Glu 


Asp 


Asn 


Trp 










405 








Gly 


He 


Thr 


Cys 


Thr 


Val 


Glu 


Gin 








420 










Leu 


Thr 


Thr 


Glu 


Ala 


Leu 


Met 


Trp 






435 










440 


Ser 


Gly 


Met 


Thr 


Ala 


Glu 


Glu 


Phe 




4 50 










455 




Leu 


Leu 


Lys 


Phe 


Leu 


Leu 


Gin 


Cys 


465 










470 






Leu 


Asn 


Gin 


Ala 


Leu 


He 


Glu 


Glu 










485 








Tyr 


Pro 


Glu 


Gly 


Tyr 


Gin 


Glu 


Ala 








500 










Ser 


Val 


Ala 


Leu 


Ala 


Phe 


Asn 


Lys 






515 










520 


Lys 


Lys 


Ser 


Leu 


Arg 


Asp 


Phe 


Thr 



60 



Ser 


Arg 


bin 


Arg 


bin 


Arg 


bin 


Lys 






75 










80 


Pro 


Arg 


Arg 


Arg 


Gly 


Ser 


Ser 


Ser 




90 










95 




Asn 


Asn 


Arg 


Asn 


He 


Ala 


Phe 


Leu 


105 










110 






Cys 


Trp 


Gin 


Glu 


Phe 


Arg 


Thr 


Leu 










125 








Gly 


Ala 


Gly 


Ala 


Trp 


Arg 


Asp 


Met 








140 










Trp 


He 


Phe 


Leu 


Leu 


Cys 


Leu 


He 






155 










160 


Val 


Ala 


Gly 


Arg 


Pro 


Asp 


Glu 


Leu 




170 










175 




Ala 


Gly 


Lys 


Gly 


Gly 


Trp 


Gin 


Gly 


185 










190 






Arg 


Leu 


Ala 


Leu 


Asp Asp 


Val 


Asn 










205 








Phe 


Lys 


Leu 


He 


Leu 


His 


Ser 


Asn 








220 










Gly Ala 


Ser 


Val 


Met 


Tyr 


Asn 


Leu 






235 










240 


Met 


Leu 


Leu 


Ala 


Gly 


Cys 


Ser 


Thr 




250 










255 




Ala 


Lys 


Met 


Trp 


Asn 


Leu 


He 


Val 


265 










270 






Ala 


Leu 


Ser 


Asp 


Arg 


Lys 


Arg 


Phe 










285 








Ser 


Ala 


Thr 


Val 


His 


Asn 


Pro 


Thr 








300 










Gly 


Trp 


Ser 


Arg 


Val 


Ala 


He 


Leu 






315 










320 


Ser 


Thr 


Val 


Glu 


Asp 


Leu 


Glu 


Asn 




330 










335 




He 


Val 


Thr 


Arg 


Gin 


Ser 


Phe 


Leu 


345 










350 






Asn 


Leu 


Arg 


Arg 


Gin 


Asp 


Ala 


Arg 










365 








Val 


Ala 


Ala 


Arg 


Arg 


Val 


Leu 


Cys 








380 










Gly Arg 


Ala 


His 


Val 


Trp 


Phe 


Phe 






395 










400 


Tyr 


Glu 


Val 


Asn 


Leu 


Lys 


Ala 


Glu 




410 










415 




Met 


Arg 


He 


Ala 


Ala 


Glu 


Gly 


His 


425 










430 






Asn 


Gin 


Asn 


Asn 


Gin 


Thr 


Thr 


He 










445 








Arg 


Leu 


Val 


Leu 


Leu 


Leu 


Tyr 


Pro 








460 










Ala 


Phe 


Phe 


Leu 


Arg 


Arg 


His 


Arg 






475 










480 


Gly 


Tyr 


Asp 


He 


Asn 


His 


Asp 


Arg 




490 










495 




Pro 


Leu 


Ala 


Tyr 


Asp 


Ala 


Val 


Trp 


505 










510 






Thr 


Met 


Glu 


Arg 


Leu 


Thr 


Thr 


Gly 










525 








Tyr 


Thr 


Asp 


Lys 


Glu 


He 


Ala 


Asp 
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530 










535 










540 










Glu 


He 


Tyr 


Ala 


Ala 


Met 


Asn 


Ser 


Thr 


Gin 


Phe 


Leu 


Gly 


Val 


Ser 


-Gly 


545 










550 










555 










560 


Val 


Val 


Ala 


Phe 


Ser 


Ser 


Gin 


Gly 


Asp 


Arg 


He 


Ala 


Leu 


Thr 


Gin 


He 










565 










570 










575 




Glu 


Gin 


Met 


He 


Asp 


Gly 


Lys 


Tyr 


Glu 


Lys 


Leu 


Gly 


Tyr 


Tyr 


Asp 


Thr 








580 










585 










590 






Gin 


Leu 


Asp 


Asn 


Leu 


Ser 


Trp 


Leu 


Asn 


Thr 


Glu 


Gin 


Trp 


He 


Gly 


Gly 






595 










600 










605 








Lys 


Lys 


Thr 


Lys 


His 


Val 


Thr 


Thr 


Glu 


Pro 


Asn 


Lys 


Asn 


Gly 


Asn 


Pro 


610 










615 










620 










Ser 


Lys 


Asp 


Asp 


Val 


Asn 


Thr 


Arg 


Gly 


Ser 


Leu 


Thr 


Ser 


Leu 


Ser 


Glu 


625 










630 










635 










640 


Lys 


Asn 


Asp 


Glu 


Ser 


Lys 


Asn 


Pro 


Ser 


Ser 


Ser 


Ala 


Ser 


Leu 


Gin 


Gin 










645 










650 










655 




Ala 


Pro 


Lys 


Val 


Ala 


Pro 


Lys 


Pro 


Lys 


Lys 


Leu 


Ser 


He 


Ser 


Asp 


Ala 








660 










665 










670 






Gly 


Lys 


Asp 


Thr 


Val 


Thr 


Gin 


Lys 


Val 


Lys 


Glu 


Asn 


Glu 


Glu 


Pro 


Gly 






675 










680 










685 








Phe 


Ser 


Lys 


Gin 


Phe 


Glu 


Lys 


Glu 


Ser 


He 


Gly 


Val 


Arg 


Ala 


Asn 


Arg 




690 










695 










700 










Asn 


Ser 


Ser 


Ala 


Thr 


Lys 


Lys 


Glu 


Asn 


Glu 


Lys 


Leu 


Leu 


Val 


Lys 


Thr 


705 










710 










715 










720 


Val 


Pro 


Gly 


Lys 


Ser 


Leu 


He 


Lys 


Glu 


Ser 


Asn 


Asp 


Glu 


Asn 


Val 


Glu 










725 










730 










735 




Pro 


Ser 


Arg 


Arg 


Thr 


Lys 


Ser 


Gin 


Pro 


Val 


Gly 


Lys 


Lys 


Val 


Tyr 


Glu 








740 










745 










750 






Glu 


Ser 


Ser 


Thr 


Arg 


Arg 


Val 


Arg 


Glu 


Pro 


Phe 


Asp 


Ser 


Phe 


Asp 


Arg 






755 










760 










765 








Glu 


Lys 


Tyr 


Leu 


Ser 


Asp 


Met 


He 


Asp 


Tyr 


Asp 


Arg 


Ser 


Ser 


Glu 


Asp 




770 










775 










780 










Val 


Pro 


Glu 


Lys 


Lys 


Leu 


Ser 


Trp 


Glu 


Pro 


Asp 


Ser 


Thr 


Leu 


Arg 


Arg 


785 










790 










795 










800 


Arg 


Phe 


Val 


Ser 


Asn 


Asn 


Glu 


Tyr 


His 


Ser 


Ser 


Leu 


Glu 


Gly 


Glu 


Glu 










805 










810 










815 




Glu 


Glu 


He 


Asp 


Leu 


Asp 


Ser 


Val 


Gly 


Ser 


Ser 


Thr 


Met 


Arg 


Gly 


Ser 








820 










825 










830 






Tyr 


Arg 


Met 


Pro 


Gin 


Asn 


Ala 


Glu 


Asp 


Glu 


Arg 


Pro 


Val 


He 


Leu 


Ala 






835 










840 










845 








Glu 


He 


He 


Asn 


Met 


Gly 


Lys 


Leu 


Lys 


Glu 


Leu 


Lys 


Glu 


Gin 


Arg 


Glu 




850 










855 










860 










Ser 


Thr 


Asn 


Val 


Phe 


Arg 


Asn 


Asn 


Arg 


Arg 


Asp 


Asp 


Lys 


Thr 


He 


Asp 


865 










870 










875 










880 


Glu 


Val 


Lys 


Gly 


Ser 


Lys 


He 


Thr 


Gly 


Thr 


Ser 


Lys 


Gin 


His 


Val 


Glu 










885 










890 










895 




Glu 


Val 


Lys 


Tyr 


Asn 


Glu 


Thr 


Ala 


Ser 


Asn 


Glu 


Asp 


Asp 


Asp 


Asn 


Ala 








900 










905 










910 






Gin 


Val 


Lys 


His 


He 


Gly 


Thr 


He 


Thr 


Lys 


He 


Phe 


Ser 


He 


Ala 


Gin 






915 










920 










925 








Arg 


Met 


Lys 


Glu 


Lys 


Lys 


Ser 


Ser 


Lys 


Glu 


Ser 


Glu 


Glu 


Asp 


Glu 


Asn 




930 










935 










940 










Lys 


Asp 


Glu 


Asn 


Lys 


Asp 


Val 


Lys 


Lys 


Asp 


Glu 


Asn 


Lys 


Asp 


Asp 


Glu 


945 










950 










955 










960 


Gin 


Lys 


Leu 


Pro 


Pro 


Glu 


Pro 


Glu 


He 


Glu 


Leu 


Glu 


Ala 


Lys 


Lys 


Ala 








965 










970 










975 




Trp 


Thr 


Phe 


Pro 


He 


Ala 


Glu 


Thr 


Ser 


Pro 


Asp 


He 


Pro 


Gly 


Glu 


Asn 








980 










985 










990 






Gin 


He 


Leu 


Gin 


Glu 


Tyr 


Gin 


Lys 


Val 


Lys 


He 


Gly 


Lys 


Arg 


Asn 


Arg 






995 










1000 








1005 






Lys 


Tyr 


Arg 


Cys 


He 


Gly 


He 


Asn 


Thr 


Asp 


He 


Ser 


Arg 


Lys 


Ser 


Lys 
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1010 



1015 



1020 



lie Lys Thr Phe lie Ser Lys Ser Ala Gly Asp Asp Leu Val Val Tyr 
1025 1030 1035 10H 

His Asp Gly Gly Arg Leu Arg Asp lie Gly Gin Thr Thr Glu Asn Leu 

1045 1050 1055 

Lys Asn Gin Arg Lys Asn His Lys His Lys Thr Glu Pro Asp Asp lie 

1060 1065 1070 

Pro Val Asp He Gly His Ser Asp Asp Arg Val Arg Glu He Gly Val 

1075 1080 1085 

Asn Thr Lys Lys Leu Pro Lys He He He Pro Pro He Ala Glu Met 

1090 1095 1100 

His Val His Lys Asn Gly Lys Leu Arg Asp He Gly Thr Ser Thr Asp 
1105 1110 1115 112i 

Lys Pro Phe Trp Pro He Asp Asp Gly Thr Asp Val He Tyr Met His 

1125 1130 1135 

Pro He Lys Thr Asp Arg Lys Lys Leu Asn Lys Leu He Val Asp Pro 

1140 1145 1150 

Pro Pro Asp Asn Gly Pro Tyr Lys Met Pro Thr Lys Glu Asp Arg Arg 

1155 1160 1165 

Thr Tyr Tyr Lys Gly Cys Glu Tyr His Phe Pro Gly Arg Thr Glu Trp 

1170 1175 1180 

Arg Arg Leu Phe Phe Asn Lys He His Gly Lys Tyr Lys Leu Arg Arg 
1185 1190 1195 1201 

Pro Ser His Trp Leu Tyr Thr Leu Val Phe Ser Val Leu Tyr He Leu 

1205 1210 1215 

Phe Val He He Phe Ser Met Ala Trp Phe Asp Phe He Lys Asp Asp 

1220 1225 1230 

Ala Ser Arg Lys Val Pro Met He Lys Met Ala Gin Pro Phe He Ser 

1235 1240 1245 

Phe Thr Pro He Gly Pro Arg Thr Asn Pro Lys Ala Val Ser Phe Asp 

1250 1255 1260 

Pro Arg Asn Ser Thr Glu Val Met Glu Lys Tyr Ala Gly He Met Ala 
1265 1270 1275 1281 

Leu Leu Glu Lys Tyr Gly Asp Tyr Gly His Asn Pro Arg Phe Gly Thr 

1285 1290 1295 

Cys Thr Ala Asn Glu Lys Phe Gly Tyr Pro Ser Gly Glu Pro Cys Val 

1300 1305 1310 

Phe Leu Lys Val Asn Arg He He Gly Phe Lys Thr Glu Pro Tyr He 

1315 1320 1325 

Asn Ser Asp Glu Leu Val Lys Ala Lys He Asp Glu Val Glu Phe Thr 

1330 1335 1340 

Ala Leu Lys Arg Leu Leu Glu Asn Thr Thr Thr Glu Glu Gly His Leu 
1345 1350 1355 1361 

Asn Arg Thr Trp He Thr Cys Arg Ser Asp Lys Asp Lys Asn Val Leu 

1365 1370 1375 

He Glu Phe His Pro Glu Pro Ala He Arg Thr Glu Tyr Thr Asp He 

1380 1385 1390 

Glu Glu Lys He Glu Tyr He Ala Asn Glu Gly Lys Lys Ser Phe Phe 

1395 1400 1405 

Gly Pro Asn Asp Val Asn Arg He Val Ala Leu Lys He Lys Asn Leu 

1410 1415 1420 

Lys Ala Asn Glu Arg Val His He Asn Cys Lys He Val He Gin Ser 
1425 1430 1435 1441 

Ser His Pro Val Cys Asn Thr He Met Leu Phe Gly Val He He Cys 

1445 1450 1455 

Leu He Ser Val He Leu Leu Gly He Asp Gly Arg Phe Val Ser Pro 

1460 1465 1470 

Glu Glu Tyr Pro Lys He Phe Asp Pro Leu Gin Arg Tyr Leu Glu Thr 

1475 1480 1485 

Phe Pro Leu Glu Asp Pro Val Ser Thr Thr Asp Asp He Lys He Arg 
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1490 1495 1500 

Pro Glu Leu Glu His Cys Glu Ser Gin Arg Asn Ser Met Trp Leu Gly 
1505 1510 1515 1520 

Leu Val Tyr Gly Phe Lys Gly Leu He Leu Val Phe Gly Leu Phe Leu 

1525 1530 1535 

Ala Tyr Glu Thr Arg Ser He Lys Val Lys Gin He Asn Asp Ser Arg 

1540 1545 1550 

Tyr Val Gly Met Ser He Tyr Asn Val Val Val Leu Cys Leu He Thr 

1555 1560 1565 

Ala Pro Val Gly Met Val He Ala Ser Gin Gin Asp Ala. Ser Phe Ala 

1570 1575 1580 

Phe Val Ala Leu Ala Val He Phe Cys Cys Phe Leu Ser Met Leu Leu 
1585 1590 1595 1600 

He Phe Val Pro Lys Val He Glu Val He Arg His Pro Lys Asp Lys 

1605 1610 1615 

Ala Glu Ser Lys Tyr Asn Pro Asp Ser Ala He Ser Lys Glu Asp Glu 

1620 1625 1630 

Glu Arg Tyr Gin Lys Leu Val Thr Glu Asn Glu Glu Leu Gin Arg Leu 

1635 1640 1645 

He Thr Gin Lys Glu Glu Lys He Arg Val Leu Arg Gin Arg Leu Val 

1650 1655 1660 

Glu Arg Gly Asp Ala Lys Gly Thr Glu Leu Asn Gly Ala Thr Gly Val 
1665 1670 1675 1680 

Ala Ser Ala Ala Val Ala Thr Thr Ser Gin Pro Ala Ser Leu He Asn 

1685 1690 1695 

Ser Ser Ala His Ala Thr Pro Ala Ala Thr Leu Ala He Thr Gin Gly 
1700 1705 1710 

Glu 



<210> 178 
<211> 4463 
<212> DNA 
<213> Drosophila 



<400> 178 

tgtgggtggt 

ccctgccgcc 

aggatgtggt 

gctgatatat 

gaatgccgat 

ctggttccta 

aatctaaatt 

ccaataaatc 

caaaaactaa 

ggaactactt 

cagctgtaat 

aagcaggact 

aagtaaaatg 

cgggttttat 

tacttttcat 

gcttgcagac 

tgaatgaaag 

attgctgcac 

tactatgtag 

atactagtac 

tgtatagttt 

tggttctctt 

gtcaggcaga 



tcgcacgccc 
caacacgccc 
acccggaagc 
gaatcttggg 
acccgacact 
gcgctagtct 
aagccaccac 
caaaactaag 
gttgtcatct 
ggctcgaaac 
caaaagctta 
aaaagctttt 
acaaaacttg 
tgtatttgaa 
gttctacgca 
tttataatca 
tggttcagat 
aggaaaataa 
aatttattta 
tgtttgttga 
gctgatttct 
tgttaaatac 
ataaataggc 



gccatttccg 
actccgaagg 
tcttacatac 
cggcggcttg 
taatgcccaa 
agcaatagtt 
atcccggggt 
actcagccac 
catttacata 
ggttagaagc 
aagcttcgct 
atttgccgaa 
gaagttcttt 
aagtctgatg 
aggggtatga 
tccaacaaac 
aagttacatt 
atcaagatta 
cattttgttg 
tgaatggctt 
cttcttgctc 
tataaagtaa 
gaagatcctt 



gcgcggcctc 
actacgcgca 
cgctgcggtt 
ccttttgttg 
ccagtagttg 
tagttgttta 
ccgtgactaa 
atgccgtcgt 
aattagaagt 
tgaattcaaa 
tgtttggttt 
actaaagctc 
ggtaatgtaa 
aatcattatc 
taactggttg 
attatggtgc 
ttattttatc 
tccacaatag 
tactaatggt 
aggccttgca 
gaattgcaac 
aggaataaaa 
ccgaaatcga 



ggataacgtc 
caagttcctg 
gctggtgata 
ttcgaaaaca 
gctaattggc 
tgttaaggcc 
gacttccgac 
aatctatgaa 
cataaatttg 
accggttaac 
agtctttgac 
ggaggtattt 
tgaacgccaa 
ttgatgactg 
acatggatac 
attgtatacc 
tcacacatct 
tttaaaagtc 
aacttaactt 
aaatcgagtc 
agagacccaa 
caaatccctg 
agtatgtaca 



ctcgattacg 
cccgagggca 
aagcgctgaa 
agttccggcg 
tttaagctta 
caagttcgct 
cttatcgcat 
atacattcac 
aagtgagcgg 
gtttatctgt 
taagtttgga 
atttattcac 
tatttacatt 
gcccctaatt 
caagattaga 
atttatgtta 
aagtttttgt 
aatttatagg 
aggtttttaa 
agtgggcatt 
ccaggcccgc 
taaagtggct 
caatccaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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atccatgtta 
aggtcacgtt 
ttggccaccg 
ggatagcaga 
agaaaaacca 
agtatataat 
aatccaatga 
accagagtac 
aggatcagga 
gtggaggcta 
cccaacagag 
ttggtgtcca 
tgatccgcat 
gtggatattc 
tgagtttcat 
tcttacgaaa 
cattcaatcc 
cttccgtctc 
cctttctctt 
tcatggccct 
tgccaccaga 
ctctgagttc 
actcagtgct 
ccaaataatc 
gaagggtttc 
ctgcagcatt 
catcggcggc 
tgtccggagg 
tacaggtcac 
gtcggatggg 
ctctcgtttt 
gttcaaagtc 
gctgaccatc 
gacaccgggc 
cgagtattat 
actaatttaa 
cggagataaa 
gcctgcaaga 
ttcgagcaga 
aaaggtcgaa 
aaataacacg 
gatcacattt 
ctgataaaga 
ataatttgtt 
ttcaacacca 
aattttcttt 
tctgatttta 
atatgaatgc 
gccaccttta 
tttgcattca 
gataaagtga 
gcatgaatag 



ttcccaaaag 
tgaacagggc 
tgataaggcc 
ggatgggata 
ccaacaaggt 
gtcgtgggtg 
taaacatcca 
atcgttccga 
agaagagcaa 
atgtccggaa 
aaagggtcaa 
gacctgcctg 
ggcagaggat 
catcgctaag 
tcgcttttgc 
agattttaga 
tggtattgtt 
cacaaagagt 
cagttgggtc 
tatatcttcc 
gaagttcact 
gctcgtcttt 
gtttgcaaaa 
ctcgaagttg 
attatctgta 
tctttccggt 
ggccagtttc 
gctcttcaca 
gtgatccaag 
aattccccgt 
ctccacgcga 
ctcggccttg 
tgcatccacg 
cacattcggc 
atagccagca 
gcgcaggctg 
atgcgagtgg 
attgccactt 
gctcaaggtc 
cggtgcgttc 
aacctcttta 
tttatcaagg 
gctaaatggt 
tatcgcagta 
ttgcatgtat 
tttgctctgg 
aaagtgcaac 
atcaatcggt 
gatactttcc 
gatataaaat 
gcatttcaat 
aatttgtctg 



aaagaagagc 
tgtatagaca 
aatagggagt 
gtccattgag 
gggtataagt 
attaactcca 
gctgaagaca 
ctcagattgt 
ctgaagaccc 
gctcttaaac 
gccgacattg 
cattttgctc 
cactgggtgc 
ccaggtctca 
attacgcaag 
ttaaaacata 
atggatataa 
gaatctcgag 
catagttttt 
tcattatccg 
atcccactca 
agaacaatga 
atagtggtgt 
ctaaggagtg 
gacataactt 
tttttgacca 
tcctgagttt 
gcgagaaggt 
ctctgccaaa 
tcgttcaggc 
ttgattcgct 
cagaatcctc 
cagaggagga 
tgatccaggg 
atccaccaga 
cgatgtcgca 
cgataaaacg 
tcctcactgt 
aactggttat 
ttttcaaaat 
ataatgggta 
gaacttccta 
aattatcaaa 
aagtgagatt 
cgataaaact 
gtaacaatgg 
gatacttaac 
agctaggggc 
ttactaacac 
aatgttctct 
tgaagaacca 
eta 



aacacaaaca 
ctccaggaga 
atgacgccca 
gaattattgg 
ggcaatgtcc 
atgactctca 
accaagagta 
tccagctgct 
agggcagcac 
aeggctgeag 
gtgattacat 
actcccagta 
gatgaggctt 
tagttccagt 
aaaaatggca 
tatgetataa 
cattactagt 
tatagacctt 
egggaataaa 
ataggtgaag 
catttctcac 
cacccatatc 
ttaactttaa 
cattggttgc 
tttggcatat 
gcgaatcgaa 
gagtcaggag 
ttatactcga 
ctctattccc 
atagatccct 
tttccttgcc 
ggttgcagtc 
agegcaagtt 
cgtacagggg 
gaaacagcag 
ctcccgaagc 
agttttcctc 
tctagctggg 
ctggccgacg 
atctttgaat 
ttatttatac 
attaaaaaca 
attatatata 
ctctgaattg 
aataggaatt 
ctgccatttc 
acgaactttg 
tgtcaacaac 
ctagaaggta 
tgttcaagct 
atttgtatat 



ggccgagttg 
tgtatcccac 
gacttaaacc 
gtctgtagga 
aggccgccac 
catatctttg 
gatattgeat 
ccagcaaatg 
acagattcag 
ttataaaaat 
ctaaggagtg 
gaaatgtaaa 
ccaagctgcc 
aaccacatcc 
actttccagg 
ttttccctca 
tggccgtcgt 
aaaaactaag 
gaccgcactt 
catttcgaaa 
ttctggatct 
aaacaaggga 
aattgttgga 
attagegtet 
cttcatcaag 
tatctccgaa 
gttactcagt 
gctgagttcc 
attgegaatt 
tagattcgee 
ttcatccgtt 
cttgtaggga 
cttcaggtgg 
gggatttgta 
catactgeta 
tccgcgatcg 
tcctcatccc 
tatttgaatt 
ataagaattg 
ataataagtg 
acagtagttc 
aaaaaaaaaa 
ccatacaccc 
ataaaggtta 
acaaataaat 
tgaatgtttc 
agttgctagt 
tacaatatgt 
gagatagagg 
gtgaaatctt 
ttttttagat 



cttgaagata 
caatattgea 
atatccagaa 
accgggctcc 
agecgacatt 
atactgeaga 
cacggctcca 
gctttgactc 
caatatcttc 
cattaggagg 
atcatcttca 
ctgggtcaga 
accaccgctg 
tccctcaaat 
tagcttggta 
acccactcac 
ttgatttcct 
taagaggctc 
tctagattct 
tgegacatat 
acaaagaata 
tggaatgtga 
ctaagctcat 
gctgaaacac 
tcccctacaa 
atactgaaga 
tggctaatca 
ggtgccgacc 
tggcacctcc 
ttttgcccaa 
tegtattcca 
ttgtgcgaca 
cagtagtcca 
cctccagata 
tctattttta 
actgacgagg 
tgtaaatagg 
gctggtctac 
cagcatcttc 
aaagtctcga 
tgtttacttt 
aacgaatagc 
aacgataatg 
attgccgttt 
taaaaatctg 
tgttattctc 
cgccataagc 
atgataagca 
tattcaaata 
tatctgtaga 
ttcaccgaac 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4463 



<210> 179 
<211> 1938 
<212> DNA 
<213> Drosophila 



<400> 179 

atgctgctgt ttctctggtg gattgetgge tatataatac tegtatctgg aggtacaaat 



60 
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ccccccctgt acgccctgga tcagccgaat gtggcccggt gtctggacta ctgccacetg 120 
aagaacttgc gcttcctcct ctgcgtggat gcagatggtc agctgtcgca caatccctac 180 
aaggactgca accgaggatt ctgcaaggcc gaggactttg aactggaata cgaaacggat 240 
gaaggcaagg aaaagcgaat caatcgcgtg gagaaaacga gagttgggca aaaggcgaat 300 
ctaagggatc tatgcctgaa cgaacgggga attcccatcc gacggaggtg ccaaattcgc 360 
aatgggaata gagtttggca gagcttggat cacgtgacct gtaggtcggc accggaactc 420 
agctcgagta taaaccttct cgctgtgaag agccctccgg acatgattag ccaactgagt 480 
aacctcctga ctcaaactca ggagaaactg gccgccgccg atgtcttcag tatttcggag 540 
atattcgatt cgctggtcaa aaaaccggaa agaaatgctg cagttgtagg ggacttgatg 600 
aagatatgcc aaaaagttat gtctacagat aatgaaaccc ttcgtgtttc agcagacgct 660 
aatgcaacca atgcactcct tagcaacttc gaggattatt tggatgagct tagtccaaca 720 
attttaaagt taaacaccac tatttttgca aacagcactg agttcacatt ccatcccttg 780 
tttgatatgg gtgtcattgt tctaaagacg agcgaactca gagtattctt tgtagatcca 840 
gaagtgagaa atgtgagtgg gatagtgaac ttctctggtg gcaatatgtc gcatttcgaa 900 
atgcttcacc tatcggataa tgaggaagat ataagggcca tgaagaatct agaaagtgcg 960 
gtctttattc ccgaaaaact atggacccaa ctgaagagaa agggagcctc ttacttagtt 1020 
tttaaggtct atactcgaga ttcactcttt gtggagacgg aagaggaaat caaacgacgg 1080 
ccaactagta atgttatatc cataacaata ccaggattga atgataccaa gctacctgga 1140 
aagttgccat ttttcttgcg taatgcaaaa gcgaatgaaa ctcaatttga gggaggatgt 1200 
ggttactgga actatgagac ctggcttagc gatggaatat ccaccagcgg tggtggcagc 1260 
ttggaagcct catcgcaccc agtgatcctc tgccatgcgg atcatctgac ccagtttaca 1320 
tttctactgg gagtgagcaa aatgcaggca ggtctggaca ccaatgaaga tgatcactcc 1380 
ttagatgtaa tcaccaatgt cggcttgacc ctttctctgt tgggcctcct aatgattttt 1440 
ataactgcag ccgtgtttaa gagcttccgg acattagcct ccacgaagat attgctgaat 1500 
ctgtgtgctg ccctgggtct tcagttgctc ttcttcctga tcctgagtca aagccatttg 1560 
ctggagcagc tggaacaatc tgagtcggaa cgatgtactc tggttggagc cgtgatgcaa 1620 
tatctactct tggttgtctt cagctggatg tttatcattg gatttctgca gtatcaaaga 1680 
tatgtgagag tcattggagt taatcaccca cgacattata tactaatgtc ggctgtggcg 1740 
gcctggacat tgccacttat acccaccttg ttggtggttt ttctggagcc cggttcctac 1800 
agacccaata attcctcaat ggactatccc atcctctgct atccttctgg atatggttta 1860 
agtctgggcg tcatactccc tattggcctt atcacggtgg ccaatgcaat attgcaatac 1920 
aaaaacttag atgtgtga 1938 



<210> 180 
<211> 645 
<212> PRT 
<213> Drosophila 



<400> 180 



Met 


Leu 


Leu 


Phe 


Leu 


Trp 


Trp 


He 


Ala 


Gly 


Tyr 


lie 


He 


Leu 


Val 


Ser 


1 








5 










10 










15 




Gly 


Gly 


Thr 


Asn 


Pro 


Pro 


Leu 


Tyr 


Ala 


Leu 


Asp 


Gin 


Pro 


Asn 


Val 


Ala 








20 










25 










30 






Arg 


Cys 


Leu 


Asp 


Tyr 


Cys 


His 


Leu 


Lys 


Asn 


Leu 


Arg 


Phe 


Leu 


Leu 


Cys 






35 










40 










45 








Val 


Asp 


Ala 


Asp 


Gly 


Gin 


Leu 


Ser 


His 


Asn 


Pro 


Tyr 


Lys 


Asp 


Cys 


Asn 




50 










55 










60 










Arg 


Gly 


Phe 


Cys 


Lys 


Ala 


Glu 


Asp 


Phe 


Glu 


Leu 


Glu 


Tyr 


Glu 


Thr 


Asp 


65 










70 










75 










80 


Glu 


Gly 


Lys 


Glu 


Lys 


Arg 


He 


Asn 


Arg 


Val 


Glu 


Lys 


Thr 


Arg 


Val 


Gly 










85 










90 










95 




Gin 


Lys 


Ala 


Asn 


Leu 


Arg 


Asp 


Leu 


Cys 


Leu 


Asn 


Glu 


Arg 


Gly 


He 


Pro 








100 










105 










110 






He 


Arg 


Arg 


Arg 


Cys 


Gin 


He 


Arg 


Asn 


Gly 


Asn 


Arg 


Val 


Trp 


Gin 


Ser 






115 










120 










125 








Leu 


Asp 


His 


Val 


Thr 


Cys 


Arg 


Ser 


Ala 


Pro 


Glu 


Leu 


Ser 


Ser 


Ser 


lie 




130 










135 










140 










Asn 


Leu 


Leu 


Ala 


Val 


Lys 


Ser 


Pro 


Pro 


Asp 


Met 


He 


Ser 


Gin 


Leu 


Ser 


145 










150 










155 










160 


Asn 


Leu 


Leu 


Thr 


Gin 


Thr 


Gin 


Glu 


Lys 


Leu 


Ala 


Ala 


Ala 


Asp 


Val 


Phe 
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1 o3 










i / u 










1 

1 ' 3 




Ser 


T 1 Q 

i J.e 


C A> w~ 

ber 


blU 


ne 


r ne 


Asp 


oer 


Leu 


V a J. 


Lys 


Lys 


Pro 


clU 


Arg 


Asn 








1 o U 










1 Q C. 










i on 






Ala 


Ala 


Val 


Val 


Gly 


Asp 


Leu 


Me t 


Lys 


lie 


Cys 


bin 


Lys 


val 


Met 


ber 






n a c 
1 »D 










-inn 
ZUU 










one. 
ZU3 








Thr 


Asp 


Asn 


Glu 


Thr 


Leu 


Arg 


Val 


Ser 


Aia 


Asp 


Ala 


Asn 


Aia 


l nr 


Asn 




Z 1 U 










Z 1 3 










000 
Z Z U 










Ala 


Leu 


Leu 


C n tr- 

ber 


Asn 


Fne 


blu 


Asp 


Tyr 


Leu 


Asp 


r*l ii 
blu 


Leu 


ber 


Pro 


i nr 


225 










Z JU 










ZJD 










0 A A 

z 4 U 


lie 


Leu 


Lys 


Leu 


Asn 


Thr 


Thr 


He 


Phe 


Aia 


Asn 


Ser 


Thr 


blU 


Phe 


Thr 










O A C 

z4 3 










ZjU 










O C C 
Z33 




Phe 


His 


Pro 


Leu 


Phe 


Asp 


Met 


Gly 


val 


TI- 
116 


val 


Leu 


Lys 


Thr 


O 

ber 


blU 








1 C A 

z 60 










o c c 
zoo 










z ' U 






Leu 


Arg 


Val 


Phe 


Phe 


T7 — 1 

Val 


Asp 


Pro 


blU 


17 1 

val 


Arg 


Asn 


val 


Ser 


Gly 


He 






275 










280 










o o c 
ZOO 








Val 


Asn 


Phe 


Ser 


Gly 


Gly 


Asn 


Met 


Ser 


His 


Phe 


Glu 


Met 


Leu 


His 


Leu 




2 90 










295 










300 










Ser 


Asp 


Asn 


Glu 


Glu 


Asp 


He 


Arg 


Ala 


Met 


Lys 


Asn 


Leu 


Glu 


Ser 


Ala 


305 










310 










Tic 

JlD 










O O A 
JZ U 


Val 


Phe 


He 


Pro 


Glu 


Lys 


Leu 


Trp 


Thr 


Gin 


Leu 


Lys 


Arg 


Lys 


Gly 


Ala 










325 










330 










335 




Ser 


Tyr 


Leu 


Val 


Phe 


Lys 


Val 


Tyr 


Thr 


Arg 


Asp 


Ser 


Leu 


Phe 


Val 


Glu 








J4 U 










"3 a c 










Ten 
J3U 






Thr 


Glu 


Glu 


Glu 


He 


Lys 


Arg 


Arg 


Pro 


Thr 


Ser 


Asn 


val 


lie 


ber 


lie 






355 










O H A 

360 










"3 C C 








Thr 


He 


Pro 


Gly 


Leu 


Asn 


Asp 


Thr 


Lys 


Leu 


Pro 


Gly 


Lys 


Leu 


Pro 


Phe 




370 










375 










O O A 










Phe 


Leu 


Arg 


Asn 


Ala 


Lys 


Ala 


Asn 


Glu 


Thr 


Gin 


Phe 


Glu 


bly 


Gly 


Cys 


385 










O A A 










j y o 










A A A 
4 UU 


Gly 


Tyr 


Trp 


Asn 


Tyr 


Glu 


Thr 


Trp 


Leu 


Ser 


Asp 


Gly 


He 


Ser 


Thr 


Ser 










405 










410 










415 




Gly 


Gly 


Gly 


Ser 


Leu 


Glu 


Ala 


Ser 


Ser 


His 


Pro 


Val 


He 


Leu 


Cys 


His 








420 










425 










4 30 






Ala 


Asp 


His 


Leu 


Thr 


Gin 


Phe 


Thr 


Phe 


Leu 


Leu 


Gly 


Val 


Ser 


Lys 


Met 






435 










440 










4 4 5 








Gin 


Ala 


Gly 


Leu 


Asp 


Thr 


Asn 


Glu 


Asp 


Asp 


His 


Ser 


Leu 


Asp 


Val 


He 




450 










455 










4 60 










Thr 


Asn 


Val 


Gly 


Leu 


Thr 


Leu 


Ser 


Leu 


Leu 


Gly 


Leu 


Leu 


Met 


He 


Phe 


4 65 










470 










4 /O 










A O A 

4 BO 


lie 


Thr 


Ala 


Ala 


Val 


Phe 


Lys 


Ser 


Phe 


Arg 


Thr 


Leu 


Ala 


Ser 


Thr 


Lys 










485 










A A A 

4 90 










A A C 

4 ya 




lie 


Leu 


Leu 


Asn 


Leu 


Cys 


Ala 


Ala 


Leu 


Gly 


Leu 


Gin 


Leu 


Leu 


Phe 


Phe 








C A A 

500 










505 










CIA 
Oil) 






Leu 


He 


Leu 


Ser 


Gin 


Ser 


His 


Leu 


Leu 


GlU 


Gin 


Leu 


blU 


Gin 


Ser 


blU 






OlO 










520 










C O CI 

3Z3 








Ser 


Glu 


Arg 


Cys 


Thr 


Leu 


val 


Gly Ala 


val 


Met 


bin 


Tyr 


Leu 


Leu 


Leu 




C *3 A 










c "3 c 










C A A 

3 4 u 










Val 


Val 


Phe 


Ser 


Trp 


Met 


Fne 


He 


lie 


biy 


fne 


Leu 


bin 


Tyr 


bin 


Arg 


C A C 

3 4 3 










33U 










t= C CL 
JJJ 










c £n 


Tyr 


Val 


Arg 


val 


He 


Gly 


val 


Asn 


His 


Pro 


Arg 


HIS 


Tyr 


lie 


Leu 


Met 










565 










C T A 

3/0 










ETC 

3/3 




Ser 


Ala 


Val 


Ala 


Ala 


Trp 


Thr 


Leu 


Pro 


Leu 


He 


Pro 


Thr 


Leu 


Leu 


val 








C O A 

3oU 










585 










can 






Val 


Phe 


Leu 


Glu 


Pro 


Gly 


Ser 


Tyr 


Arg 


Pro 


Asn 


Asn 


Ser 


Ser 


Met 


Asp 






595 










600 










605 








Tyr 


Pro 


He 


Leu 


Cys 


Tyr 


Pro 


Ser 


Gly 


Tyr 


Gly 


Leu 


Ser 


Leu 


Gly 


Val 




610 










615 










620 










He 


Leu 


Pro 


He 


Gly 


Leu 


He 


Thr 


Val 


Ala 


Asn 


Ala 


He 


Leu 


Gin 


Tyr 


625 










630 










635 










640 


Lys 


Asn 


Leu 


Asp 


Val 
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<210> 181 
<211> 6938 
<212> DNA 
<213> Drosophila 



<400> 181 

cagctagtag 

atatgcactt 

tgtgttttga 

aagtctagcc 

ataatacaaa 

gaggggggaa 

cgctgatcga 

tgaaaggttg 

cgaaaacgcg 

agttatatcg 

ttgatctatc 

acgtaaacta 

ctttcaaggt 

tgtttggtga 

tttctttctt 

aactatttct 

tacaattctc 

cggagtggga 

aggtatccgt 

cctcggtgat 

gcgaatttga 

gtagtttcgt 

acatgcacac 

tgggtctaag 

gattttagtt 

aaaagcaacc 

ggttattgtt 

caagaacaat 

attggaggat 

gatctaggga 

aatatcgatc 

gtgagcacta 

gggagaagcg 

cctgccgcag 

tgtatagaaa 

cgtatgtcct 

cagagcccgg 

gcatccacag 

agatgcaaca 

ggaaacggaa 

ctgtgctgaa 

tattggtcgt 

taaaaaagta 

agagaacttc 

aggcagagtc 

ttgaaaggaa 

taaattcatg 

agttaaagtg 

tgtttaaacg 

tagcttatta 

ataaccataa 



tgaataatac 
tagcctctat 
aaatgtaaca 
catttttaga 
tacactttag 
ttgctagttt 
aaatgatctc 
acttgaaagc 
tgttttagga 
ttacctacac 
agtgatccta 
aagtaaaatc 
tcctcacttt 
gtgtttgttg 
tgcgaaatta 
cacatattat 
aaatatccgc 
gaccatcgag 
gggtccttcg 
ctcgatgtcc 
gatattttag 
gtgcaatctg 
acacacacac 
tttatagaca 
tacattcaca 
gaaaattctt 
ttgtttggtt 
gatatttata 
caagagaagt 
gcgtggggat 
tgaggcctac 
tggctggtgg 
acggtgaccc 
agtcttgttc 
atagagataa 
cgggactcat 
catatcctcc 
gtaaacgtat 
aacaccgcaa 
atggaaacgg 
gagttatgta 
gattgtatct 
actcaaaagg 
agtttaagat 
ttggatacac 
accgattttt 
caaaaatcaa 
caatacattc 
agacacactt 
ctaaaattat 
acaaatgaaa 



ttgagttttt 
taagcatttt 
atttgcgctt 
ttttgtatgg 
ttggtaagat 
tatgttaatg 
aagatctcta 
aaatggttag 
atgggtttgt 
aggtgggctc 
atcaatcgaa 
gtctaaaaat 
acgtttcatt 
catctccatt 
aagtaattct 
ctacacatac 
catgaaccag 
tgagagagca 
gcactgcaca 
tgatgatgct 
tgggtcacat 
ttttcgataa 
acacgcacaa 
aacgtgcgag 
acacagatat 
accttatcct 
tggttgtggt 
gatgtattaa 
acagattcat 
gggcagcaaa 
agcccggttc 
tggactacgc 
gccttgcgtc 
ttcattatct 
tagaacatat 
ttcggatatg 
gcccagccgc 
tgatagatca 
accacactca 
aactcaaagg 
tagttatatg 
ttgggggtgc 
taaacaagat 
tattcagttg 
tagtttggaa 
gtcatttgtt 
cactaaacat 
aggcataaat 
gcagcaactt 
gttaattttc 
tattcaaatt 



gagtttgttg 
cctatggttt 
tcgtcctcgt 
gaatgagtgg 
tagcaagacg 
cctataagct 
tcatttgttt 
taactaactg 
ttttcttagt 
tgttttaccg 
atcaatcgaa 
ttatcaaaac 
gaaaaataaa 
tgcatcaatc 
tttattgttt 
caacgtgtat 
caaggactcc 
ttgagtggga 
ccgaatcctc 
tgttgctgcg 
ggacaactgg 
gacttatgtg 
gctggtggtt 
aaattctttt 
tttagcaggt 
tgctgccctt 
gggtttcgaa 
tataagtggc 
agacggttag 
aggtttggtt 
cccattctgg 
tgccgttgtg 
cgccacgtcg 
ccaattgggt 
agatatgctt 
gtcagctcac 
tggccggcgt 
tacgctagcg 
ggaagtcaaa 
taagagagag 
tatatagtat 
tggaaatgaa 
ggacgatagc 
aggcacagaa 
tttttttttc 
tgttagtagt 
gcaaattaaa 
aataagagct 
catagcttga 
tcggtgaaaa 
gtattttaat 



ggctgggttg 
tcactctcgt 
acattgtgtg 
aaattacaag 
taagacgtac 
caaatttaca 
agctttagtg 
tcgaggaact 
gtatgtggcc 
atctaaacgt 
ctcaagactg 
tttataaaac 
gtaatatttg 
attttcctgc 
gtgttgttgc 
cgtaccgttt 
ctacttcgaa 
cacgccggag 
gggcgtcctg 
gtgtttggca 
tggatatgtg 
gtgctatcta 
tcgggttcat 
gatttagtcg 
taggtttaga 
tttctacatt 
tatattgtac 
aaacagtact 
agagacgggg 
tcggattgaa 
ccacgtcaat 
gtgaccccga 
cttactgatg 
atacagtctg 
cgtacctttt 
caatgtccgg 
cgtgtgacgt 
aacgaacctg 
cggaaaatgg 
caaaatcgca 
gtgtatgggt 
cgaaaagaaa 
tataagaaga 
agacagagaa 
tggtatgcat 
tagatgttgt 
tggcgtgctt 
aaaaacagtc 
ctaaagtttc 
ccagaacaag 
tagcaaaagc 



aaaaaaaaaa 
tcgggaactg 
tatttacaga 
tgtataatat 
gatagttaat 
actgattatt 
tggatagtgt 
tagaggtaaa 
ttgtttttac 
ttgaacaact 
acagaaatca 
ttcttggctg 
ttattgtttg 
tggttttttt 
gctatataat 
aatacctata 
tgcgagacca 
atttccgcat 
gaaggttccg 
cttagagcct 
ttttctcgtt 
tgggacggac 
tggggaactt 
ttttgttttg 
ggtttatgct 
gaacattggt 
atagtcatga 
gcgagttcac 
gagagagaaa 
gaataacagc 
gtcatgtaca 
ctcacctgca 
tgcggattgt 
gaatcgaaaa 
gagttcggcg 
atcgcccaga 
acccttgaaa 
caagctatga 
agaacggaac 
tggtatagat 
ggtctgtgct 
caaatcaaat 
ctacttagac 
agttacagtg 
attggaaatt 
gttttcttgt 
attgagttca 
gaacattctc 
aattcaatgt 
aactcaaaat 
aaccaagctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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ttgagtggat 
gtttcctcaa 
ccacgacaga 
ttcgtttggt 
agaacacaaa 
aatcatatcg 
caaatatcag 
agatggcgct 
agaccaggaa 
aaggaaatta 
atctaagcca 
catcttaccc 
cacaggatga 
ttcgtgtccg 
gccgtaaagg 
aggtccacgt 
cggtacaact 
ggctccagaa 
atatgtaaat 
aagtaaaaca 
ctgaaaatga 
cttgcacttt 
gcccagtacg 
ctgaaaggtc 
gcacgagtag 
cgtggaattg 
ggcgggtttg 
tggtattata 
actcacccat 
gcggcaatga 
ccaaccagct 
atggccagag 
gggtgccgtc 
caactttcca 
gcaagctacg 
atttcaaagc 
tgggacacaa 
cttattgtgc 
cattatcgct 
gtgaatgctg 
gctggcgatt 
cggtgtgttg 
ctcggcgatg 
tgcgttgtgt 
aatcgaattc 
tacgctttgc 
tttctaattg 
gtcgccacgt 
attattcacc 
agttttgctt 
ctgcgggaag 
taatttacac 
gccactttga 
gagggggttg 
ggctggcggc 
tgagcgacca 
tcgatataca 
ttttttttcc 
aggcagctgg 
agcactcgcg 



gtccacattt 
ccaatgtcaa 
cgaaaggcac 
gcccgactag 
agcgtactca 
tttcgaactc 
acccaaggcg 
gggactcatt 
ctccagcgtc 
ccgattaatt 
atctcttgcg 
ggaactgggt 
gcatctcgct 
cctccctgag 
cggccatgta 
cgcgcagcac 
tcagtatgat 
ggcagcgctc 
acacacagaa 
atgcccagca 
tgaaaggatt 
cgctgccgga 
atggccacca 
agacagacgc 
ttgtcgtagc 
ggcaggtagt 
ttctcggtga 
tatattatcc 
gacggcgacc 
aggcagccgc 
gcacatcgat 
ccagcgacgc 
cgtgctgttc 
agattatacg 
gtgcatattg 
cataagataa 
ccagtcttcg 
acctgatccg 
gattagcgag 
ggcgcggcaa 
ccattggccc 
gcatttggga 
tcggtgtagt 
cataaataat 
gatttaaata 
taaactcaat 
aagatatttt 
gatttaagtg 
tggcgcacag 
cgtttcgact 
ggtttttaat 
gttacaagaa 
tgacatcatt 
ctggtagcgg 
accttttggc 
cattgcgtat 
aacgtttgct 
tgccaaacgt 
agaaatggca 
cgcagccata 



taatacttct 
taccaagtac 
aaaatgaacg 
gacaaaacac 
aaccaaaaaa 
aaatcggtta 
aaactattcg 
tccggcaggt 
agtgcggtga 
catttgactt 
cttaattgat 
gttggcattc 
ggtctgcgtg 
gctctccagc 
gcagatgacg 
ccagcgatgc 
ggcgccgtag 
ggtggatgcg 
gttgaaggtg 
gtatcgtctc 
actcaaagtt 
agacaatcac 
ggaggcgcag 
cgttgctatc 
cctcggacag 
aggattcccg 
gcaggcagaa 
agaactaatg 
gaaaacttcc 
cacaactctg 
gcgactcact 
gtcaacaggt 
caggccgcac 
gcttatggtc 
aatttaattg 
ttgattttaa 
aaacttaccc 
gtggtatctg 
tgcataccgt 
tggcgaatgc 
ggcgactcac 
atttgcccga 
cgtaaaactg 
catttcaatg 
cacaaagaaa 
ttatttatac 
cggaaattgc 
accatgcggc 
gacacgtttc 
gtgaaataat 
ccaagttcgt 
actctcgaga 
atggctttgg 
ggaaaaagga 
ttttaaaaca 
acgccaccga 
ggatgctcct 
gagccaaaat 
cgagacctgc 
atcataaaca 



tgtgttttgg 
gcatttcgat 
ggtgtgaata 
ttcctaatta 
aaaaaaaaaa 
cctgcttgtg 
tcagctggga 
agacaaagcg 
ccagaaattg 
gacttaaatt 
aaagaaagat 
cgactggcga 
accagttccc 
tgcgcacttt 
gcaaagacca 
gctttccggg 
caggtgatga 
ggaaagaaat 
accatcgcca 
cagcacagtc 
tcccaacttt 
cccgaggaca 
acaggcgtcc 
gcagttagtg 
ttccactcga 
gcacagacag 
ggtcgtattc 
tcacgaatta 
tccaggccat 
aaagcattta 
tgatcctgtg 
tgcagtcgcg 
tgtagtcggg 
tattcaaatt 
tatttatctt 
tttgttgcta 
caactcccgc 
caataagtta 
aacgttatcc 
cagcagattg 
ctccgcacgt 
taaaaattca 
ttgaattcat 
tatatgaaat 
aagttcaata 
cacactatat 
gaaaaaatta 
acggccactt 
acagcaacac 
aggatttaat 
tcagctccta 
tctacgcacc 
gaaacaaaag 
cacgttgttt 
aatatttccc 
agccaccaaa 
ctaccatgtg 
ttacacatcg 
tcccagataa 
tttggtatac 



ctagttattt 
gactataatg 
ctctaagcat 
catcatatat 
aaaccaaagc 
ctggtaccac 
gcggatgaac 
cagaaagtag 
tcgttccttc 
ccccttctac 
ctgcaatcga 
tggacaggtg 
acttgagcgg 
cgagaagatc 
tggtgcccag 
tgcgaaagtc 
agcccagctc 
ggacggcaac 
ggagcactca 
cacataccag 
aggatatgta 
atgcagcaca 
acattgagca 
catccgccgg 
tcgccacgga 
gtgtacacat 
cgatcgcaac 
tcgtataaag 
cgttgcacac 
gcataaatta 
ctcgctgaag 
aaatggagcg 
taaaataata 
aagcgatgca 
ttagagtaat 
attaatgtcg 
caataggaac 
aatcgattag 
ctgaatttaa 
tcgtcctcgt 
gtcggctcat 
ataaactggc 
aaaaccagat 
gggcatataa 
aaattaacac 
aggaaacgga 
cgtatatgcc 
cccccgactt 
cttgcggcaa 
accctcgccg 
accgaccaaa 
aaagtgtagg 
taaccacaga 
gggcgtgagt 
attagctggg 
taaattgcca 
tgtgtgagtg 
aggcaaaagc 
agtca ttatg 
tcttgcatgc 



aacaaaaatc 
acgaccacgc 
atgaattata 
ctcaatgtgt 
aaagtaagta 
aactttggca 
agggccagca 
aagctggacg 
aatgggacaa 
ttagttaatc 
tttgactgta 
cagtccaaag 
atggcaggtg 
caacgacgag 
atacttgagc 
caccaggtgg 
ccgaagccag 
ctaaagatga 
cagatgcata 
acgcaatggc 
aactaaatac 
ggatgcaggc 
cctcctcctc 
gacacggaat 
atccctggag 
cccgggtggc 
tacagagata 
tttggccaca 
atcctcgtcg 
caaagttgtg 
gatatgcgaa 
ccccactcct 
catttatggg 
cattatttcg 
tagtatttgc 
cacacttttt 
ccaaatatgc 
cccatggccc 
cgaccaccgt 
tgagggcgta 
cgatggcaat 
gtgcggcatc 
cacaattaat 
atatcattta 
agcataaatt 
aaacgtgtgc 
acgtatattg 
gggaccatta 
gagtgcgaaa 
tgctgtggga 
tgtttataca 
aaatggtagg 
agggagcagg 
catgtagttg 
ctttcggtga 
acaagtaaca 
tgtttcttca 
aatcagtata 
tagtgctcct 
tagagaaata 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
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ttgtagttgg gtaagtgttt tcagatatta aactaagtaa ggaaatattt ctatttttac 6720 

accattattc attgtttttt cttcaggagt tatctgaact ttaagtaatc ccaccacagt 6780 

tcttggaagc agtagaatgt cttcattacc gccaaatcct tttgatatcc tttttcccaa 6840 

atgtccgcaa ttacacacat accctcgcac tagcgcccct cgttttttca gggtataaat 6900 

gtatggggaa aacacttgta aatagcggca gcagccac 6938 



<210> 182 
<211> 2010 
<212> DNA 
<213> Drosophila 



<400> 182 

atggtcccaa 

gaggatgccg 

ccgattgcca 

agctacgccc 

cacacggtgg 

aaggcatatt 

acccaggagt 

ccatttcgca 

gccgacgagg 

gatcggaata 

tacacctgtc 

ggcgatcgag 

ggatgcacta 

aatgtggacg 

tgcattgtcc 

atgtggactg 

gtccatttct 

ggcttcatca 

cgcacccgga 

accatggtct 

ctcgaaagtg 

aagtgggaac 

tccatcgcca 

ctgacgctgg 

gaaatgagtc 

ttcgccttgg 

cacgacgccg 

attggtgagc 

ctgtataccc 

agtaagcgac 

agcaaggata 

atcaccgagg 

acggatacct 

tcccactccg 



gtcgggggaa 
cacgccagtt 
tcgatgagcc 
tcaacgagga 
tcgttaaatt 
tgggttccta 
ggggcgctcc 
tatccttcag 
atgtgtgcaa 
cgaccttctg 
tgtgccggga 
tggaactgtc 
actgcgatag 
cctgtctgcg 
tcggggtgat 
tgctggagac 
ttcccgcatc 
cctgctacgg 
aagcgcatcg 
ttgccgtcat 
cgcagctgga 
tggtcacgca 
gtcggaatgc 
agttcctggt 
ccagcgccat 
gtctgatatt 
gccagcggct 
tgaccatatc 
aattggagat 
gtggcggacg 
aggctctaag 
cggaaccttc 
atgcggaaat 
tggtctcgca 



gtggccgtgc 
tattgaattt 
gacacgtgcg 
cgacaatctg 
cagggataac 
ttggcgggag 
atttcgcgac 
cgagcacagg 
cgatggcctg 
cctgctcacc 
atcctactac 
cgagggctac 
caacggcgtc 
cctcctggtg 
tgtcttccgg 
gatactgctg 
caccgagcgc 
cgccatcata 
ctgggtgctg 
ctgctacatg 
gagcctcagg 
gaccagcgag 
caacacccag 
ctcgtccagc 
cttgctggcc 
tgtgccaaag 
gggcggagga 
cgaaatgagt 
aatgaagaac 
caaggcgggt 
tgccaaacac 
caggacgccc 
ctccggcgtg 
ttcgaagtag 



cgcatgtttt 
ttatcgggca 
gaggtgagtc 
ctggcattcg 
gttacgatac 
ttgggtgcgg 
tgcaacctgt 
atcaaagttg 
gaggaagttt 
gagaacaaac 
ctgcccaatt 
gacaactact 
tgtctgacct 
gccatcgtac 
cagcgaaagt 
ggcattgttt 
tgccttctgg 
ctgaagttgt 
cgcgacgtgg 
gccgccttta 
gaggcggaca 
atgctcatcc 
ttccgggaac 
ttctactttc 
ctgttcatcc 
ttgtggtacc 
tatgccgggc 
cccgaggaca 
aagactctgc 
caccgtcgct 
cgcagcaaca 
gaggattcgg 
tcccactcaa 



acgactacac 
aattcccaaa 
gccgggccaa 
ccattgccgc 
caccggatca 
cctggaacag 
tgacgcgtcg 
tggcggctgc 
tcggtcgccg 
ccgccgccac 
ccacgctcca 
cgtgcattcc 
ttcaggagga 
tgggcgcctg 
gcaaggccat 
tactttatgc 
agccctggct 
accgccacct 
acctgctcaa 
cggcctcgtc 
cgaacacctg 
tgtgctttgg 
gacaatttct 
tgcgctttgt 
gctcccagct 
agcacaagca 
tctgtctggg 
tacgcgccga 
ggcaggacaa 
tctccctgca 
agcatcatca 
tgtgcagtgc 
tgctctctca 



cgacatcgcc 
tgccaacaca 
tggaatcgcc 
gcccagcatt 
ggtgcacaat 
cacggacggc 
ctggctctgg 
cttcattgcc 
tcatggttgc 
ccgggatgtg 
gggattccgt 
gtgtcccggc 
ggaggtgctc 
catcctgtgc 
tgcgtctggt 
atctgttgcc 
tcgggagctg 
ggtggacttt 
gtatctgggc 
gttggatctt 
ccatccgctc 
actgcacctg 
ggtcaccgca 
ctacctgccg 
gacgaatagt 
gggtacgtca 
cgatccggac 
actcaaaaga 
tccgcacatc 
gaaaaagggc 
ggacatcgag 
cgaaggaccc 
ctcgatggtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2010 



<210> 183 
<211> 669 
<212> PRT 
<213> Drosophila 

<400> 183 

Met Val Pro Ser Arg Gly Lys Trp Pro Cys Arg Met Phe Tyr Asp Tyr 

15 10 15 

Thr Asp lie Ala Glu Asp Ala Ala Arg Gin Phe He Glu Phe Leu Ser 

20 25 30 

Gly Lys Phe Pro Asn Ala Asn Thr Pro He Ala He Asp Glu Pro Thr 
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J D 










a n 

*i U 










4 5 








Arg 


A 1 — 

Ala 


GIU 


val 


ber 


Arg 


Arg 


Ala 


Asn 


Gly 


lie 


Ala 


oer 


Tyr 


Ala 


Leu 




DU 










c c 
D D 










DU 










Asn 


Glu 


Asp 


Asp 


Asn 


Leu 


Leu 


Ala 


Phe 


Ala 


lie 


Ala 


Ala 


Pro 


ber 


lie 


c c 
DJ 










<u 










» D 










Q ft 


His 


Thr 


Val 


Val 


Val 


Lys 


Phe 


Arg 


Asp Asn 


Val 


Thr 


Tl fl 

lie 


Pro 


Pro 


Asp 










85 










90 










ft c 




Gin 


Val 


His 


Asn 


Lys 


Ala 


Tyr 


Leu 


Gly 


Ser 


Tyr 


Trp 


Arg 


GlU 


Leu 


Gly 








inn 
1UU 










105 










lift 
1 1 U 






Ala 


Ala 


Trp 


Asn 


Ser 


Thr 


Asp 


Gly 


Thr 


Gin 


Glu 


Trp 


Gly 


Ala 


Pro 


Phe 






1 Id 










i o ft 
l<cU 










1 o c 
1ZD 








Arg 


Asp 


Cys 


Asn 


Leu 


Leu 


Thr 


Arg 


Arg 


Trp 


Leu 


Trp 


Pro 


Phe 


Arg 


He 




130 










135 










140 










Ser 


Phe 


Ser 


Glu 


His 


Arg 


He 


Lys 


Val 


Val 


Ala 


Ala 


Ala 


Phe 


He 


Ala 


145 










150 










155 










160 


Ala 


Asp 


Glu 


Asp 


Val 


Cys 


Asn 


Asp 


Gly 


Leu 


Glu 


Glu 


Val 


Phe 


Gly 


Arg 










165 










170 










175 




Arg 


His 


Gly 


Cys 


Asp 


Arg 


Asn 


Thr 


Thr 


Phe 


Cys 


Leu 


Leu 


Thr 


Glu 


Asn 








180 










185 










190 






Lys 


Pro 


Ala 


Ala 


Thr 


Arg 


Asp 


Val 


Tyr 


Thr 


Cys 


Leu 


Cys 


Arg 


Glu 


Ser 






195 










200 










205 








Tyr 


Tyr 


Leu 


Pro 


Asn 


Ser 


Thr 


Leu 


Gin 


Gly 


Phe 


Arg 


Gly 


Asp 


Arg 


Val 




210 










215 










220 










Glu 


Leu 


Ser 


Glu 


Gly 


Tyr 


Asp 


Asn 


Tyr 


Ser 


Cys 


He 


Pro 


Cys 


Pro 


Gly 


225 










230 










235 










24U 


Gly 


Cys 


Thr 


Asn 


Cys 


Asp 


Ser 


Asn 


Gly Val 


Cys 


Leu 


Thr 


Phe 


Gin 


Glu 










245 










250 










255 




Glu 


Glu 


Val 


Leu 


Asn 


Val 


Asp 


Ala 


Cys 


Leu Arg 


Leu 


Leu 


Val 


Ala 


He 








260 










265 










270 






Val 


Leu 


Gly 


Ala 


Cys 


He 


Leu 


Cys 


Cys 


He 


Val 


Leu 


Gly 


Val 


He 


Val 






Z ID 










O Q ft 

^oU 










IOC 








Phe 


Arg 


Gin 


Arg 


Lys 


Cys 


Lys 


Ala 


He 


Ala 


Ser 


Gly 


Met 


Trp 


Thr 


Val 




290 










o c 










300 










Leu 


Glu 


Thr 


He 


Leu 


Leu 


Gly 


He 


Val 


Leu 


Leu 


Tyr 


Ala 


Ser 


Val 


Ala 


305 










310 










315 










320 


Val 


His 


Phe 


Phe 


Pro 


Ala 


Ser 


Thr 


Glu 


Arg 


Cys 


Leu 


Leu 


Glu 


Pro 


Trp 










325 










330 










335 




Leu 


Arg 


Glu 


Leu 


Gly 


Phe 


lie 


Thr 


Cys 


Tyr 


Gly Ala 


lie 


He 


Leu 


Lys 








340 










345 










350 






Leu 


Tyr 


Arg 


His 


Leu 


Val 


Asp 


Phe 


Arg 


Thr 


Arg 


Lys 


Ala 


His 


Arg 


Trp 






355 










360 










365 








Val 


Leu 


Arg 


Asp 


Val 


Asp 


Leu 


Leu 


Lys 


Tyr 


Leu 


Gly 


Thr 


Met 


Val 


Phe 




370 










375 










380 










Ala 


Val 


He 


Cys 


Tyr 


Met 


Ala 


Ala 


Phe 


Thr 


Ala 


Ser 


Ser 


Leu 


Asp 


Leu 


385 










390 










395 










4 00 


Leu 


Glu 


Ser 


Ala 


Gin 


Leu 


Glu 


Ser 


Leu 


Arg 


Glu 


Ala 


Asp 


Thr 


Asn 


Thr 










405 










410 










4 Id 




Cys 


His 


Pro 


Leu 


Lys 


Trp 


GlU 


Leu 


Val 


Thr 


Gin 


Thr 


Ser 


Glu 


Met 


Leu 








420 










4 25 










4 30 






lie 


Leu 


Cys 


Phe 


Gly 


Leu 


His 


Leu 


Ser 


He 


Ala 


Ser 


Arg 


Asn 


A 1 -i 

Ala 


Asn 






435 










4 4 0 










4 4 5 








Thr 


Gin 


Phe 


Arg 


Glu 


Arg 


Gin 


Phe 


Leu 


Val 


Thr 


Ala 


Leu 


Thr 


Leu 


Glu 




/ten 
4 DU 










/ICC 
H D D 










460 










Phe 


Leu 


Val 


Ser 


Ser 


Ser 


Phe 


Tyr 


Phe 


Leu 


Arg 


Phe 


Val 


Tyr 


Leu 


Pro 


465 










470 










475 










480 


Glu 


Met 


Ser 


Pro 


Ser 


Ala 


He 


Leu 


Leu 


Ala 


Leu 


Phe 


He 


Arg 


Ser 


Gin 










485 










4 90 










495 




Leu 


Thr 


Asn 


Ser 


Phe 


Ala 


Leu 


Gly 


Leu 


He 


Phe 


Val 


Pro 


Lys 


Leu 


Trp 








500 










505 










510 






Tyr 


Gin 


His 


Lys 


Gin 


Gly 


Thr 


Ser 


His 


Asp 


Ala 


Gly 


Gin 


Arg 


Leu 


Gly 
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515 










520 








525 








Gly 


Gly 
530 


Tyr 


Ala 


Gly 


Leu 


Cys 
535 


Leu 


Gly Asp 


Pro 


Asp 
540 


He 


Gly 


Glu 


Leu 


Thr 


lie 


Ser 


Glu 


Met 


Ser 


Pro 


Glu 


Asp He 


Arg 


Ala 


Glu 


Leu 


Lys 


Arg 


545 










550 








555 










560 


Leu 


Tyr 


Thr 


Gin 


Leu 


Glu 


He 


Met 


Lys Asn 


Lys 


Thr 


Leu 


Arg 


Gin 


Asp 








565 








570 










575 




Asn 


Pro 


His 


He 


Ser 


Lys 


Arg 


Arg 


Gly Gly Arg 


Lys 


Ala 


Gly 


His 


Arg 








580 










585 








590 






Arg 


Phe 


Ser 


Leu 


Gin 


Lys 


Lys 


Gly 


Ser Lys 


Asp 


Lys 


Ala 


Leu 


Ser 


Ala 




595 










600 








605 








Lys 


His 


Arg 


Ser 


Asn 


Lys 


His 


His 


Gin Asp 


He 


Glu 


He 


Thr 


Glu 


Ala 


610 










615 








620 










Glu 


Pro 


Ser 


Arg 


Thr 


Pro 


Glu 


Asp 


Ser Val 


Cys 


Ser 


Ala 


Glu 


Gly 


Pro 


625 










630 








635 










640 


Thr 


Asp 


Thr 


Tyr 


Ala 


Glu 


He 


Ser 


Gly Val 


Ser 


His 


Ser 


Met 


Leu 


Ser 








645 








650 










655 




His 


Ser 


Met 


Val 
660 


Ser 


His 


Ser 


Val 


Val Ser 
665 


His 


Ser 


Lys 









<210> 184 
<211> 4243 
<212> DNA 
<213> Drosophila 



<400> 184 

ctcgtgacct 

cacacgtaat 

ttggttttga 

gtttagttga 

gcgattgtcg 

ttgtatctct 

ttattcgcag 

ttagttttag 

gttgcagtga 

gctttacggc 

atttatcgaa 

tacaacaaga 

gatctgtgct 

acaaaaatga 

acattacgag 

cactttaatt 

atccttacac 

tcgaggcggc 

tctggatctc 

catccttgcc 

attgtccgcc 

tcagcagcgc 

agggcgttgt 

gcaggctgga 

agaacaggca 

agatcagcgg 

tgtccggcac 

ataattatta 

cacaggtaat 

aataatacaa 

ctcataaacg 

tttgtctgtg 

gggccagggc 



ttgaacccga 
ttggttagtc 
ccaatcgcca 
ttatacgatt 
ttgaaataac 
ataaatttat 
gacttctcac 
ctttagctcg 
ttgccatttt 
ttacagattt 
atgagtttac 
tataaacgac 
ccaaggaacc 
aaaaatgtga 
taaatcagcc 
gcattaaatt 
cagtcgcacc 
ggcattgttg 
ctgcacatcg 
cgtatccttt 
caggcgggat 
ctcatccctg 
cgagttaagg 
gcgccggatg 
gagcagcgtg 
gtttagcgtt 
ttggcactcc 
gcttacactc 
gcaattttat 
aagcaaatat 
atctatattt 
tagcacataa 
tttgcataaa 



cgctggcact 
tgatatcgat 
gggtcctaca 
taggggttta 
aattgggttt 
caatattctc 
gtagttgtct 
tcgattttgc 
gaataattta 
tcgattggcg 
aacctgtgct 
ataaaatcat 
cggcaaactc 
agaaaatgaa 
gacgcatttg 
gattccaccc 
atttgcacca 
ttcttgttcc 
tcgcccgcca 
gcatccttcg 
ccgcccgccg 
tattataagt 
tagctctccg 
tcgatgtcca 
ttgcggaacg 
gagttgaagt 
tcgcaggtca 
attatcttta 
gtactttttt 
aaaagatcaa 
gaatactttg 
agatgagata 
tgagattata 



ttgaaacgga 
gcactatatg 
tatatggttt 
ctttaggtat 
tagctttgct 
cgattctaat 
ataatttgcg 
gaatccctat 
tttgtggtta 
tacgtacata 
taatttagtc 
aaataaataa 
gaattgagat 
atggcacgct 
ccggcatatt 
atccggctgg 
tttgcaccat 
atttgttgat 
agcaatcgac 
actccctgcc 
ccagccgctg 
tgtccaccat 
agtagacgct 
gaggcaggtg 
cctcccggaa 
acccgatcca 
tggagagtgt 
acttaattta 
tttcccggca 
atgaatattt 
atcaccattt 
tatataccca 
ctctgggtgg 



aacggaaaca 
ttggctaatt 
tgactatata 
tttgagcatt 
ttagctgccg 
tgcgttgggg 
gtctcttaat 
gtacacgcat 
tttcttttcg 
catattacat 
gtacaaaaat 
gttatcctat 
taaatgtttt 
ccacagattg 
aattttacaa 
catcgattga 
ttacacatga 
gtactccatg 
attggatttg 
atccttggcg 
ctgctgctgg 
ctgagactgg 
cttcgcccgc 
gcgatccttc 
gtcgcggttg 
gaagaggatt 
gtacctaaaa 
agtcaatttt 
ttgcataata 
ttgtttcata 
gaaatctttc 
taaattccag 
tgtgaaaaac 



cacactcaca 
tagacggact 
ttctcgctat 
ttcaattcat 
tgacttttgt 
gtgtggttgt 
tggcacagtt 
agctcagtga 
tgagcaaaat 
attcatatac 
actcagtatt 
acaaagtccc 
aaaaacgaaa 
aataatgtca 
attaatggaa 
ttcgattcgc 
ctcgaggccg 
ggtatttcaa 
cccttgcccg 
gcagctgcca 
gcgatggggg 
cggcgatgcg 
tggtcgtagc 
caccaattgc 
aagtacgcgt 
gagacgacta 
ttaaaacaat 
aaattagttg 
tctatttttt 
aaagttaaac 
tgcatcactt 
atgcatgagt 
aacaaccgcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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gaaacaatgg 
aattatgcct 
ccttcgacgg 
gcccatgatg 
cttgtcctta 
cccggatccg 
caggctggag 
tgtagaacaa 
gactttctgt 
atatattcaa 
aactttgact 
gccgtgccgc 
ggtgaatatc 
gtacttgttc 
cggcgtggtg 
tgtgtttagc 
cttaacaata 
agctatagtt 
aaatactcgc 
tgaagttcca 
taacgaggca 
catgaatttg 
gccacgcgaa 
tgcaaaaaat 
tctgattaaa 
taaaatacag 
atttaaacgg 
tagacacact 
gatttcatct 
caaattaaaa 
aagtggcaag 
gtcgtcgctg 
catttcggtg 
cgggcgcccc 
gctcgctggt 
ttgacactta 
gaaggtaaaa 
cgatgccaat 



cgcacgggcc 
ggaatcccat 
ctggttggta 
atgcccagcg 
gtgggggtgt 
ctcagcgcct 
atgggccact 
actagtgatc 
cctggtaaat 
acaaacattt 
ccaaactcac 
atcatcagct 
ataatggtgc 
accacgaacg 
taccagccga 
agcataatgc 
gcatagtatc 
ctggctgtgg 
tgttgagaag 
cctgaaaggg 
ataaactttg 
cagtatatat 
atacttttat 
ttagtttttc 
atcatttgat 
atatataggc 
cattttactg 
ctctttttcg 
tagtttcggt 
gcttagatgc 
gaaagcggag 
ttcgcctgtc 
aaggcggata 
gtttggcatc 
ttttggtgct 
agccatgaaa 
tgtctaatga 
ttatatgcaa 



gtatgagtaa 
gtcagcggga 
ctcaccacag 
tgcgtgcggc 
gctcctgctc 
cgcctgccgt 
gaactcaatg 
tttcacgctc 
ttatattttt 
ctttacagcc 
ctgatggctg 
gcttctcctg 
agggtatcca 
agcactgcgt 
tgaagatggg 
cgaccacgcg 
tgtgaagtgg 
catgcaactt 
tagacatcga 
tccaatgaat 
gcctttactt 
ttttgttgac 
gaagtcatcg 
acgattaatt 
tgtgggtaaa 
cataaaaact 
ccaattcctg 
cctattttcg 
tgccaggtga 
actgtgagac 
gcgactctgt 
gcttctgccg 
cccggccttt 
agtttcaaaa 
ttacactaca 
tggcagttcc 
atgccaagtg 
ctcgcgcttg 



tatttaatta 
ttggcctcca 
gaagaaaggc 
cttgtgctcc 
gacctcgtgc 
aggagataag 
cactcgcact 
actacctgtt 
gaagtattat 
aaactaggta 
catggatggt 
ccgattggct 
gaacgatatg 
cggattctgg 
caagaaggag 
tttcgtcatg 
gcaaaacgcc 
acctatccac 
ggctgttcca 
ggaggcatta 
tctaacgtca 
tcatttctgg 
ctttaaaact 
ccctttgtga 
attgcgctag 
cactagatat 
aatgtaggtc 
ttatcaggcc 
cattgcccct 
cagaaaatag 
tcgctgcatg 
tcgcccttgg 
ttggggccag 
acttttcgac 
ttgccatttg 
tcgtgctgca 
gcggcgagtt 
gccaaagttt 



tgcatgcttg 
actctcaacc 
agccagcaga 
cgcttcattt 
agcgtcaatg 
atggagattt 
gcatcactgt 
tgctattcag 
ttccacttaa 
agccttgttg 
cggtgcatca 
tcccggaaga 
gagctcgaga 
atgacgaact 
agcagtgccg 
ctaatcggat 
aggttattgg 
agatatgcag 
caaatcgcac 
gtatcgcgat 
ttgtatctga 
ttttttgcct 
gatttcctgg 
caacacagtt 
ttctaatatt 
gcgcaataag 
tataaaaaac 
gcctgattta 
gaactgccat 
gccataaggc 
aaaaactcat 
ttgccactcc 
gagccagtgg 
tatccaacga 
gggcgggcgc 
attatgcaac 
ggctgttgcc 
ttc 



acattagata 
aaacgttcaa 
ggatgaaggt 
tgattaagtg 
ttttcgacga 
gataaaagtc 
ccaggatgct 
aaagtaataa 
aacccattat 
tttacaaccc 
gcatggcatt 
tggccaggta 
tgacggcgta 
gctggtgctg 
gcgatatcca 
acttgagggg 
gccctcatat 
cataaatgca 
aggaaggggc 
gcttacccaa 
ttatcgagtc 
cacctaaaat 
tttttagctg 
atttacgtac 
tactaccatt 
gttgctctaa 
tcaaactgac 
tgtgtttatc 
tctggagggc 
ccgaggtgaa 
taaaacttgc 
gcattttggc 
aatttattat 
ggctgttttt 
gccatcggtt 
tggcaactgt 
agaaccaaac 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4243 



<210> 185 
<211> 1224 
<212> DNA 
<213> Drosophila 



<400> 185 

atggactcga 

gatttgtgga 

atatctgtgg 

aaacgcgtgg 

ttgcccatct 

ccgacgcagt 

ttctggatac 

cggcaggaga 

atgcagccat 

gaggtcgagc 

cacaaggccg 

ttcttcctgt 

gtctcaatcc 

tacttcaacc 

tggtggaagg 



taatcagata 
acagcctcga 
atagatacta 
tcggcattat 
tcatcggctg 
gctcgttcgt 
cctgcaccat 
agcagctgat 
caggcgaggc 
aggagcacac 
cacgcacgct 
ggtacacact 
tcttctggat 
gcgacttccg 
atcgccacct 



caatgacgtt 
tgtctacttc 
tgctattgtt 
gctgctaaac 
gtacaccacg 
ggtgaacaag 
tatgatattc 
gatgcggcac 
gctgagcgga 
ccccactaag 
gggcatcatc 
ctccatgacc 
cgggtacttc 
ggaggcgttc 
gcctctggac 



agaaagtgga 
tcaacagcga 
aagcccctca 
acatggatat 
ccgcagcacc 
tactacgccg 
acctacctgg 
ggcaatgcca 
tccgggtcgt 
gacaagcact 
atgggcacct 
tgcgaggagt 
aactcaacgc 
cgcaacacgc 
atcgacatcc 



acttcagccc 
gtattttgca 
agtatccgat 
cgccggcact 
agcagttcgt 
tcatctcgag 
ccatcttccg 
tgctgatgca 
cgaaaacatt 
taatcaaaat 
tcatcctctg 
gccaagtgcc 
taaacccgct 
tgctctgcct 
ggcgctccag 



cttcctgtgc 
tttatgctgc 
tagcatgacg 
gctctccttc 
catccagaat 
ctccatatcg 
ggaagccaat 
ccgaccatcc 
gacgctgcac 
gaagcgggag 
ctggctgcct 
ggacatagtc 
gatctacgcg 
gttctgcaat 
cctgcgctac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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gaccagcggg cgaagagcgt ctactcggag agctacctta actcgacaac gccctcgcat 960 

cgccgccagt ctcagatgca gcagcagcgg ctggcggcgg gcggatcccg cctgggcgga 1020 

caattggcag ctgccgccaa ggatggcagg gagtcgaagg atgcaaagga tacgggcaag 1080 

gatgcgggca agggcaaatc caatgtcgat tgcttggcgg gcgacgatgt gcaggagatc 1140 

cagattgaaa tacccatgga gtacatcaac aaatggaaca agaacaacaa tgccgccgcc 1200 

tcgacggcct cgagtcatgt gtaa 1224 



<210> 186 
<211> 407 
<212> PRT 
<213> Drosophila 



<400> 186 



Met 


Asp 


Ser 


He 


He 


Arg 


Tyr 


Asn 


Asp 


Val 


Arg 


Lys 


Trp 


Asn 


Phe 


Ser 


1 








5 










10 










15 




Pro 


Phe 


Leu 


Cys 


Asp 


Leu 


Trp 


Asn 


Ser 


Leu 


Asp 


Val 


Tyr 


Phe 


Ser 


Thr 








20 










25 










30 






Ala 


Ser 


He 


Leu 


His 


Leu 


Cys 


Cys 


He 


Ser 


Val 


Asp 


Arg 


Tyr 


Tyr 


Ala 






35 










40 










45 








He 


Val 


Lys 


Pro 


Leu 


Lys 


Tyr 


Pro 


He 


Ser 


Met 


Thr 


Lys 


Arg 


Val 


Val 




50 










55 










60 










Gly 


He 


Met 


Leu 


Leu 


Asn 


Thr 


Trp 


He 


Ser 


Pro 


Ala 


Leu 


Leu 


Ser 


Phe 


65 










70 










75 










80 


Leu 


Pro 


He 


Phe 


He 


Gly 


Trp 


Tyr 


Thr 


Thr 


Pro 


Gin 


His 


Gin 


Gin 


Phe 










85 










90 










95 




Val 


He 


Gin 


Asn 


Pro 


Thr 


Gin 


Cys 


Ser 


Phe 


Val 


Val 


Asn 


Lys 


Tyr 


Tyr 








100 










105 










110 






Ala 


Val 


He 


Ser 


Ser 


Ser 


He 


Ser 


Phe 


Trp 


He 


Pro 


Cys 


Thr 


He 


Met 






115 










120 










125 








He 


Phe 


Thr 


Tyr 


Leu 


Ala 


He 


Phe 


Arg 


Glu 


Ala 


Asn 


Arg 


Gin 


Glu 


Lys 




130 










135 










140 










Gin 


Leu 


Met 


Met 


Arg 


His 


Gly 


Asn 


Ala 


Met 


Leu 


Met 


His 


Arg 


Pro 


Ser 


145 










150 










155 










160 


Met 


Gin 


Pro 


Ser 


Gly 


Glu 


Ala 


Leu 


Ser 


Gly 


Ser 


Gly 


Ser 


Ser 


Lys 


Thr 










165 










170 










175 




Leu 


Thr 


Leu 


His 


Glu 


Val 


Glu 


Gin 


Glu 


His 


Thr 


Pro 


Thr 


Lys 


Asp 


Lys 








180 










185 










190 






His 


Leu 


He 


Lys 


Met 


Lys 


Arg 


Glu 


His 


Lys 


Ala 


Ala 


Arg 


Thr 


Leu 


Gly 






195 










200 










205 








He 


He 


Met 


Gly 


Thr 


Phe 


He 


Leu 


Cys 


Trp 


Leu 


Pro 


Phe 


Phe 


Leu 


Trp 




210 










215 










220 










Tyr 


Thr 


Leu 


Ser 


Met 


Thr 


Cys 


Glu 


Glu 


Cys 


Gin 


Val 


Pro 


Asp 


He 


Val 


225 










230 










235 










240 


Val 


Ser 


He 


Leu 


Phe 


Trp 


He 


Gly 


Tyr 


Phe 


Asn 


Ser 


Thr 


Leu 


Asn 


Pro 










245 










250 










255 




Leu 


He 


Tyr 


Ala 


Tyr 


Phe 


Asn 


Arg 


Asp 


Phe 


Arg 


Glu 


Ala 


Phe 


Arg 


Asn 








260 










265 










270 






Thr 


Leu 


Leu 


Cys 


Leu 


Phe 


Cys 


Asn 


Trp 


Trp 


Lys 


Asp 


Arg 


His 


Leu 


Pro 






275 










280 










285 








Leu 


Asp 


He 


Asp 


He 


Arg 


Arg 


Ser 


Ser 


Leu 


Arg 


Tyr 


Asp 


Gin 


Arg 


Ala 




290 










295 










300 










Lys 


Ser 


Val 


Tyr 


Ser 


Glu 


Ser 


Tyr 


Leu 


Asn 


Ser 


Thr 


Thr 


Pro 


Ser 


His 


305 










310 










315 










320 


Arg 


Arg 


Gin 


Ser 


Gin 


Met 


Gin 


Gin 


Gin 


Arg 


Leu 


Ala 


Ala 


Gly 


Gly 


Ser 








325 










330 










335 




Arg 


Leu 


Gly 


Gly 


Gin 


Leu 


Ala 


Ala 


Ala 


Ala 


Lys 


Asp 


Gly 


Arg 


Glu 


Ser 








340 










345 










350 






Lys 


Asp 


Ala 


Lys 


Asp 


Thr 


Gly 


Lys 


Asp 


Ala 


Gly 


Lys 


Gly 


Lys 


Ser 


Asn 






355 










360 










365 








Val 


Asp 


Cys 


Leu 


Ala 


Gly 


Asp 


Asp 


Val 


Gin 


Glu 


He 


Gin 


He 


Glu 


He 
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370 375 380 

Pro Met Glu Tyr lie Asn Lys Trp Asn Lys Asn Asn Asn Ala Ala Ala 
385 390 395 400 

Ser Thr Ala Ser Ser His Val 
405 



<210> 187 
<211> 11847 
<212> DNA 
<213> Drosophila 



<400> 187 

gtgaatcgtt 

tcgcagttca 

tcagctacac 

ttgtaatttt 

gtatgatact 

agttctatct 

ttccgctcag 

ggatcgtatt 

tggtgtacat 

ctacgcaaag 

tacgtatatc 

ggcagcattg 

ccctatcgtt 

agcttcaagg 

ttgttacttc 

cgattcactc 

acgaagcagg 

acggatttcg 

gaaaatcgcg 

tccagaataa 

tgatgacata 

gaataagaag 

cgtcactcac 

cagggtgcgg 

tgtggggcag 

aattcgaatc 

aatcgacagt 

tttttaagta 

tgttttatgg 

aggtgtttgc 

attgaacttc 

ctaaactaat 

gtttatgttc 

gtgtggtatc 

tatcgatggc 

gtcattaaag 

tgtgcgggaa 

ttatcacaaa 

aaacataaac 

caggaagcaa 

ccccatccag 

ggtgaaatca 

gtgcaaacgg 

ggattcggag 

ccggattcag 

ggcacttctg 

ttcggtttca 



tttgcagcga 
ttctttttac 
aattctcttc 
caataactct 
tagagctaga 
gtaacaacat 
gatttgcaat 
aatactccaa 
aattcttaga 
agtcgctcga 
ttctgtttag 
tccctgcagc 
gcttaaacga 
attatgctag 
tcttcacttc 
aagtattaaa 
aatgtggccg 
tttctccaga 
attaaagtag 
caccaccacg 
cctggaagga 
gccaacaatg 
cacagaaaga 
gcggccttgt 
atagagagtt 
gattcgtttc 
tttctggttc 
gtctataata 
tttattgctt 
ttaagcttta 
aatcaattaa 
cttaaaaatt 
gttttattgt 
cttgtttaac 
atttcaaacc 
tgctggaaag 
aagcaataaa 
gacatacaat 
atcactggcg 
cggaccatcg 
tttgccacct 
taatgcaaag 
aagcaattat 
gcagattagg 
ttgcctaagc 
tgggtatttg 
tttcacggtt 



atggcgaaat 
tcattggtaa 
cattcaatat 
gttatgtaga 
ggtaacatta 
taaaaagtgt 
cctaatatac 
aaatcgatgg 
tgattcattt 
tttctaattg 
ttggacaaac 
cgcggcttgg 
aattgtacac 
ctatatcctg 
agctggtgtt 
tatcacttat 
ggatgcagac 
cagggcagca 
gcgtatataa 
agcacatcgg 
ggagtgagaa 
gattacggat 
agggcagcca 
gttcggcctt 
tagaagggat 
gatccgccag 
atttctgtct 
tgcttaatgt 
ctaggatgaa 
atcggcgata 
aatacagaaa 
attaaatatt 
gagagaaaga 
tttgacatta 
ccacaccgcc 
gccaacatgt 
aacgaggcaa 
cccccgcatc 
gccaggagga 
cccttgttca 
ggatcccttg 
gctgtgtgga 
gtacagcatg 
tggaggtgga 
tggttagctt 
ccactaaagc 
gatttaatga 



ttcctacagc 
gattgtcttt 
tttgtccata 
tttggtttgc 
tgggattgca 
gtttaatatg 
gatgatgaca 
aaaaagtgtt 
gacaacattt 
gctttcgctc 
aacacttata 
cgggcttaga 
gtatacataa 
gcacaaatgc 
ttccctcctt 
aggggcattg 
gctatccggt 
cacactccag 
gcggatttag 
gacatgggca 
tcacatgaat 
ggcggtacca 
gcagagcaga 
ccatcctttg 
atgaaagctg 
tctgtttttt 
gtctgcgaaa 
ggtatttgtt 
attgtttata 
cccttttatg 
atttcaaaca 
ttctctgtgt 
ttatgccggc 
gtaatacttt 
ccgctccctg 
gtgcagtatc 
acaattgaaa 
catccagtcc 
aaaacaaatc 
cccattgccc 
atggacgcca 
gggaggcgta 
agagcagcga 
gacagcatga 
ggcatccgga 
tgggggctac 
tacttaggtt 



ttgccaattg 
ctgcatgaaa 
cataatttcg 
tagttttcta 
ctttttccat 
aaataaaagc 
catgtgaata 
tcgccttcga 
cacttgaagt 
tgccatatat 
tatggatagt 
acacaacttt 
acacacatcg 
cactggtatg 
gggcacatat 
ttccgcatcc 
ctagacgtag 
cgtattgcgg 
cgtggagttg 
ggccggaccg 
atgagtcacc 
catccgtacg 
aagacgccca 
gccggtcgca 
gcaattggct 
ttcttttgtt 
gcactcgaaa 
gtctacctgt 
cagctaatac 
ttaactatgt 
aacaaatttt 
gtaaattgcc 
atgttgcgta 
tgctgttata 
cgtccacctt 
ttggtgctcg 
atatttacac 
agtgcgcccg 
agacagacaa 
atcgcccatt 
ctcggtgatg 
ataaagcaaa 
aaggtgaagg 
ggtttcgatg 
taaggaggtg 
aaaaatttga 
aaccaccgtt 



ttttggttgt 
attcataacg 
attgcgtaat 
ggttacatat 
gatttaggct 
agggagttca 
atccaatttt 
caagccgaag 
tggaactttt 
gtatagttat 
atagttccgg 
aatttaaatt 
agtattgctt 
gccgttgctc 
acatgtgcga 
acgagggtgg 
taggcgttgt 
aatgcctcgc 
aagtaaccga 
cagagcgatg 
agaaaaagaa 
gcctatctac 
tgatgatgcc 
cggaaccgcc 
gattaagttc 
gggtcatagc 
aatgggtttc 
tggatgtgtt 
tggaaagtat 
atgtaaatta 
aaagaggtaa 
aagaaatgcg 
tacgccatgt 
tgttgttggc 
gcaagtgact 
ggcgtgtggg 
aggacacttg 
ccccaaacat 
cagtgagcaa 
gcccatgttc 
gccacagcca 
gcaaacgcac 
attcggatac 
gtgtaacact 
ccaaaaaact 
ggtagttctt 
gaaatgattg 



60 
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240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 
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ggattgcaga 
tttcaggtgt 
agtaatattg 
tatgtttggt 
attttctgat 
ttagtgaatt 
aatcgaatga 
cagaaagaca 
aggagaggag 
ttgaagcggc 
ttgtgccaga 
tgctcgtgaa 
ttggcggttg 
agcggacact 
tcaatgatct 
gaggatttat 
tcgattgtca 
tagaagtagt 
catacaacac 
agtaaacaat 
catgcagcca 
tcattatcgt 
gtgcaactaa 
gtttcattcg 
ttaaaacact 
aattcacagt 
attttaagat 
attactcctc 
acacttgaac 
aaattgtctt 
aatggatgac 
actgtcgctg 
ctgtctcttc 
gcggccaccc 
atcgccggcg 
gcatccctca 
ggactgtgag 
caccacatcc 
cggcggcggc 
agagtccatg 
ggacatcaga 
agccggtcgg 
ggcaatgagc 
ggccggctgc 
ccgaatcctc 
gaactgctgg 
gtccggcttg 
gcgttccttg 
ccggaagaag 
ctcggtgctc 
cccgggcact 
atcgctggag 
gtccgtttca 
cgtctggcgc 
ctcggccagc 
accaccaaag 
ggtcaggctg 
cgaagggatc 
caactgattg 
gtgggtgacc 



aattcaggga 
attcaaaagc 
attgtttttt 
ctaatgcagt 
gattatatat 
ctgagtgtga 
ttcagatgga 
gagatagaga 
gggagatacg 
gcttccggtt 
atcgagtgga 
tactacgttt 
cggttgcgtt 
tgggcagcgg 
ccgcctcctc 
tggaaaattc 
atttggccat 
ttgtttgttt 
acaacgaaat 
gtacacaggg 
tgcaccggca 
tggctccctg 
ctaaatgcta 
aagcatttga 
gtatgatttt 
tttcttcaag 
caaaattcta 
aaaaaaacaa 
ttttcgtgga 
aaaagagcta 
tactcacttg 
ttctcctgcg 
cgcacagttt 
actcctccaa 
ctatctttgc 
accggcgaca 
gccatttggg 
aaggacacag 
gataggaagt 
gcacctccgt 
tagttggcgc 
cccagcggaa 
aggccactcg 
tggggcgact 
tcgggcggtg 
atcacactgg 
ttgggcgtgg 
ccgccaccgc 
ctggatccct 
gagcgtttcg 
ggctgctccg 
gcgaatgcct 
ctgagggcag 
ttaaagtaca 
ggcaggtggg 
acccggtacg 
agctgctgtt 
atcttgccct 
tgggagggct 
ttctcgtacc 



tatctattct 
tttcaatttt 
gactgacagg 
ggcataaaat 
gtattgatat 
aatgtattat 
taatgggtag 
gagaagcggg 
acaagactca 
cttgctctgc 
actggacgag 
actactagtg 
tgcggtggtg 
actgggcacc 
ctcatccaca 
gtgctctgct 
ggcatttttg 
ggttggacat 
ggcgtggaag 
tatttcacag 
aacgcaaagg 
tctctggaag 
tacattatat 
gtattacttt 
tcagaaattc 
gtaagaacta 
gtaagaccag 
ggacaaaaga 
aaactgacaa 
cttaaatcaa 
cggaactggt 
agaagttgat 
ggatgccatc 
cgccaccagc 
gcttgaacag 
gcgtggaatc 
gatttggctg 
tggatgtgtc 
acacattgat 
tcggcacgga 
acacctggtc 
agctcagcga 
gttctggcaa 
ccccaaagca 
ttttcagctc 
ctccactctg 
agtccgtatc 
atccgttgta 
cggagttggc 
tcttcgacag 
gcaacgcccg 
tacgcccaca 
cgtactttcc 
gatagttgtc 
tgtcgtacag 
cagtggactc 
gcgggccatt 
ccgtccatat 
gctgttgctg 
gctcctgctc 



agtatgattt 
aaacataata 
atatttcata 
acgaagtgaa 
agagaaacta 
aatataatgc 
caatgagata 
agagaaagag 
ccggttgcac 
tgccaacatg 
gaggcgggcc 
gttactgtgg 
ccggtggtgc 
gcggcgaagg 
aagatgcaga 
tgcagttgat 
gggttttatt 
aatttggtga 
agagatagat 
gatacgaaag 
atcgggatcg 
taggtcgtcc 
tatatatata 
tccaatgagg 
gcaccgcttt 
tatttaaata 
attccttatg 
atgtgcgaaa 
aaacagaacg 
gcacgagcta 
ggaaacgcgg 
gacattcacg 
gtagcctgcc 
ccctccattg 
tggcgacgtg 
gagtatgatc 
cggcactgga 
actgggcgta 
ggacgccgac 
aatggccggc 
actgagcgtg 
gcgaaagatc 
ttctggcggc 
atcggccagc 
actgaagtca 
ctcgctgagc 
gctgtgctgc 
gctgtgcgac 
ctgatgtcgc 
cagggacaga 
cgattcacta 
ggccgctgcg 
tccgcctggg 
ctctatcagg 
gtcgttgccc 
cggttccggt 
gggatgagaa 
ctgcggattg 
cagtgccatc 
caagtccctc 



ttggtaagaa 
ggtttaaaca 
tgaaataaat 
tgggacttga 
taactattgg 
acaagttctg 
ggtgctgaag 
cgaaacgtag 
cacagtgact 
ctgtgaccgc 
tctggccaag 
tgtcgattgt 
tgctgatgtc 
tgatgccgcg 
tgccgccgcg 
tgttgcattt 
ggttggattt 
tttcgattta 
ataaagagat 
gacttcaact 
aaaagggaat 
ctttctgtct 
tagtaagcta 
gtttataggt 
tgaaaggaac 
tggtaaaagt 
ttaaacccga 
aagttaagtc 
aatttctctg 
tacagtctac 
cccatcggga 
tcgtaggtca 
aggatgctgc 
ccggaaacgc 
acatcccggt 
tccggcttgg 
acgggcatag 
taaccaggtg 
tgcgcgttgg 
ggagccggtg 
ctcaggcgct 
tcggccagtt 
gataacgatg 
tgcgccaact 
atcagctgta 
acattgatgt 
cgctggcgtc 
aatttccggt 
ttcttttcca 
atgttgaagc 
atggtgcgca 
cccgccttca 
ccactgataa 
cgcttcagct 
tcgctctcgt 
tcgctgttcc 
tgctccttat 
aacactggcg 
atcgagggag 
agatcgatgc 



cgttaactag 
tatacggaac 
agatttttga 
tttgattaag 
tgttgtgaaa 
gactgtggga 
ctgagacaag 
agactggggg 
gcatccggtt 
cgccgttcgc 
atctggagtt 
gctgtcggcg 
ggcgccgcag 
gggccgctgc 
tcggatcagc 
tcgtacgatt 
ggttttggag 
catatataca 
agagacaagt 
tctgagatca 
ggattcagat 
tcgtttgcag 
gttaactaaa 
tctatacatt 
ttaaatacac 
tcgtatcttt 
attcaggcca 
aaacattgaa 
gggcagatat 
acaatgacca 
tgtagctgcg 
ccacggacgc 
agcggccttg 
tcacgtccgc 
gctcatcgcc 
gcgatatgtt 
gcaggctggt 
cggcggcatg 
aagaggcgga 
tgttcggtgg 
gtggagcjacc 
ccgagctgct 
gtggcggcgt 
cgtcgtcgct 
ccccgtcgct 
ccagcaggat 
tctcaagttt 
tccgggagcg 
gcgtaaagca 
ccttctcctt 
ggaactccga 
gcctgatgaa 
tggtgccgcc 
ccttgttgtc 
cgctgttgag 
cactgccgct 
cgagctcctt 
ccggtggctg 
cactgtccgt 
tgctgcggct 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
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gaggcggcgt 
ctcgtcggtg 
catggtggcg 
gcttccctct 
ctttagagca 
tttgtatatt 
ataaaaaata 
cgagtactaa 
ttttaggcat 
tgtttattga 
tttcaataaa 
acctccaagt 
cagtcactgg 
ctgttaagga 
ctggaaaacc 
catcgaccat 
atcacaatgc 
accacaaacg 
taccagccca 
agcatgaagc 
gcgtagtatc 
aggtcctccg 
cttttggctt 
caggttgcca 
gaaggataaa 
tctttttaat 
tatattgtac 
tttggaacat 
tctaattttg 
tcgaaaggtg 
ccatcgacca 
caagtgggga 
tggaatatgg 
cgcattcatc 
ttaattgttt 
gcatctgggc 
ccagcacaat 
cacctgtcca 
aggctgttgt 
acggaggcgt 
acaacaaagt 
tggcgtggtt 
gcaaaaccac 
ctaagcgctt 
tgttccggtg 
tgtcataagt 
tgaagttatc 
aaagcacttt 
gtgtagtttt 
attgtagtaa 
taatttcgaa 
taatatctca 
tgagcacttt 
gcagaaggca 
tggccggggg 
actctggggc 
gtgtgccgca 
catttatcgc 
attgtcctgc 
caaaagtaat 



ggcggcggcg 
gccggctgca 
catcaattac 
agcacctaat 
ggatgtcaat 
tatttaggat 
aataatgttt 
agcatttatc 
ctttggatta 
atgatttatt 
tgacatctgt 
tactaagcgc 
gatgcagata 
ggatgttgga 
acagagcaca 
cgaccatccg 
cgggtatcca 
agcactggtc 
gaaagatggg 
agaccgtttt 
tgttaaaaag 
cacaaaggtg 
actaattgat 
tgcacagaaa 
tgctcttaac 
tatcaatata 
ctcattattt 
tacagattta 
ataggttctg 
agtgaaacac 
tccctccatt 
atggcgccaa 
gtggaatatg 
atacgggcgg 
cttcttggca 
aagattaatg 
tgactttaaa 
ctgatatgca 
acacgttgca 
tgaatgtcat 
aattggttat 
ctgggcggga 
gccaaacgta 
tggttggctc 
ttcaggagtt 
cactcgactc 
atcttgtctt 
tgtgcttcca 
acaggggctg 
tgaagctcaa 
agatcctttt 
acgcaattct 
aagctccttt 
taaacgttgc 
cgctggcggt 
gtcctgtcat 
tttaacttcg 
gatggcgaaa 
gacccttgcg 
tggatataga 



atggcactct 
gcatgtcctg 
gcacagccac 
ctaatcgctc 
gttgcccgtt 
actatctgga 
tattcaacta 
ggtttaataa 
atgtgtattt 
gaatttatat 
atcttttcgg 
actgtgcgtc 
gctcaggctg 
gctggtgcgg 
gatcagcaat 
ggtaagaact 
gaagctcacc 
cggatgcagg 
cgtgaaggat 
gtgtgtcata 
aagagggggt 
ccaggtggct 
tatcgctctt 
tcgatttatt 
cgaattcatt 
ttaataaatg 
tatcgaaact 
ttagtttagc 
gttgaatttc 
aaattaatta 
gttgctggtc 
ttaacggctt 
ggcaacgccc 
gcctaatgtt 
attcgccggg 
gtatttggcc 
ccgcttaata 
gcacaggtgc 
catgaacggt 
cgcacagagg 
caccctgcga 
tttgaccagc 
atggcatatg 
cttccgttgg 
tggaggcaat 
accaggcagc 
cggggcgtat 
agtcatttat 
acaacattgt 
ttaaatcaag 
atgtcaagcc 
atatggcgaa 
agcggtataa 
gcaacattaa 
ggcagagcgc 
cgcgacgctg 
ttttgttctt 
aggattcgcc 
gcaatgaccc 
cttaccgcag 



cagttcatcg 
tgggcaaagg 
aagggaaaag 
cacatccgtt 
tgtatacatg 
tgtatcagca 
ctaagagatc 
aaataaaaca 
tttcattttg 
caataaaatg 
ttcagtaaca 
tcctcccggg 
gtgggctgct 
ctgagggcct 
taagcaaact 
cacttaaaga 
gaactggaga 
gaaatctccc 
atgagggcgg 
ttcaatggat 
catctttatt 
ttcagccatt 
tggttggctg 
aaccttttaa 
atcggaatcc 
caattgaaat 
taatgctcct 
caaatttccg 
cctttgtttc 
aaacagacaa 
agcccgaaaa 
gacaggatgg 
acgtttcggc 
tgcttttgat 
ctttggtatt 
aaactaaacg 
tatagataga 
aatatgctgg 
ccgaacatcc 
gccaccagca 
acgaaacaaa 
ttctccccac 
gcatgcatca 
ccaagcagtg 
aggtcaccgg 
tgccaccttc 
gtggtttaag 
tacatattgc 
ctaaaacatg 
attattaatc 
gttttttctt 
cgttctatcg 
taaagacttt 
acggatgtcg 
tcaatggtat 
acatggcagc 
tattattcca 
ctgccgcgtc 
tcaattaaac 
ctttcgattg 



cactcgtccc 
acaatcactc 
gaaaaactct 
cattgaggat 
aactgcattg 
actttaccaa 
ttcagaatgg 
tttttaagta 
aacatctttg 
cagcaaaata 
tttagtaacc 
cggatgtggc 
gggtgtggcc 
tgcgttgacg 
catccactac 
tgcgccagta 
tgagggcgta 
gcaggtgctc 
gcaggatcca 
actccagtgg 
aatggaaatg 
tcctccttct 
aataattgtg 
agatcatgtt 
gtgaaaataa 
gaaataatta 
tttaaagagt 
ttagtggcca 
ttggtatgtt 
aaggcccatg 
tcgcagctgt 
agaaggttgt 
cagctgcaaa 
gtgcgcgcct 
gaaagtgatt 
gctttcttca 
gagcgacata 
ccgtggaaaa 
actttccgcc 
tgtcggccat 
agcccaggaa 
atctaaccac 
tatgtgcaca 
caccactgct 
tatgcagcaa 
agctgcaagc 
gagctgttta 
gacatctatt 
ttattaacgt 
tgtgcattta 
ttatgcactg 
taaggtagta 
aatcaacatc 
tgggtgtacg 
gctacactta 
gatgggcttg 
cttgtcgctc 
actgtttgtt 
tcagttacag 
cgctgcactg 



tatcgtcgtc 
aaagggaagt 
caagcggagt 
gggcaaaaag 
ctttaaagtc 
attttactca 
caagcaaatg 
tctctaggta 
aaatgtttat 
ctcttttggc 
ttccaaactc 
attcagatca 
catgtggacg 
aatcgcctcc 
gaccagcgac 
catcaccagc 
ggccttgttg 
ctccgtcgtg 
cacattggcg 
acgcacaatg 
aattgcggac 
cctcctcctc 
tttcggtact 
gcatttaaat 
ggtggtgtat 
ttaatccacg 
aatgattatg 
tcattaattc 
ttgtggcccg 
actccagtgt 
tggtgccttc 
gggagttggg 
tcatttgggc 
ttcatagtaa 
tggttgccag 
gtaccatagc 
aatgggcact 
gtaaacatcc 
ggacagttcc 
cgccagcgac 
tttgtgaata 
ttgtcacagc 
tgccgctgta 
ttccatttcc 
tttccttggc 
tactggcttg 
tgcgaagatg 
acgttggaaa 
atattgtaat 
gaagggctgt 
ctttattttg 
aatgataatc 
tacccttgct 
tgtgggggcg 
tttgcactgc 
ggatcggctt 
atttaaatta 
tgccatttcc 
gtggtgaaaa 
aaatgatgac 



6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840. 

9900 

9960 

10020 
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caatgcattg 
cagcagcgat 
actgccattg 
cgacaagtag 
tagcgaagtc 
cgtcgtagtt 
gtggctggcg 
tgttatgggt 
gtcgatttcg 
gctgtacttg 
gcaatctgca 
tcctctgcga 
atcctccata 
gcgcttcgat 
aataaataat 
gtaaaatggg 
tcaacagggg 
tcttctgttt 
ttgctacact 
atgcatctgc 
gcagagtgga 
tatcagtggg 
cctttttgtc 
gaattttttc 
aacacatcat 
tgaattgaag 
gcacttattc 
taatattttt 
ttgtccttga 
agattggcaa 
cttggcttat 



ccgaggacag 
agatccagcc 
gccgttatat 
gatgaaaagg 
aacgaggcat 
ttggctgctg 
ttggtggctg 
aatgtcattg 
cagcccaatt 
tcgtttatct 
atgcaaacaa 
ttccagtatt 
ggacgctttt 
ttggcactca 
ttatgaaata 
aattcgaacg 
gcataaaccc 
ctccagtgcc 
agattaaaat 
atttctatat 
atgtggaata 
aaagttgttt 
ccggtacttg 
ttttttatta 
cagggaaaag 
ctgaatccgt 
tgtgcactct 
aaggatttat 
gctgtcgatt 
cttaatgttt 
acatttcaca 



cggccaggat 
agctctcctc 
ttgggccgac 
gcgaggagta 
ccacaaattg 
ctgtcgtgga 
atgttgctgc 
tcgtggccag 
gagtcgcagc 
ttttcccggc 
ggctgccgaa 
tgctcacagc 
cccgtttgag 
agcgcgcaac 
atcggcgagt 
gttgcgccgc 
tattgcctat 
ccgtcaaatg 
aacagaaata 
ccccagcgag 
tggacaagtt 
cagtagtgcc 
taagtaaaac 
ttttaaactt 
ctgaaagcgt 
ccttggcact 
agatgccctg 
gtacttatac 
cgacttgttt 
taatgctcgg 
agccatt 



aattgacgag 
cagctccgcc 
tgtcgagagg 
ggaggcatct 
cgatgttata 
aatgtggcct 
tgcttgtgat 
aaggtcagcc 
tcattcaagt 
tgctgtgcgt 
gtttgccgtc 
tccgactgaa 
ttgctggaaa 
agtcggcgac 
gtgaggctgt 
ccggggtgtt 
ttatgtgccg 
tccgttcgtt 
tattgcgtat 
acagcggtgc 
gttgctttaa 
tcacgcctgg 
aaaagactct 
tttgcgactc 
gcttacactt 
cagacaaatc 
ccctcattcc 
agataacctt 
actcagacca 
acttaaacca 



aagatgaatc 
tggttatcca 
agctcaaacg 
gacgatgtgg 
ggcaaactgc 
gttaatgttg 
gttgctgctc 
acgttaaccc 
tgctccgcca 
aatttaaaac 
atcctggttg 
ggtctctgct 
ttggcaaaaa 
acgggaactg 
gaaactgcat 
tttatgtaca 
cagtaacaca 
aaaagtgcgc 
acgcagtgtg 
tgtcaagtgc 
aagtgagatt 
gcgcagtgaa 
tcataatctg 
aatttggccg 
tactttctcc 
gctggctaca 
aatcttcaga 
gccgaattga 
tccggtcaca 
catcagtgcg 



ctttgagcag 
ccgcaatgtc 
tggaatccga 
ctgttaatga 
tcgtcggcgt 
caggttgeag 
ctgctgcggc 
ctgacatctc 
atgctttttc 
tcgagtaatt 
ccaatgttgc 
ccatttccca 
gtcgagactc 
caaatgaata 
ggcaaaattg 
tacgcaatat 
taaatgggat 
cacttcaact 
tgcactcggc 
tgacacttgt 
aaagcaagat 
aactgcaaat 
tggtaatctc 
taaaaatgct 
cccgcaacat 
cgttttcatt 
gttttgaatc 
ttgcattgcg 
tcctcctgct 
ctctatcatc 



10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11847 



<210> 188 
<211> 2092 
<212> DNA 
<213> Drosophila 



<400> 188 

ttcccgtgtc 

ttgccaattt 

agagaccttc 

ccaggatgac 

ttaaattacg 

tgaccttctg 

acaagcagca 

ccacatttcc 

aacatcgcaa 

tgcctcctac 

ctcgacagtc 

ggagctggag 

gtcaattatc 

tcgaaagctg 

ggtggccctc 

gttcggaccg 

catattgcac 

gtatccattg 

gcccgccctc 

gcgggagatt 



gccgactgtt 
ccagcaactc 
agtcggagct 
ggcaaacttc 
cacagcagcc 
gccacgacaa 
gcaacatcag 
acgacagcag 
tttgtggatg 
tcctcgccct 
ggcccaaata 
gagagctggc 
ctggccgctg 
cgggtgataa 
tgtgcgatga 
ttcatgtgca 
ctgtgctgca 
aatatgacac 
atatccttca 
tccctgcatc 



gcgcgcttga 
aaacgggaaa 
gtgagcaaat 
ggcagccttg 
gggaaaaaga 
tgacattacc 
ccaccaacgc 
cagccaaaac 
cctcgttgac 
tttcatccta 
taacggccaa 
tggatctatc 
tcctcggcaa 
ccaattactt 
cattcaacgc 
acgtgtacaa 
tatcagtgga 
acaaaacggt 
cgcccatctt 
cggaccagtg 



gtgccaaatc 
agcgtcctat 
actggaatcg 
tttgcattgc 
taaacgacaa 
cataacagcc 
cagccacctg 
tacgacgacg 
ttcgctatca 
cttgtcgtcg 
tggcagtgac 
gctgctgctg 
tgcattggtc 
tgttgtgtcg 
ctccgtggaa 
cagcctggat 
cagatactac 
ctgcttcatg 
tctgggctgg 
ctcgtttgtg 



gaagcgcgag 
ggaggattgg 
cagaggagca 
agattgcaat 
atgtcagggg 
gcagcaggag 
caacctgcaa 
ccgacgagca 
ttaacagcca 
gattccacgt 
attgcggtgg 
ctcaaaggat 
atcatttcag 
ctggcgatgg 
ctgtccggcg 
gtttactttt 
gccattgtgc 
ctcgccaatg 
tacacgacgg 
gtcaacaagg 



tctcgacttt 
gaaatggagc 
acattggcaa 
tactcgagtt 
ttaacgtggc 
cagcaacatc 
cattaacagg 
gtttgcctat 
catcgtcaga 
ttgagctcct 
ataaccaggc 
tcatcttctc 
tgcagcgcaa 
ccgacatgct 
gaaagtggat 
ccacggccag 
gtccactgga 
tgtggatcct 
aggagcacct 
cctacgccct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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catctccagt 
catctttaag 
ccttaacagc 
cagtgactgt 
cttggagcgg 
gctctgcggc 
caacgccgac 
gcaaactcca 
caagcgaacg 
caaaaccgga 
gaaggccgaa 
ctggctgccc 
tcccgatgtg 
gcttatatac 
tgtgctgccc 



tcggtgagct 
gaggcgattc 
gtccacatgg 
gatctgaatg 
ccccgcggca 
gccgacatca 
agcacaatcg 
gatcttggcc 
gcggcggtca 
tgcagtcact 
cacaaggccg 
ttctttctgt 
ctcgtggtgg 
gcctacttta 
tgtctggaga 



tctggatacc 
gtcaacgcaa 
gccacaccca 
ccacatccgc 
tcaccttcgc 
gcagcaccac 
acaccacagt 
agaggcccgc 
cagcatgttg 
gtggtgcaac 
cccgcaccct 
ggtatgtcat 
tgttattctg 
atcgcgattt 
aacgaaatcc 



cggcattgtg 
ggccctcagc 
gcagcccacc 
ccgggaggag 
cgcggtgccc 
cggcaccacc 
aaccactagt 
ctcctcgtcc 
gcagcagagc 
cggcggttcc 
gggcatcatc 
cacatcgctc 
gatcggttac 
tcgcgaggca 
gtacaacgcc 



atgctggtga 
cgcaccagct 
agcctgagct 
acgcacagtg 
agtccgctgc 
gcaaacgcaa 
agtaaacgta 
agttccactc 
aagaaccgga 
gtgcgaccgg 
atgggcgtct 
tgcggtccgg 
ttcaactcca 
ttccgcaata 
tactacgtct 



tgtactggcg 
ccaacatcct 
atctgcatcc 
cgcttagtaa 
ccaagtgtcc 
ccgcaaccgc 
gtattcacga 
gattctggca 
agcgccgctt 
ccaaaggatg 
ttctgctctg 
cctgcccatg 
cgctaaatcc 
cgctggagtg 
ag 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2092 



<210> 189 
<211> 603 
<212> PRT 
<213> Drosophila 

<400> 189 

Met Ser Gly Val Asn 

1 5 
Pro He Thr Ala Ala 
20 

Ser Ala Thr Asn Ala 
35 

He Ser Thr Thr Ala 
50 

Leu Pro He Thr Ser 
65 

Leu Thr Ala Thr Ser 
85 

Tyr Leu Ser Ser Asp 
100 

Asn He Thr Ala Asn 
115 

Leu Glu Glu Ser Trp 
130 

He Phe Ser Ser He 
145 

He He Ser Val Gin 
165 

Phe Val Val Ser Leu 
180 

Met Thr Phe Asn Ala 
195 

Gly Pro Phe Met Cys 
210 

Thr Ala Ser He Leu 
225 

Ala He Val Arg Pro 
245 

Val Cys Phe Met Leu 
260 

Phe Thr Pro He Phe 
275 

Glu He Ser Leu His 
290 



Val Ala Asp Leu 
Ala Gly 
Ser His 



Ala Ala 

55 
Gin Phe 
70 

Ser Asp 

Ser Thr 

Gly Ser 

Leu Asp 
135 
He Leu 
150 

Arg Asn 

Ala Met 

Ser Val 

Asn Val 
215 
His Leu 
230 

Leu Glu 



Ala Ala 

25 
Leu Gin 
40 

Lys Thr 



Val Asp 

Ala Ser 

Phe Glu 
105 
Asp He 
120 

Leu Ser 

Ala Ala 

Arg Lys 

Ala Asp 
185 
Glu Leu 
200 

Tyr Asn 
Cys Cys 
Tyr Pro 



Asn 


Val 


Trp 






265 


Gly 


Trp 


Tyr 




280 




Asp 


Gin 


Cys 


295 







Leu Ala 
10 

Thr Ser 

Pro Ala 

Thr Thr 

Ala Ser 

75 
Tyr Ser 
90 

Leu Leu 

Ala Val 

Leu Leu 

Val Leu 
155 
Leu Arg 
170 

Met Leu 

Ser Gly 

Ser Leu 

He Ser 
235 
Leu Asn 
250 

He Leu 
Thr Thr 
Ser Phe 



Thr Thr 

Gin Ala 

Thr Leu 
45 

Thr Pro 
60 

Leu Thr 

Ser Pro 

Ser Thr 

Asp Asn 
125 
Leu Leu 
140 

Gly Asn 

Val He 

Val Ala 

Gly Lys 
205 
Asp Val 
220 

Val Asp 

Met Thr 

Pro Ala 

Glu Glu 
285 
Val Val 
300 



Met Thr Leu 
15 

Ala Ala Thr 
30 

Thr Gly His 

Thr Ser Ser 

Ser Leu Ser 
80 

Phe Ser Ser 
95 

Val Gly Pro 
110 

Gin Ala Glu 

Lys Gly Phe 

Ala Leu Val 
160 

Thr Asn Tyr 

175 
Leu Cys Ala 
190 

Trp Met Phe 

Tyr Phe Ser 

Arg Tyr Tyr 
240 

His Lys Thr 

255 
Leu He Ser 
270 

His Leu Arg 
Asn Lys Ala 
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Tyr 


Ala 


Leu 


He 


305 








Met 


Leu 


Val 


Met 


Lys 


Ala 


Leu 


Ser 








340 


Met 


Gly 


His 


Thr 






355 




Asp 


Cys 


Asp 


Leu 




370 






Leu 


Ser 


Asn 


Leu 


385 








Ser 


Pro 


Leu 


Pro 


Thr 


Gly 


Thr 


Thr 








4 c. \J 


lie 


Asp 


Thr 


Thr 






435 




Thr 


Pro 


Asp 


Leu 




450 






Phe 


Trp 


His 


Lys 


465 








Lys 


Asn 


Arg 


Lys 


Thr 


Gly 


Gly 


Ser 










Ala 


Ala 


Arg 


Thr 






515 




Leu 


Pro 


Phe 


Phe 




530 






Cys 


Pro 


Cys 


Pro 


545 








Phe 


Asn 


Ser 


Thr 


Phe 


Arg 


Glu 


Ala 








580 


Glu 


Lys 


Arg 


Asn 






595 





Ser Ser Ser Val 
310 

Tyr Trp Arg He 
325 

Arg Thr Ser Ser 

Gin Gin Pro Thr 
360 

Asn Ala Thr Ser 
375 

Glu Arg Pro Arg 
390 

Lys Cys Pro Leu 
405 

Ala Asn Ala Thr 

Val Thr Thr Ser 
440 

Gly Gin Arg Pro 
455 

Arg Thr Ala Ala 
470 

Arg Arg Phe Lys 
485 

Val Arg Pro Ala 

Leu Gly He He 
520 

Leu Trp Tyr Val 
535 

Asp Val Leu Val 
550 

Leu Asn Pro Leu 
565 

Phe Arg Asn Thr 

Pro Tyr Asn Ala 
600 



Ser Phe Trp He 
315 

Phe Lys Glu Ala 
330 

Asn He Leu Leu 
345 

Ser Leu Ser Tyr 

Ala Arg Glu Glu 
380 

Gly He Thr Phe 
395 

Cys Gly Ala Asp 
410 

Ala Thr Ala Asn 
425 

Ser Lys Arg Ser 

Ala Ser Ser Ser 
4 60 

Val Thr Ala Cys 
475 

Thr Gly Cys Ser 
490 

Lys Gly Trp Lys 
505 

Met Gly Val Phe 

lie Thr Ser Leu 
540 

Val Val Leu Phe 
555 

lie Tyr Ala Tyr 
570 

Leu Glu Cys Val 
585 

Tyr Tyr Val 



Pro Gly He Val 
320 

He Arg Gin Arg 
335 

Asn Ser Val His 
350 

Leu His Pro Ser 
365 

Thr His Ser Ala 

Ala Ala Val Pro 
400 

He Ser Ser Thr 
415 

Ala Asp Ser Thr 
430 

lie His Glu Gin 
445 

Ser Ser Thr Arg 

Trp Gin Gin Ser 
480 

His Cys Gly Ala 
495 

Ala Glu His Lys 
510 

Leu Leu Cys Trp 
525 

Cys Gly Pro Ala 

Trp lie Gly Tyr 
560 

Phe Asn Arg Asp 
575 

Leu Pro Cys Leu 
590 



<210> 190 
<211> 6617 
<212> DNA 
<213> Drosophila 

<400> 190 

gttccgttac ttggtctttg acttcttacc ccagaatatg attcttggca gcaggagaaa 60 

ccagtacgac ttgttttggt cgtattatga gtgaaatatt accactcatg tcattttgaa 120 

cgtctttaaa gctcacaccc ttatggaccc attataagaa tatggcgtgg caaaatttcc 180 

catttttatc gaggctggaa aaacttttcc acacatgcaa tcatgtttcg tttcacttat 240 

catggctaaa acctagcaaa tagaaaaagt ttttttttat ccacaaacat atctgtattg 300 

agtgccagca atctgtattg ctttagctta agatgtttac tattaataga gccaagtggc 360 

aaatgacata cggcttacag agaaaataat ttccaattca attacattta cgtacatata 420 

ttccttccta caagcagaca ggattattga tctaaatata aataaatata taataattat 480 

tttaataata taatattaaa atgtagatta acagaatata ttttttttgt atgaaagcct 540 

ctagggtatt tgtgtgactt ttttctctct gcataagtga agatgtttac tattaataga 600 

gccaagaggc aagagaaaag tggcagcaat caagtgcttt tatttcggga ctgcatgtta 660 

aaggcgattt cctgatcgaa gactttggat acctgatgcc atccgtctga atgtgacgtc 720 

agcctggctg gggattcaat ttaaattcat ccttgagtct aatgactttt aatgttgttt 780 

gcttcacagg cgagtgcgca atttttgggc ctgtcgtcga ctgcttatcg gctatgtgcc 840 
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accgcagccg 
cgatttgctt 
tggagtggat 
caccatagcc 
gcccagtccc 
ggcgaatccc 
tgtcgcccat 
cggcacaaat 
aacagccctc 
cacctgcatt 
caacctggcg 
caccattgcg 
cctttggtat 
ctcggaaatg 
tatagcaatc 
aatatttatt 
attaacaaag 
ataagcgtca 
cacaataatt 
acaaccaaat 
taattaaagt 
ataaatgcgt 
ttgcattttt 
aaatgttctg 
gttttcattt 
tgaaacttaa 
cacaaattaa 
aaaagggtgg 
ctttcatgtc 
gtttgccatt 
agcatgaaaa 
tctattaaat 
aaatatttat 
gtgtaggtgg 
cgtttatccg 
tctttccgtc 
atttgccatt 
ggtctaccag 
tgacgcatca 
gcccatgtgt 
gtactgtacg 
cgaaggactt 
ggtaggtgag 
ccagaaacaa 
aaaattactg 
gccctcattg 
tgtttttttc 
tgtgtaaagc 
tcaaaagtaa 
gcttaaatat 
cgttataagc 
tttatttata 
ccaggcccaa 
aatttactgt 
tggcctgcca 
attaatttgg 
tagatagtgg 
gtgatatttt 
taagaccact 
aatacgttgc 



caaaggatac 
ttgtttttct 
agagtccaca 
aacgacagca 
aatcagagtc 
agcgagagcc 
ctgctgaaca 
gcgagcacaa 
accgtttgtt 
gttatatcgc 
gtttccgact 
tatgcgtaat 
ccggatatgt 
gccgccaatg 
tgtcatccat 
tacattatta 
tggatcatgc 
gtcggaaaca 
attcataact 
tcccatcgtt 
caggtccata 
tgtaaagata 
ggaatggtac 
accattttgc 
acgatgttgc 
aaaccttaca 
ttgttacgaa 
ttttctataa 
tctgaagttg 
taatagtatt 
aggtctgaaa 
ttggtatgaa 
gttgcactca 
tcactgcgtg 
aaacatcgac 
atttgtaggc 
tggctggcgg 
aacgaaggat 
tcgagtgtcc 
tcgccgtttc 
tgctgatcgg 
tcgatgcgaa 
ttaagttggc 
caagaacatc 
tcaacgtatt 
ccttccgccc 
cctcttttcc 
tcttaaatgg 
tcaagccggg 
ttgtaaaaag 
tgttaaattt 
taattagaaa 
ttttagatcc 
tgcttttaac 
aatggaaatt 
ccagcaagag 
gatattattt 
accaaaggcc 
ttcagtgtga 
ttatccttac 



gacagacaat 
ttcccaactt 
attaatcgac 
acgatgatgg 
cctccattgg 
ccgagatgtt 
taacgaccga 
tggcagcgga 
atgcactcat 
gaaataactt 
taatactgtt 
gcggcaaata 
atcccttcac 
caacggttct 
tccggtgagt 
aacaaatgta 
ataattagga 
ttatgatatg 
aaacagtctg 
tttaaatata 
aatgaaccaa 
gaaagttagt 
aacaaaactt 
ggtcaaagaa 
atttccttta 
ttgttgtttt 
tacaatcact 
gaagggattc 
ggattaaata 
atagttattt 
cccattgcag 
actgtttgaa 
atatcccaaa 
tactttacaa 
atcatcttgt 
agcacaccat 
ccttcctttt 
actcgtgcac 
ttttcgctgt 
gggatttatt 
agtgaaactg 
tcgcggccta 
ggcttgattg 
cacggctgtc 
tgcgttggcg 
acttcatttg 
ctcaacggtg 
ccatctagcc 
gcacttaaat 
tgtttcttga 
taaaggaatt 
ttcttatcaa 
caataagccg 
tgaaacaccg 
gatggaccaa 
tggaaaaaaa 
atcgtttgct 
cgtttgcgca 
gagagcccct 
ttgaccgtat 



attaataacg 
ggcggtggaa 
gacagctagt 
catcaatcaa 
agtgggcatc 
gctgctgaag 
gaatctttca 
ttcgccggtc 
ttttgtggcc 
tatgcacacg 
ggtctcaggt 
tcctgcaggc 
ggacgccatg 
caccatcaca 
ttttaacatt 
taatttcttc 
cagagaatat 
aaacaacaaa 
gctgtggaat 
ccaaattcat 
gttcattaca 
tagtcaatta 
ttccttatac 
tacaacagct 
gttcagaaac 
caaacacaat 
tggagtatgt 
caagtccgat 
ttgttgtcga 
tgaaacgaaa 
tcactttacc 
ttccttccca 
ccttaaagag 
ccgtaatatt 
ccttaatact 
gtcaaaattg 
ggcccttccg 
ggtgggtggg 
ttattccctt 
ttcttcggtg 
aagaggagtc 
aatgcccagg 
attaaagaca 
aaaacttttt 
aacatcaaac 
cacttgcgtc 
tcgttaattt 
tatgaagcaa 
gaactcaaga 
atctatagac 
cagcacacat 
accatacagt 
ttttttctta 
accattgatt 
ttcagattaa 
acacacccgt 
aagaggtaat 
aactattcga 
tttccggtga 
ttgcagtagc 



cattgatacg 
aggagtggtg 
atgttgcaag 
tcattcatgg 
ggtatcgcct 
aacgacaaat 
aatctcctgg 
gacgagtccc 
ggcgtgctgg 
gccaccaatt 
gagatggcat 
atcccccagg 
tgcatcatgg 
gcattcactg 
taaacgttat 
agtgttattg 
gaatagataa 
agtacctcac 
gataaaacaa 
tagctttcat 
gactatgtaa 
attgcttagt 
aggtatcacc 
tccttaagcg 
tcttgaatag 
ttccgtttct 
ccttaagaat 
tacaaagtta 
aaagagtaaa 
ggatattcgg 
aaaagaagtt 
tttacaccca 
aaagttaaaa 
ttccatctta 
taacatataa 
tcgcgagcca 
caggccatgc 
actgaaatgt 
agatggaaaa 
gacccatgac 
gactgctgca 
gacgagtcat 
gcaatatgcc 
ccgccgaaag 
agggaattcg 
atgtgggctg 
ttcgcaaaca 
aggtgaagcc 
ctgctgggtg 
atatgtccgt 
ctcgtgtatt 
gaataaattt 
cagcctacga 
tcgaaattta 
gcccagagcc 
tctcagggct 
accttggctt 
tttcggggga 
cctagttgtt 
tgtagccgtc 



agtgggcggc 
tggcgtggcg 
gcgtcgccat 
ctcatgtgtc 
catccacaat 
tcctaacaca 
gttccacgaa 
tgaccctgcg 
gtaatcttat 
tctatttgtt 
ttttgaggga 
agctgtacaa 
gcagtgtgct 
tggagcgata 
atatatttta 
cctcgctctc 
attaaacttg 
aaattatggg 
gttaggcttg 
ttatagacac 
ttaagttcca 
accaacatat 
agttccttcc 
aaagcgtctg 
ctagaagaaa 
tagcattttt 
atgcagacac 
aggcaaaaaa 
aggtttctat 
acatgtgctt 
aaaagtgatg 
cattcaaagt 
agcgtaaaaa 
tatctgccca 
tgtagcctaa 
ttaaatttat 
aattttcggt 
gaaaaccaac 
cgacttttac 
ggcgatttgt 
atcgctgccg 
cagaatgttg 
ccaccgccaa 
tcttaaattg 
tcatagcccc 
aacgtgcttc 
tcggtgcgta 
aggtgtcacc 
cacacagcaa 
taacgtgcca 
attcacatat 
aagtgtaaat 
aacaacgttt 
ccattggccc 
ttaaccataa 
tagttattca 
ttatcaaaat 
tataaaagat 
caggggtctt 
gccttcttcc 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 
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tctgctgggc 
atataggcat 
gggtgctcta 
agttccgcga 
taatagtaat 
gatgctttta 
aattggtaat 
ggccaggaac 
ccgccagaat 
ggagccatat 
gtccctgcag 
ggcagccgga 
gaaacaggag 
ccaggtgtct 
ggacagacac 
cctgctggtg 
tctcaagctg 
cttttgtggg 
tcgaagctcc 
cgagggcttg 
ttatacttag 
agaaaaagtc 
gcatggtcca 
tgacgaatac 
tgttaagcta 
atccttttaa 
ttccaagttg 
caggtgcagc 
cctggcattc 
ataaaataaa 
gcaaatctag 
gttgcagtag 
aaagtttcag 
gctaggaagt 
gtgaattggc 
aaagtagcat 
ttctttaaag 



tccattccac 
cagtcgggac 
ctttctctcc 
ggccttcaag 
ctattattaa 
aagttaaatt 
ctatgtattt 
caccaccacc 
ggatcaatgc 
ggatcctacc 
gacagcattc 
aatggagttg 
ctctatgggc 
caactctcgt 
tacgcatccg 
acaccacctc 
actagggtga 
agccacagtt 
cgtaagtttc 
acctatggct 
ttaggcattg 
ctgcgatttg 
gacaatttat 
acgccccaca 
ccagctaaat 
atgcgttttc 
tccaggatca 
gtttccaagc 
tgttgatttc 
ataaagtgcc 
gaaggaaaac 
atatatatat 
tttgcacctg 
ccggattgaa 
acgtggagtg 
acaaatttaa 
gcaggca 



gcccagcgac 
gccttcaacg 
acctgcatca 
gtaagcccat 
ttgggtcctt 
taagctttaa 
tttacacttt 
agcagagtca 
gtctgcagcc 
gagtggttca 
gcaccaccac 
gtggtggtgc 
caggtccggg 
cgctgggcga 
ggcgagccaa 
aaagtggtga 
taagtagaag 
tgccggatcc 
cgtggcgaaa 
cacccaaatc 
tttagttagc 
aatgcaaatt 
gagagtcata 
tcattttgca 
aatgtgaata 
ctttcacaat 
aagggaaaat 
aatctgttcc 
ccacacaaaa 
aataaattag 
atgacataaa 
tttcgcagtg 
gcccattgcc 
acgaatgtgc 
catggagagt 
ggacccgact 



tgatggcagt 
attacttccg 
atccgctact 
gggatcaacc 
cgcatgtttg 
ttattcaaaa 
taagatcact 
gcaccaccag 
ggccagttgc 
gtttcgttgc 
cacaaccacg 
tggtggcggt 
tacagcagtt 
tgccaattca 
aagagctctc 
cccatcggaa 
ggatgaggtg 
ggagacttgt 
acgaaggcaa 
ccagtgattt 
gcaataaact 
tcgggtgacc 
aagccgggat 
atggcaaccg 
aataaaaatt 
gcatgccagt 
caaaaaaaac 
taccttttgc 
actaaaagcg 
cagtcttaat 
ttttaaatgg 
taagtcacga 
ggaggtccat 
agagagtggc 
ggtgtcggtg 
gatgttatgt 



ttacggactg 
gatacttgat 
gtacaacatc 
cttctcagtt 
tgaggacggt 
ttgtttttcg 
ctgacgcgac 
cacaactaca 
agtgtaaaca 
cgggatgcga 
acgataaaca 
ggcggtgggc 
ccccatcgaa 
ctgctggagg 
ttggccacca 
gtcagtcagc 
gccaacacaa 
caatcggcat 
aaaacggagg 
cagctggctg 
tttctatctg 
ataaaaaatt 
aatgaggcta 
catagcactc 
gttctcaata 
tacattcaag 
agtttacatc 
gacatttttc 
ctgaaaaaaa 
attttcagag 
attgctcatc 
ctccacttgc 
tagtcaagag 
acgaatgggc 
ccacaaaata 
ccaaagggaa 



aatctgatta 
tacacgtccg 
atgagccaca 
aaccacttca 
cacaaatgca 
taaattaatg 
agtttggtct 
gtgcccttct 
acaacgcgct 
accaccaatt 
gcaacagcat 
ggcgtctgcg 
tgctgcaggc 
cggaggtggt 
aaagtggtgc 
ccgccactcg 
gcactccgcc 
cggtggctgg 
atcccagcag 
gtagctagag 
taaataagcc 
taacgaatgg 
cggcgtatcg 
ataaattaag 
ttacagcaaa 
ttcgcctgca 
tcaagttttg 
ctgcagattg 
taaaagaagc 
cccctcagaa 
aggatatcgt 
acctggcaac 
ggctttgtag 
ttatcctgcg 
aatcaactta 
ttattaatgg 



4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6617 



<210> 191 
<211> 1983 
<212> DNA 
<213> Drosophila 



<400> 191 

atgttgcaag 

tcattcatgg 

ggtatcgcct 

aacgacaaat 

aatctcctgg 

gacgagtccc 

ggcgtgctgg 

gccaccaatt 

atcccccagg 

tgcatcatgg 

gcattcactg 

aaattgtcgc 

cttccgcagg 

gaaaacgact 



gcgtcgccat 
ctcatgtgtc 
catccacaat 
tcctaacaca 
gttccacgaa 
tgaccctgcg 
gtaatcttat 
tctatttgtt 
agctgtacaa 
gcagtgtgct 
tggagcgata 
gagccattaa 
ccatgcaatt 
tttacgccca 



caccatagcc 
gcccagtccc 
ggcgaatccc 
tgtcgcccat 
cggcacaaat 
aacagccctc 
cacctgcatt 
caacctggcg 
cctttggtat 
ctcggaaatg 
tatagcaatc 
atttatattt 
ttcggtggtc 
tgtgttcgcc 



aacgacagca 
aatcagagtc 
agcgagagcc 
ctgctgaaca 
gcgagcacaa 
accgtttgtt 
gttatatcgc 
gtttccgact 
ccggatatgt 
gccgccaatg 
tgtcatccat 
gccatttggc 
taccagaacg 
gtttcgggat 



acgatgatgg 
cctccattgg 
ccgagatgtt 
taacgaccga 
tggcagcgga 
atgcactcat 
gaaataactt 
taatactgtt 
atcccttcac 
caacggttct 
tccggcagca 
tggcggcctt 
aaggatactc 
ttattttctt 



catcaatcaa 
agtgggcatc 
gctgctgaag 
gaatctttca 
ttcgccggtc 
ttttgtggcc 
tatgcacacg 
ggtctcaggc 
ggacgccatg 
caccatcaca 
caccatgtca 
ccttttggcc 
gtgcacgatg 
cggtggaccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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atgacggcga 
ctgcaatcgc 
gtcatcagaa 
gcccagcgac 
gccttcaacg 
acctgcatca 
atcactctga 
caccagcaca 
agttgcagtg 
cgttgccggg 
accacgacga 
ggcggtggcg 
gcagttcccc 
aattcactgc 
gctctcttgg 
tcggaagtca 
gaggtggcca 
acttgtcaat 
aggcaaaaaa 
tga 



tttgtgtact 
tgccgcgaag 
tgttggtagc 
tgatggcagt 
attacttccg 
atccgctact 
cgcgacagtt 
actacagtgc 
taaacaacaa 
atgcgaacca 
taaacagcaa 
gtgggcggcg 
atcgaatgct 
tggaggcgga 
ccaccaaaag 
gtcagcccgc 
acacaagcac 
cggcatcggt 
cggaggatcc 



gtacgtgctg 
gactttcgat 
tgtagccgtc 
ttacggactg 
gatacttgat 
gtacaacatc 
tggtctggcc 
ccttctccgc 
cgcgctggag 
ccaattgtcc 
cagcatggca 
tctgcggaaa 
gcaggcccag 
ggtggtggac 
tggtgccctg 
cactcgtctc 
tccgcccttt 
ggctggtcga 
cagcagcgag 



atcggagtga 
gcgaatcgcg 
gccttcttcc 
aatctgatta 
tacacgtccg 
atgagccaca 
aggaaccacc 
cagaatggat 
ccatatggat 
ctgcaggaca 
gccggaaatg 
caggagctct 
gtgtctcaac 
agacactacg 
ctggtgacac 
aagctgacta 
tgtgggagcc 
agctcccgta 
ggcttgacct 



aactgaagag 
gcctaaatgc 
tctgctgggc 
atataggcat 
gggtgctcta 
agttccgcga 
accaccagca 
caatgcgtct 
cctaccgagt 
gcattcgcac 
gagttggtgg 
atgggccagg 
tctcgtcgct 
catccgggcg 
cacctcaaag 
gggtgataag 
acagtttgcc 
agtttccgtg 
atggctcacc 



gagtcgactg 
ccagggacga 
tccattccac 
cagtcgggac 
ctttctctcc 
ggccttcaag 
gagtcagcac 
gcagccggcc 
ggttcagttt 
caccaccaca 
tggtgctggt 
tccgggtaca 
gggcgatgcc 
agccaaaaga 
tggtgaccca 
tagaagggat 
ggatccggag 
gcgaaaacga 
caaatcccag 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

1983 



<210> 192 
<211> 660 
<212> PRT 
<213> Drosophil, 

<400> 192 
Met Leu Gin Gly 
1 

Gly lie Asn Gin 
20 

Ser Pro Ser lie 
35 

Asn Pro Ser Glu 
50 

Leu Thr His Val 
65 

Asn Leu Leu Gly 

Asp Ser Pro Val 
100 

Cys Tyr Ala Leu 
115 

Cys lie Val He 
130 

Tyr Leu Phe Asn 
145 

He Pro Gin Glu 

Thr Asp Ala Met 
180 

Asn Ala Thr Val 
195 

Ala He Cys His 
210 

Ala He Lys Phe 
225 

Leu Pro Gin Ala 
Ser Cys Thr Met 



Val Ala He Thr 
5 

Ser Phe Met Ala 

Gly Val Gly He 
40 

Ser Pro Glu Met 
55 

Ala His Leu Leu 
70 

Ser Thr Asn Gly 
85 

Asp Glu Ser Leu 

He Phe Val Ala 
120 

Ser Arg Asn Asn 
135 

Leu Ala Val Ser 
150 

Leu Tyr Asn Leu 
165 

Cys He Met Gly 

Leu Thr He Thr 
200 

Pro Phe Arg Gin 
215 

He Phe Ala He 
230 

Met Gin Phe Ser 
245 

Glu Asn Asp Phe 



He Ala Asn Asp 
10 

His Val Ser Pro 
25 

Gly He Ala Ser 

Leu Leu Leu Lys 
60 

Asn He Thr Thr 
75 

Thr Asn Ala Ser 
90 

Thr Leu Arg Thr 
105 

Gly Val Leu Gly 

Phe Met His Thr 
140 

Asp Leu He Leu 
155 

Trp Tyr Pro Asp 
170 

Ser Val Leu Ser 
185 

Ala Phe Thr Val 

His Thr Met Ser 
220 

Trp Leu Ala Ala 
235 

Val Val Tyr Gin 
250 

Tyr Ala His Val 



Ser Asn Asp Asp 
15 

Ser Pro Asn Gin 
30 

Ser Thr Met Ala 
45 

Asn Asp Lys Phe 

Glu Asn Leu Ser 
80 

Thr Met Ala Ala 
95 

Ala Leu Thr Val 
110 

Asn Leu He Thr 
125 

Ala Thr Asn Phe 

Leu Val Ser Gly 
160 

Met Tyr Pro Phe 
175 

Glu Met Ala Ala 
190 

Glu Arg Tyr He 
205 

Lys Leu Ser Arg 

Phe Leu Leu Ala 
240 

Asn Glu Gly Tyr 
255 

Phe Ala Val Ser 
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260 265 



Gly 


Phe 


He 


Phe 


Phe 


Gly 


Gly 


Pro 


Met 






275 










280 




Val 


Leu 


He 


Gly 


Val 


Lys 


Leu 


Lys 


Arg 




290 










295 






Pro 


Arg 


Arg 


Thr 


Phe 


Asp 


Ala 


Asn 


Arg 


305 










310 








Val 


He 


Arg 


Met 


Leu 


Val 


Ala 


Val 


Ala 










325 










Ala 


Pro 


Phe 


His 


Ala 


Gin 


Arg 


Leu 


Met 








340 










345 


He 


Asn 


He 


Gly 


He 


Ser 


Arg 


Asp 


Ala 






355 










360 




Leu 


Asp 


Tyr 


Thr 


Ser 


Gly 


Val 


Leu 


Tyr 




370 










375 






Pro 


Leu 


Leu 


Tyr 


Asn 


lie 


Met 


Ser 


His 


385 










390 








He 


Thr 


Leu 


Thr 


Arg 


Gin 


Phe 


Gly 


Leu 










405 










Gin 


Ser 


Gin 


His 


His 


Gin 


His 


Asn 


Tyr 








420 










425 


Gly 


Ser 


Met 


Arg 


Leu 


Gin 


Pro 


Ala 


Ser 






435 










440 




Leu 


Glu 


Pro 


Tyr 


Gly 


Ser 


Tyr 


Arg 


Val 




450 










455 






Ala 


Asn 


His 


Gin 


Leu 


Ser 


Leu 


Gin 


Asp 


465 










470 








Thr 


Thr 


Thr 


He 


Asn 


Ser 


Asn 


Ser 


Met 










485 










Gly 


Gly 


Ala 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 








500 










505 


Leu 


Tyr 


Gly 


Pro 


Gly 


Pro 


Gly 


Thr 


Ala 






515 










520 




Ala 


Gin 


Val 


Ser 


Gin 


Leu 


Ser 


Ser 


Leu 




530 










535 






Glu 


Ala 


Glu 


Val 


Val 


Asp 


Arg 


His 


Tyr 


545 










550 








Ala 


Leu 


Leu 


Ala 


Thr 


Lys 


Ser 


Gly 


Ala 










565 










Ser 


Gly 


Asp 


Pro 


Ser 


Glu 


Val 


Ser 


Gin 








580 










585 


Thr 


Arg 


Val 


He 


Ser 


Arg 


Arg 


Asp 


Glu 






595 










600 




Pro 


Phe 


Cys 


Gly 


Ser 


His 


Ser 


Leu 


Pro 




610 










615 






Ala 


Ser 


Val 


Ala 


Gly 


Arg 


Ser 


Ser 


Arg 


625 










630 








Arg 


Gin 


Lys 


Thr 


Glu 


Asp 


Pro 


Ser 


Ser 










645 










Pro 


Lys 


Ser 


Gin 













660 




PCT/US01/09341 

270 



Thr 


Ala 


He 


Cys 


Val 


Leu 


Tyr 








285 








Ser 


Arg 


Leu 


Leu 


Gin 


Ser 


Leu 






300 










Gly 


Leu 


Asn 


Ala 


Gin 


Gly 


Arg 




315 










320 


Val 


Ala 


Phe 


Phe 


Leu 


Cys 


Trp 


330 










335 




Ala 


Val 


Tyr 


Gly 


Leu 


Asn 


Leu 










350 






Phe 


Asn 


Asp 


Tyr 


Phe 


Arg 


He 








365 








Phe 


Leu 


Ser 


Thr 


Cys 


He 


Asn 






380 










Lys 


Phe 


Arg 


Glu 


Ala 


Phe 


Lys 




395 










400 


Ala 


Arg 


Asn 


His 


His 


His 


Gin 


410 










415 




Ser 


Ala 


Leu 


Leu 


Arg 


Gin 


Asn 










430 






Cys 


Ser 


Val 


Asn 


Asn 


Asn 


Ala 








445 








Val 


Gin 


Phe 


Arg 


Cys 


Arg 


Asp 






4 60 










Ser 


He 


Arg 


Thr 


Thr 


Thr 


Thr 




475 










480 


Ala 


Ala 


Gly 


Asn 


Gly 


Val 


Gly 


490 










495 




Arg 


Arg 


Leu 


Arg 


Lys 


Gin 


Glu 










510 






Val 


Pro 


His 


Arg 


Met 


Leu 


Gin 








525 








Gly 


Asp 


Ala 


Asn 


Ser 


Leu 


Leu 






540 










Ala 


Ser 


Gly 


Arg 


Ala 


Lys 


Arg 




555 










560 


Leu 


Leu 


Val 


Thr 


Pro 


Pro 


Gin 


570 










575 




Pro 


Ala 


Thr 


Arg 


Leu 


Lys 


Leu 










590 






Val 


Ala 


Asn 


Thr 


Ser 


Thr 


Pro 








605 








Asp 


Pro 


Glu 


Thr 


Cys 


Gin 


Ser 






620 










Lys 


Phe 


Pro 


Trp 


Arg 


Lys 


Arg 




635 










640 


Glu 


Gly 


Leu 


Thr 


Tyr 


Gly 


Ser 


650 










655 





<210> 193 
<211> 4902 
<212> DNA 
<213> Drosophila 

<400> 193 

ttgggcaaat taaggaaact ttttgtggtt tcagttactc gtttaagcca cattttccca 60 
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acatatgctg 
tgcctgacaa 
catgctgcta 
gctttcatcc 
ttaatggcca 
ccattgaacc 
gtattcatta 
aaaactgaca 
aaaactgaga 
atatgtattt 
tttattttta 
ggtcgaattt 
tataattcga 
tgaccccaaa 
ctgctcgtat 
ctcgcttatc 
cagtgtcgcc 
tggcacgcca 
tcactttcgc 
cggcccctgt 
tgcgcgtaag 
agcacatcgt 
gggaatgggc 
gattatcgaa 
acacgcggtt 
agccatttag 
gcaggaggaa 
gaaaataagt 
tcattttgag 
ccacataata 
ctcaccgttg 
tgaggaagcc 
taaaatggga 
cgttaaagaa 
gctatagaaa 
tcatgatgtt 
tatagtcgag 
catagaccgc 
ccaccgccac 
tttcatatcc 
tacatcgtaa 
gatgaggaca 
cacagcaaac 
aatatatatg 
ggcaatggct 
tttttatcat 
acggaaaaca 
gtaactcata 
cgatgttccc 
aagggcagct 
gtgctgcctg 
agtggaagga 
ctgtgaacgc 
ccaatatgca 
cctgcggcat 
tatttaataa 
tgccgaatat 
agttccattt 
attttagttt 
agataaaagt 



cactttatga 
attgcatttg 
attagtcacc 
actgtctaga 
gggattatga 
ctcttcttga 
aatgggaaaa 
ttagataaca 
tcgcttttat 
atatatcact 
tcaaaagtcc 
ttgaactgag 
actatagctt 
cgcggaatat 
cacagttgtg 
gagtacagat 
tccattcacc 
attgaatttg 
cttgcgcact 
tgtgctgcct 
tttaaggcgt 
gggctggcgc 
atcgcccacc 
gaggtcttgc 
caggcgcacc 
cccagtggtg 
atcaaggaat 
aacgcaataa 
aaacaggaca 
gaatttaagt 
tgtgcagccg 
cccttccctg 
ttaaatacgt 
tcgattaata 
tgaatatccg 
gtagagcagc 
gatgctgaac 
catcagccgc 
tgggaaagtt 
tgcttaccca 
catcgcctgg 
tagagcacgc 
acatgggcaa 
tgtattcgca 
gtaaatttga 
gctcttaaat 
cataatgaat 
actgggggac 
atgccctgca 
atccagatgg 
tatcgaaagc 
atgaaggttc 
ggtaatggtt 
gatgctgtca 
tcctaagaac 
tagctgaaat 
ctcactctac 
attgtttgca 
tgcactcacc 
tggtggccgt 



attaaaagtc 
ctgaaattaa 
aacagcggct 
ctccagacca 
tgcagtgccc 
taatccccac 
aaatatttac 
gtttctattg 
atttaaattg 
gtactccaaa 
attctttttt 
tacataacta 
tgggctatat 
ttgtctaaat 
tgtgtgtgta 
agaggatatg 
gtcgccccct 
gtcctgagac 
ttgccactcg 
cctgccacgc 
gaccgcgaga 
tgcagtggct 
gatgacagct 
cggctgcgat 
gactgacccg 
gtcgtcgcca 
ggagggataa 
acatcttata 
tacactatat 
gcatatatcg 
cagtgaaccc 
atttttgcca 
tttttaattc 
aacttgtaac 
gcagcgaaac 
gggttgatgc 
acgtcgttga 
tgggcgtgaa 
ggataagggc 
gcatccggat 
gaagcgcctg 
agatggccgt 
aaaagtcgtt 
tgcgatacga 
ttgcctaaac 
tttttgtaat 
caaaggatca 
atacgattaa 
tcaccacgga 
caaatatgaa 
agaaccaatt 
ccttacctga 
agaactgtcg 
ctgaaaggat 
ataaatagca 
tatagcctat 
attttcttgt 
ctactgaact 
tgagcataac 
gtgcataaag 



atttatttta 
aacaaaatgc 
ttcagcgaat 
aaaagcgaat 
gctttttact 
ttgatccaca 
tgcaagagaa 
gactgctaac 
tataataaaa 
tactatttac 
tattaaactc 
catagtttat 
tacggttatt 
taaaacacaa 
atgtacgtag 
ctgaccccct 
tttcgtgcgg 
ccttgaacgg 
actcgtcacc 
cccccgtcac 
tggccaagtc 
ccccgggaaa 
gcgatatttg 
tctgaccctg 
ttgtcgttcc 
tggaagtcat 
tcagaaaggg 
gtggttttaa 
taatcctttt 
aatgataatt 
gtttgattac 
ccaagtccaa 
tagcttgaag 
ctgaagctta 
tcaccttaaa 
aagttgagag 
accactggga 
agggggccca 
gcctgatgca 
gactcgcgtt 
caggagtcgg 
catgggtccg 
tttcatctac 
tatggttttg 
tgagtcgaac 
tcacaaaggc 
aaggaccggt 
gggtctcgca 
gaactgaatg 
ctttacggcc 
aagcaagttt 
acggatgaca 
cattggccgc 
aattatccgg 
aaatagcaaa 
agcctcaaaa 
ttatttacaa 
gaattaaatt 
aaaatcatgt 
ttgttccgcg 



ggctgcacac 
actgggacac 
gttggccaat 
ttgtgcgcca 
gacagtaatt 
ctctttcaaa 
atgttattaa 
aaagtcggtc 
ccattagggt 
gcttaactat 
cactttataa 
tcgattacgt 
ggtaaggaca 
acaagctgga 
gtggcactgg 
ttaaaaggcg 
tttgtggctg 
gcttgagctc 
gctcacgccc 
acctcctggt 
atctgtgagc 
ctgcaaatcc 
cgtctggagc 
catctggaca 
attgccactg 
tgtggtgcga 
aaagtgcagt 
aaagtctcgt 
cattttcctc 
atgtaatttt 
gtcgaagagc 
agtgtctagc 
tttaaaatca 
ttttcaattt 
agcctcacga 
gaagtagaga 
ctcaatgccc 
gcagatgaag 
accatgagtt 
tgggcgctta 
ctccgtttca 
ccaaagaaca 
agcaggcatt 
gcttacagcc 
ctcgtaaggg 
tctctattcg 
tctacgaatc 
tcaatcatac 
gcttggggca 
cgtgacaact 
ggggcaaaat 
aatggcaata 
cgtttccgag 
gtgccagagg 
atatatagtg 
tgcaaatcga 
attttaattt 
ataattcaaa 
cggatatagc 
aaatgacaat 



acgttatgtt 
tgcgttatgg 
cgcgagaatg 
gataaacatc 
agccagacgc 
aaaaaaaaat 
tggatattga 
agctgtttaa 
atcaacctta 
gtagattcat 
atgaaacaac 
gcactataac 
accatataat 
gtggcgcttg 
aatggactac 
gcccgctctt 
cctttgcgtc 
ttgagcactt 
cccgaccccg 
gggcgggtga 
tcgtcaatgg 
tcctccaaga 
atgcgacgcg 
ctgtccaagg 
ccattggcag 
acgctgtcct 
cgccagacac 
gattcaaatt 
ttaaataatg 
tctcagtgca 
gctgtaagta 
caaagttacc 
ctaaattgac 
cctaacgcaa 
aacttgtggc 
acaccggacg 
gaggtggatc 
aaggccacgg 
atttcggata 
tcccccggtt 
acttcacccc 
ggaagcccga 
tcacaaaaat 
acagccggaa 
attgtcgctg 
aactttctat 
aattaaccag 
gcaccgtgca 
gggccagcaa 
tggacatcgt 
ggaaatgggc 
tatcgttcga 
agaacgctct 
ttatagaggt 
tgattataaa 
tttgagcatc 
aatatgcctc 
aaccatgaac 
gaggttaaac 
gcatgtgatg 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
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aggttgccca 
agtgaaagtg 
agaaggctgg 
ttctcattgt 
attttgactg 
tttctttgga 
taatgttttc 
ccacttgcaa 
aggtggaccc 
ataaaataat 
cattaaggct 
gttggtaagc 
agcgtgtttg 
gcgtgtccta 
gctaagtggc 
aaggctgaga 
tgaaacgagc 
gcaaatggac 
gtgttttcgg 
attgttgctg 
aagtttctac 



aaacgccggc 
gtgtcatcgg 
tcaggttgtc 
gaaacagttg 
ccatagggtg 
aatttcagac 
ataacgcccc 
tctgttgtca 
tatgtgctat 
acaagacaaa 
ttttactcca 
tgtgcgcttt 
tgtcctgtgt 
ggcgaaagac 
atttagtcaa 
aggacaggcc 
cagaagctca 
accgggtgac 
ctgctgctgt 
ctgaccgctg 
aatatatgca 



gataaatatg 
ggtcaccgtg 
cgcggagatg 
tagatcggtt 
tgtgtctttg 
ttctgcgtat 
ttacgggggc 
tgagtgtggg 
ttaaataaaa 
taacgcattt 
gatttctaaa 
gtttagcttg 
gtttacctgg 
caggatcagg 
aatatttgtg 
tgccgatccg 
aggtcaggtg 
ttaataacat 
tctgctctgt 
ttgaccgttt 
caaccaaaag 



agggcgtagc 
ggcaaaagct 
ttcagcactt 
gccaggacac 
tgtgtgtgta 
gtattctcaa 
tcatcaattt 
attaaattat 
ctatgttttt 
taacgaattt 
ttaccataaa 
ctaaggtcca 
catcctggga 
gcgagtggaa 
tacactggat 
tctcccaccc 
gtggataaga 
taacatagtt 
tctgttcaca 
ttggacgttt 
aatgctgtcg 



ccacgcttag 
cctccggctc 
gtgtcagatt 
cggaactgtt 
tacgcactat 
tcgaagcttt 
cgacttaggc 
gtgtacggat 
caaaatttgt 
agtaataaaa 
ttatacatat 
ccttagagag 
aactggcagg 
gctgaaaatt 
gaatttaagg 
aacaccaggt 
cggataagga 
tcattccatt 
ttctggctat 
tcattatcat 
aa 



ggtggccagc 
caggccctgg 
taagagaaat 
agtgggcagc 
tattttcgaa 
cgcttgcaaa 
cagagccact 
gtgtgtgata 
taaaatttaa 
caaattcaaa 
acacaatttg 
cccgtcttaa 
cggcagtcag 
aatttgaatt 
atgttgctac 
ggcaaacgaa 
gacgacgatg 
cagatgatga 
acttttcgtt 
tatcataaaa 



3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4902 



<210> 194 
<211> 1788 
<212> DNA 
<213> Drosophila 



<400> 194 

atgctgccca 

aaatttctct 

ctccagggcc 

ctgctggcca 

ctcatcacat 

ctgtttaacc 

tataacctct 

gttctctcgg 

cgatatattg 

gtaaagttca 

cagttctccg 

gcccatgtgt 

gtgctctatg 

aggcgatgtt 

gtggcggtgg 

gcggtctatg 

ctcgactata 

aacatcatga 

cttggtggca 

caaacgggtt 

gcgacgacca 

ggtcagtcgg 

cgcagccggc 

ctgtcatcgg 

ccactgcagc 

tcgcggtcac 

gtggcaggag 

agtggcaaag 

ctcaggacca 

ggggcgacgg 

<210> 195 



ctaacagttc 
taaatctgac 
tggagccgga 
ccctaagcgt 
gcattgtcat 
tcgctatatc 
ggcacccgga 
aaacggcggc 
ccatttgtca 
tatttgccat 
tggtgatgca 
ttgctgtgtc 
tcctcatcgg 
acgatgtaaa 
ccgtggcctt 
gatccacctc 
cgtccggtgt 
gccacaagtt 
aaaatcaggg 
cactgcggct 
ccactgggct 
tgcgcctgaa 
aagacctctt 
tgggcgatgc 
gccagcccac 
gccttaaact 
gcagcacaac 
tgcgcaaggc 
aattcaattg 
tgaatggagg 



cggtgtcctg 
acaagtgctg 
ggagcttttg 
gggctacgcc 
ttcgcggaac 
cgacatgatt 
taattatcct 
caatgcgaca 
tccgttcagg 
ctggatagct 
gggcatggga 
gggcttcctg 
ggtgaagttg 
ccgggggata 
cttcatctgc 
gggcattgag 
tctctacttc 
tcgtgaggct 
aagggggctt 
gcacacaacg 
aaatggctct 
ccgcgtgtcc 
cgataatccg 
ccattccctc 
gatgtgctcc 
tacgcgcatc 
aggggccgcg 
gaaagtgaaa 
gcgtgccaga 
cgacactgaa 



gcaaccgatc 
aacatctccg 
cccacggtga 
ctcatattta 
aactttatgc 
ttgttatgct 
ttcagtgaca 
gttctaacca 
cagcacacga 
gcccttttgc 
acatcgtgca 
ttctttggcg 
aaacggagcc 
agcgcccaaa 
tgggccccct 
tcccagtggt 
ctctcaactt 
tttaaggtaa 
cctcatactt 
gacagcgttc 
gccaatggca 
ttggacagtg 
cgtcgcatgc 
ttggaggagg 
attgacgagc 
acccgcccac 
gggtcggggg 
gtgctcaaga 
cgcaaaggca 
gagcgggccg 



tacaactgtt 
cggacaacct 
ccccgatgac 
tcgccggcgt 
acacggccac 
caggaatgcc 
gcatctgcat 
ttaccgcgtt 
tgtccaagtt 
tggccctgcc 
cgatgaaaaa 
gacccatgac 
gactcctgca 
cgcgagtcat 
ttcacgccca 
tcaacgacgt 
gcatcaaccc 
ctttggctag 
acagcgctct 
gcaccacaat 
gtggcaatgg 
tccagatgca 
tccagacgca 
atttgcagtt 
tcacagatga 
caggaggtgt 
gcgtgagcgg 
gctcaagccc 
gccacaaacc 
ccttttaa 



tcacaatgag 
gaccagcctt 
accactttca 
tttgggcaac 
caacttttat 
gcaggacctc 
attggagagc 
cacagtcgaa 
gtcacgggcc 
ccaagccatt 
cgactttttt 
ggccatctgc 
ggcgcttccc 
ccggatgctg 
gcggctgatg 
gttcagcatc 
gctgctctac 
acactttgga 
tcgacgtaat 
gacttccatg 
aacgacaacg 
gggtcagaat 
aatatcgcag 
tcccggggag 
cttggccatc 
gacggggggc 
tgacgagtcg 
gttcaagggt 
gcacgaaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1788 
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<211> 595 
<212> PRT 
<213> Drosophila 



<400> 195 



wet 


Leu 


Pro 


Tl-t l- 

i nr 


Asn 








vai 


Le u 


m a 

nla 


Thr 

l fir 


Asp 


Leu 


ri n 
bin 


Leu 


i 

1 








c 
3 










i n 
1 u 










1 0 




Pne 


His 


Asn 


blU 


Lys 


rrie 


Leu 


Leu 


Asn 


Leu 


i nr 


ri- 

bin 


va i 


Leu 


Asn 


i ie 








ZV 










ZD 










JU 






Ser 


Ala 


Asp 


Asn 


Leu 


Thr 


oer 


Leu 


Leu 


bin 


bly 


Leu 


p* 1 1 > 

GlU 


Pro 


blU 


CjIU 






35 










4 0 










A C 

H D 








Leu 


Leu 


Pro 


Thr 


Val 


Thr 


Pro 


Met 


Thr 


Pro 


Leu 


Ser 


Leu 


Leu 


Ala 


Thr 




50 










55 










60 










Leu 


Ser 


Val 


r- 1 . . 

Gly 


Tyr 


TV 1 — 

Ala 


Leu 


lie 


pne 


Tl- 


Ala 


bly 


Val 


Leu 


bly 


Asn 


65 










70 










75 










80 


Leu 


lie 


Thr 


Cys 


lie 


Val 


lie 


Ser 


Arg 


Asn 


Asn 


Pne 


Met 


His 


Thr 


Ala 










85 










90 










95 




Thr 


Asn 


Phe 


Tyr 


Leu 


Phe 


Asn 


Leu 


Ala 


lie 


Ser 


Asp 


Met 


lie 


Leu 


Leu 








100 










lUo 










110 






Cys 


Ser 


Gly 


Met 


Pro 


bin 


Asp 


Leu 


Tyr 


Asn 


Leu 


Trp 


His 


Pro 


Asp 


Asn 






115 










ion 
lzU 










IOC 

lzb 








Tyr 


Pro 


Phe 


Ser 


Asp 


Ser 


lie 


Cys 


lie 


Leu 


Glu 


Ser 


Val 


Leu 


Ser 


GlU 




130 










1 










14U 










Thr 


Ala 


Ala 


Asn 


Ala 


Thr 


vai 


Leu 


i nr 


Ti- 
ne 


I nr 


Ala 


D W i-w 

pne 


i nr 


vai 


Pl 
GlU 


145 










150 










155 










1 60 


Arg 


Tyr 


t i — 
lie 


Ala 


lie 


Cys 


His 


Pro 


r» u _ 

Pne 


Arg 


Gin 


His 


Thr 


Met 


Ser 


Lys 










165 










170 










175 




Leu 


Ser 


Arg 


Ala 


Val 


Lys 


Phe 


lie 


Phe 


7\ 1 — 

Ala 


T 1 — 

lie 


Trp 


lie 


Ala 


Ala 


Leu 








180 










loo 










190 






Leu 


Leu 


Ala 


Leu 


Pro 


Gin 


TV "1 _ 

Ala 


T 1 — 

lie 


Gin 


Pne 


Ser 


Val 


Val 


Met 


Gin 


Gly 






195 










200 










205 








Met 


Gly 


Thr 


Ser 


Cys 


Thr 


Met 


Lys 


Asn 


Asp 


Phe 


Phe 


Ala 


His 


Val 


Phe 




210 










215 










ZZU 










Ala 


Val 


Ser 


Gly 


Pne 


Leu 


Pne 


Phe 


Gly 


bly 


Pro 


Met 


Thr 


Ala 


Ti- 
ne 


Cys 


225 










230 










235 










240 


Val 


Leu 


Tyr 


Val 


Leu 


lie 


Gly 


Val 


Lys 


Leu 


Lys 


Arg 


Ser 


Arg 


Leu 


Leu 










245 










250 










255 




Cain 


Ala 


Leu 


Pro 


Arg 


Arg 


Cys 


Tyr 


Asp 


Val 


Asn 


Arg 


uiy 


lie 


Ser 


rtld 








z ou 










o a c; 










o n n 

Z 1 U 






bin 


. 1 nr 


Arg 


vai 


l ie 


Arg 


Met 


Leu 


vai 


Ala 


Val 


Aid 


vai 


A 1 i 

Hia 


rne 


pne 






O *7 c 

Z ID 










ion 
ZoV 










ZOD 








t l „ 
lie 


Cys 


Trp 


Ala 


Pro 


fne 


n: - 
HIS 


nl a 
Al a 




Arg 


Leu 


Men 


Ala 


w 3 1 

va i 


Tyr 


pi „ 

bly 




0 OA 




















-inn 
JUU 












l nr 


Ser 


biy 


lie 


U-LU 


Ser 


bin 


Trp 


Phe 


Asn 


Asp 


Val 


rne 


Ser 


J. ±G 


o f\ c 










Tin 
J1U 










315 










*3 o n 


Leu 


Asp 


Tyr 


1 nr 


C n v- 

ber 




vai 


Leu 


Tyr 


Phe 


Leu 


O -w 

ber 


l nr 


Cys 


T 1 a 

lie 


Asn 










IOC 










330 














r X U 


Leu 


Leu 


Twr 

lyr 


Hen 
nail 


lie 


Mot* 


JC I 


Hi <3 


Lys 


Phe 


nl y 


OX LI 


Al a 












340 










345 










350 






val 


Thr 


Leu 


Ala 


Arg 


His 


Phe 


Gly 


Leu 


Gly Gly 


Lys 


Asn 


Gin 


Gly 


Arg 






355 










360 










365 








Gly 


Leu 


Pro 


His 


Thr 


Tyr 


Ser 


Ala 


Leu 


Arg 


Arg 


Asn 


Gin 


Thr 


Gly 


Ser 




370 










375 










380 










Leu 


Arg 


Leu 


His 


Thr 


Thr 


Asp 


Ser 


Val 


Arg 


Thr 


Thr 


Met 


Thr 


Ser 


Met 


385 










390 










395 










4 00 


Ala 


Thr 


Thr 


Thr 


Thr 


Gly 


Leu 


Asn 


Gly 


Ser 


Ala 


Asn 


Gly 


Ser 


Gly 


Asn 










405 










410 










415 




Gly 


Thr 


Thr 


Thr 


Gly 


Gin 


Ser 


Val 


Arg 


Leu 


Asn 


Arg 


Val 


Ser 


Leu 


Asp 








420 










425 










430 






Ser 


Val 


Gin 


Met 


Gin 


Gly 


Gin 


Asn 


Arg 


Ser 


Arg 


Gin 


Asp 


Leu 


Phe 


Asp 
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'435 










4 40 










445 








Asn 


Pro 


Arg 


Arg 


Met 


Leu 


Gin 


Thr 


Gin 


He 


Ser 


Gin 


Leu 


Ser 


Ser 


Val 




450 










455 










460 










Gly 


Asp 


Ala 


His 


Ser 


Leu 


Leu 


Glu 


Glu 


Asp 


Leu 


Gin 


Phe 


Pro 


Gly 


Glu 


465 










470 










475 










480 


Pro 


Leu 


Gin 


Arg 


Gin 


Pro 


Thr 


Met 


Cys 


Ser 


He 


Asp 


Glu 


Leu 


Thr 


Asp 










485 










490 










495 




Asp 


Leu 


Ala 


He 


Ser 


Arg 


Ser 


Arg 


Leu 


Lys 


Leu 


Thr 


Arg 


He 


Thr 


Arg 








500 










505 










510 






Pro 


Pro 


Gly 


Gly 


Val 


Thr 


Gly 


Gly 


Val 


Ala 


Gly 


Gly 


Ser 


Thr 


Thr 


Gly 






515 










520 










525 








Ala 


Ala 


Gly 


Ser 


Gly 


Gly 


Val 


Ser 


Gly Asp 


Glu 


Ser 


Ser 


Gly 


Lys 


Val 




530 










535 










540 










Arg 


Lys 


Ala 


Lys 


Val 


Lys 


Val 


Leu 


Lys 


Ser 


Ser 


Ser 


Pro 


Phe 


Lys 


Gly 


545 










550 










555 










560 


Leu 


Arg 


Thr 


Lys 


Phe 


Asn 


Trp 


Arg 


Ala 


Arg 


Arg 


Lys 


Gly 


Ser 


His 


Lys 










565 










570 










575 




Pro 


His 


Glu 


Lys 


Gly 


Ala 


Thr 


Val 


Asn 


Gly 


Gly 


Asp 


Thr 


Glu 


Glu 


Arg 



580 



Ala Ala Phe 
595 



<210> 196 
<211> 9889 
<212> DNA 
<213> Drosophila 



585 



590 



<400> 196 

taattgctgc 

attataetgt 

gaccaaaaga 

caagaactgg 

tttgcgcgct 

caaaaggatt 

tgctgacttg 

agtaaacaag 

aaggttgagt 

ttaatttgag 

gaaaaaccgg 

gtgtgtacaa 

tttaatattt 

caaatttaat 

acctaagcca 

tcattcctta 

aacccttcga 

ctttgatcgt 

ggtcagcaat 

aatcgtcaac 

caatttgagc 

ggcgcacacc 

gatggatcac 

aatggcactg 

catctcaaga 

gtccatcgcc 

caccgtccag 

tagcctctac 

caagtatccg 

ttacatcctg 

acctttaaag 



cgggtaattg 
tgcactataa 
tgtgtatttc 
gagagcaaat 
tgtgcaaatt 
ttataacaat 
cacctaggca 
acagctgagt 
ttttgtttaa 
catttgagca 
ttttcagaga 
cgttttatct 
aattacagtg 
tagacatgta 
actaaatgat 
agtaaccaat 
ataacttatt 
ctgcccagtg 
tggtccacac 
tatacgctca 
gtgagcacca 
aatagtcgcc 
gcacctcaac 
ttctctctgc 
agaaactcac 
gatgtcctgg 
tggctggcca 
ctgagcacct 
atgaagtcgc 
tcgctggtgc 
ttcttataaa 



caactgagct 
aatcattcga 
gaatacttct 
aaatatgcga 
cgaatgtttg 
tccagtgtcc 
aagcaatttc 
gaaagacttt 
gatgattgca 
tttttctgct 
aagttgcgag 
aagaaaacca 
ttcacaaaaa 
tctattgttt 
gcaatgctgg 
ttaagatgtt 
taagccaaga 
ttccgagtgc 
tggccaactt 
acatgatcga 
cgcccctgcc 
atgaagcacc 
tatcccgatc 
tgggcaacct 
ggcacacgtg 
tcacctggtt 
atgagctcac 
atgtcctggt 
tcaacatggc 
tcagcttgcc 
atcatttgcc 



caggtcagcg 
cccaagtccg 
caaacgttat 
tttcctcgac 
caacttaatt 
agcccatcgt 
ccttgaaaac 
ggcttcatgg 
catggcacac 
gaaccgggaa 
taatacatat 
gaaaaccaga 
aaaaaaacga 
ttttttttcc 
tgacatataa 
gtgctcccaa 
tatagacaaa 
ctcgatggat 
cacgcgactt 
cgtgggtgtg 
cgcctacgca 
accgatggcg 
cgggttgctg 
gctgaccatc 
gagtgctatc 
ctgcattatc 
ctgcaagctg 
cctcatcgga 
caagaggtgt 
acaggtaagt 
tagaacaaac 



agtttgctaa 
attaacttaa 
ttccttaagt 
ttatagttac 
ggactgcata 
actgatgtgg 
aatcaaattc 
gtgaacatat 
aggttgattt 
aacccacatt 
ttaatacgat 
agctttttag 
aattcccctt 
acttgcagct 
aacgaaagtg 
gatccctaaa 
atggaggacg 
ttggagacgg 
gtggctggtg 
ggcatggcca 
atctccaata 
gaacaggttc 
aaagtgtatg 
tggaatatct 
tactcactga 
ggggaggccg 
gtgaagctct 
gtggaccgct 
cataggctac 
tttgatacac 
agataagata 



ccatggcgtc 
gtgggtctta 
caaatcattt 
ttgtgtgaaa 
atagattggg 
ctaattgttt 
aaaacaatca 
ttgtcggttg 
taatctaaat 
tgcttaccta 
ttcaagtcct 
cttagacagc 
ttctttgctt 
tatttatcca 
gaaacggagt 
ccaccgtcca 
agtggggctc 
aaaacgaggt 
ccgctcctga 
cggatatatc 
gctcctcact 
cggagcacgt 
tcctggcggt 
acaaaacccg 
tgttccatct 
cgtggtgcta 
tccagatgtt 
ggatagcggt 
ttggcggtac 
tgaaaataag 
gttacacaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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taagcaagat 
gtgggatatg 
ttgattagga 
aatttctgtt 
ttaaaaagat 
cagtgaagtg 
ccattacgct 
gcctgtgccc 
aataatatgt 
taggcgccag 
taaatgtggc 
gggcataggg 
ggtttttttc 
aaaaaaaaaa 
cgatttcggt 
gtcagctggc 
caactctaga 
tgtctccaca 
ggtgaggcac 
ctcgtgtttt 
aggtaattag 
aaaagtgact 
agaatttggt 
aagatttgtc 
cctaagtgga 
actgctcgcc 
cgacgccatt 
gtacaaaaat 
ttctgggtgc 
gcaaaaaatc 
aaaatttggc 
tggaaaggac 
gtccagcttg 
gtcatttatg 
ttttatggtc 
ctactcgaat 
cgccggaaac 
ttcatggtgc 
gagccacatc 
cgaaatgaaa 
tttggctcca 
aaaaaagttt 
agtgtgagcc 
agttatccac 
aaattcagca 
gaactagtca 
gataatcgat 
tcagaatatt 
gaataaagta 
ttacctgaat 
taaaaaatga 
acaaaattaa 
acaatccttg 
atcgtgcctt 
cttaaacgat 
ttggttagaa 
tactgtcgca 
agtgctgaag 
cgagcatttt 
aatttaagaa 



tgtaataata 
ttagattcaa 
aatcagtaaa 
cgacgtaaat 
ttagccaaag 
cgaagaagac 
aagttaaagc 
cctactcctg 
ttactttata 
aatgtgtgca 
tttaggccgc 
ttgggcgtat 
ttctggtttt 
ggtccgaaca 
tctttgtgcg 
ggataagcct 
acgactcggc 
tcccgatttc 
ttcacttctc 
tccttaaagt 
cacgaaatct 
gcaaactttc 
tagccataag 
acctgagaga 
tggtaagtgg 
cgcaaatgta 
tccttttatc 
cgaggcgaga 
gaaggcatgc 
tactgagctt 
cgttttgttg 
ataatgtatt 
ggtgtaaata 
gatggccact 
cacttttcga 
atgcgacata 
gtaggaatca 
aggcaaccaa 
gaatccaata 
attcacttcg 
cttctatcga 
agtaccacac 
caaaaatgtg 
aggtcctccg 
gccttaggtg 
atattacaag 
aggggtcctt 
tacagcagaa 
aataaccgga 
aacaattgga 
aatccattta 
ttattataat 
ttaaatctaa 
taagccaaac 
tgccaagcac 
cggaagttct 
agtttttgtc 
gcttgcttgt 
taatgaatgg 
agaatattga 



taaaagtcat 
tctttagata 
agctttaaga 
gcaaaaagcc 
ggcaaattta 
acgcttatga 
tgcgacttag 
gcactccatt 
acacgtatac 
gacattaaaa 
tgccacgccc 
ggggacaccc 
tcctctcggc 
agtgtgtctt 
cagataggca 
gaatgggact 
ttcagatttc 
atactcgatt 
gaattcagtt 
ggaaattcgt 
cacttcaaag 
cactgctctt 
tggggacaag 
tgccaaaaat 
atttcccgca 
tgctaaaatg 
tgaataatgt 
ttatatttgc 
gcgagtcagt 
aataacattt 
ctgggaattt 
ggttttggat 
tttgatgacg 
gacacccaat 
catctcgctg 
aaatttatgc 
tctcaaacaa 
acagccaacc 
tgagacgcgc 
actgccaata 
ccccaactcc 
gcacacaacc 
gcacaatgaa 
cagagtatta 
gcataattaa 
agtcccttgg 
aaatgcttag 
tttgaatcaa 
tttaattatt 
aataagtgca 
atattttaat 
tattaaaaca 
cttaaacgta 
aaatatgtat 
ctgaccgatg 
tgaagttgtt 
taaagttttt 
gtgtggcatg 
caacatggaa 
aagccacaat 



ttttggccta 
atttaaatga 
tctcctctta 
ttgagggtac 
aagcgtctgc 
aaataatcca 
ccgtggtaat 
tgatccttac 
gccccgtggc 
atgattgcct 
acacacacgc 
aatcgaattc 
ttttcgcttg 
ctgcctgacg 
cactgggtga 
gcattaatgg 
cggcaccctg 
cgcctggcct 
aatctttagc 
ggacacttca 
ttgccaaggg 
tatgtgaccc 
tgggtagaaa 
agatgaaaaa 
cgcaagttat 
tagctcattt 
attgcgtgga 
tcactttggg 
ttgttaaatc 
tgatgttttt 
gggccgactt 
ttttaaaata 
aacgtcacaa 
taaaagaaac 
agatcccggg 
cgcttacaga 
ttttccggat 
aaccaaccaa 
cgctgaactc 
aacaacaaat 
tccgcgtcct 
acacagaaga 
aacaggccgc 
ttgcgaattg 
aatttcgtag 
cgatttcacc 
ctttaagtga 
taattgcaat 
aggtgacaaa 
aaaacagcaa 
ttgggctcat 
aatgagcagg 
gctgagtgtc 
attttaacaa 
tgtgaaaact 
ccctcacaat 
cgttgcaact 
gttttagcgt 
ataatgaaca 
tatccttgct 



taaatggtct 
aaaaaagtta 
atttatgtac 
ccgtaaaaac 
atctgaaaaa 
gtcacttttg 
attcacctgc 
gctcgcgctg 
acagtgcaaa 
ttcgcacaat 
gtgctgatgg 
cggtcgtccg 
gggacgggcg 
acagacatta 
tatattcctt 
gcgtcactcg 
attgtataat 
ttccttgatt 
tggagtgcta 
gtttagggtg 
ttagtaatat 
gggtcataaa 
ggtcaaacct 
gaaccaaagg 
gcaactagca 
aattagggag 
acaataatta 
gcaaaatgtg 
atctgacttg 
ttccaatcga 
ttggtccagc 
gaaccagact 
attacgatga 
tttgcaatca 
aaactttctt 
cgtgacggtc 
aactttttcc 
gtggcacatg 
gtcggtttca 
gttttcaaca 
tcgaacccaa 
atcctgccac 
cataatacag 
aatttcaccc 
aattgtttac 
ttcagtgtgc 
attaaaattt 
ttatatgtgc 
atcgtaacat 
aaagtataat 
etggattcat 
taacaagtca 
caatttaaat 
tctgatagtt 
acgaacttga 
gaatcaatct 
tgtctaactt 
ggagcgccct 
aacacttgaa 
ttcttttact 



aaaaatattt 
aaggtgccct 
cctttcggaa 
attattctat 
atggcccttc 
gtggggcgtt 
cactgcccct 
atgaagttga 
tagtcttgag 
aagtgtacac 
gcgtaggaat 
accgacaaag 
accaaataaa 
ttcatattcc 
tgccctggca 
ctttgttcta 
atatgccgcg 
tatggccccg 
cgagctcctg 
tcactaaggt 
agttagattg 
tatcgttcgc 
tctgtataaa 
gaaatttcct 
cactttcccc 
cccagcaacc 
cagacgtacc 
tgtgtggttt 
tttaaatgcc 
tgtcaagcgg 
cgacccataa 
ttgatgaact 
tatgacagaa 
ttaagccaat 
catttcgctt 
gaaacgttcc 
atgtcacacg 
tattggtaag 
aataaaggag 
gcagcattta 
ggaaaaaaga 
aataactcga 
tcaaacacaa 
acatatcatc 
aaattggagg 
gcgggctttt 
gaatttagtt 
aagcattggc 
ttaaatatat 
aaagtttaga 
attttaggtg 
gtaagctcta 
ggtcaattta 
ttacaaacgt 
gcaatacaat 
tgttttattt 
tgccgcaaat 
ttaaactttt 
aagttgaaat 
agtcatgacc 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
54 60 
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tataaaaata 
aggcaatggc 
gcttctcgtt 
gcgcggaaag 
aaacccataa 
catcgttggt 
gctcgaaaag 
ctcggtaaga 
tgttccatca 
acccataacc 
agtcagcagt 
tagtccttgg 
tcttatttac 
aacaggcaat 
gttcaatcca 
tcatcttcca 
gattctacac 
tcctgctgcc 
gtgagttttc 
gcaggcagcg 
cccacacaga 
tccgtggtga 
atgttcatgt 
tgaagctgaa 
cgatttgtca 
ttacgacttg 
tcgcagctgg 
caatttttaa 
ggcgtaccca 
gattacggcg 
ttctgtacag 
atttaaattt 
taagtgaatg 
tgcacattca 
gcagtgtgca 
cctgagcccc 
aaaggtgaaa 
agctgggcga 
acccactcat 
ggggcggcaa 
taattctatt 
gttacttttc 
ttgctgcttt 
aatgatagtg 
tattaataca 
aaatttttac 
acattgtttc 
ccaaagcagc 
atgtttgttt 
cttttccgtc 
cgcaggggcg 
acaggtggca 
tccaacggca 
gtgggtcccg 
atccgtaagg 
cagcagcgtt 
cggggcggag 
agcgtgtgag 
gacacagtag 
gaggacgcct 



tatttttcta 
ttggaagtta 
atttctgcgc 
ttttcccatt 
aaagcgcaaa 
gtatttaaga 
ctttttcatg 
cttgaacaat 
atttggaaat 
aagccgcttg 
ttgttaccat 
tgcaatcagg 
acaacacggt 
aaattgatag 
tgtgtcttaa 
tgtagcgcgt 
ggcggattgg 
gctgtgcatc 
agcgagaagg 
aaaagatgtt 
cgaatcgcca 
tcatcatagc 
tcctcaatcc 
tgacaagcag 
gtttgtagcg 
gggtcatggt 
caaatccgtt 
ttaatcgtta 
actttgatga 
aggcaatccg 
taggggaaaa 
taaaaatgct 
ggatatttgt 
taatggcatt 
aaataattaa 
aattcgaccg 
ggtgtgcgac 
cgatctgcag 
ctacggtgcc 
caacaacgcc 
aaccaccaca 
actcaagcaa 
tctgatcaaa 
taaatttaaa 
tattttactt 
aaaataaatc 
atcgcttgtt 
gaactatctc 
tgataacgca 
tggtttttat 
actcgcagcg 
gttcccgcca 
cagccggacc 
gcaatgggac 
gatcggctct 
tgctgctgca 
tgggcgtggg 
agcagctggc 
aagcagcagc 
gctcctgctg 



ctataaaaag 
taggcgaaat 
ggggggtttt 
tcctccattt 
taaaaatact 
tgctacactt 
gtacgagaat 
ggagcagtag 
gatttcagcc 
ccacccacaa 
ggaccgcatg 
acatggatgg 
agagatatat 
gaaggtaaat 
cctttaattg 
ggcccattcg 
caggagcaga 
ctgtttggca 
agtttacatt 
ccagggatca 
gaggctgata 
gtttctcatc 
ggacaaaagg 
atatcgattc 
cacggtttgg 
tgtacgagta 
tggagctata 
cgccacccga 
cttgccagag 
ggcagaagga 
tgaatgagta 
ataattgaca 
ccgccgatta 
tccgctgaaa 
tttggaagcc 
caggaacgcc 
aagggcaagg 
gacgccatct 
ttccacctgt 
tacagcttga 
tttcgtactt 
tacactttta 
gagtttttat 
taaaatagta 
aatctagtgt 
ttaaatcctt 
aatatcattt 
aggatcagat 
aaaagcacac 
cacctcaccg 
cactccatcc 
gatgcgggcc 
gggtgcagct 
gcccggtggc 
gttggcccga 
cgagaagccc 
cgtggccagc 
gatggcggta 
agcaacagca 
ccattgcctg 



ccactggcac 
taataaaaat 
tgtgtagttt 
gggtcagctg 
tttagcgtat 
cctttatcag 
ttccaccttt 
caactgcaga 
aaacaagaaa 
tggaaagtag 
tgggtatttc 
tttgctaata 
ggttttaaaa 
atccttaaga 
tttactatgt 
tggaggagtt 
tgtacgccac 
cctacatgtc 
cccgtactaa 
aagttggcca 
cacaaggcca 
tgctggacgc 
gtgagtttcc 
ctgataccaa 
gcgaatcctt 
aaataaaaac 
cccactgcca 
ccgaccacac 
atggtccgta 
acacaaggtc 
gcaaaatatt 
aaatagaatc 
acctgaattt 
ctctttctcg 
aaatttttag 
cacaatgcgc 
aaattaattt 
tcttcttcgg 
gtcctggcaa 
acaggtgagt 
gtttttgcgc 
agacagccaa 
tcttggtaat 
gaaagttttc 
tgctttttaa 
gcaaagagtt 
actttattta 
atgatttgct 
acaccgtgat 
catcgtccat 
atgctaacgg 
ttccgccagc 
ccctttaagg 
tccgtctcta 
caacccagct 
tcgaccctgg 
ggtctgctgg 
gcggcggcgg 
tcagccgatg 
agcagggagc 



tttacttcgg 
caaccattag 
ttactacgcc 
ttgacacagc 
tagttatgcg 
ttatggccac 
ctacgtggca 
atgtatacat 
atcaagcaaa 
atcaaagcca 
catgtaaaca 
aaaatcttat 
atttccattt 
aacacttgta 
tgcatgttcg 
ttaccagtgc 
cttcacgctg 
caccttccgc 
cctgataacc 
actactcaac 
agatgaagtc 
cctactacgt 
gcccgttccc 
aaggctcatc 
tgtcctgttc 
gagaatacga 
taaaaatgat 
ggtcgcaatt 
aaagctggcc 
gaattgctac 
tccaaagttt 
atttttgctg 
catatcctaa 
ctgtggaaat 
cgcagtgtat 
ccacgcgaat 
gaatttcacg 
catgtcaaac 
agggggcaag 
ggtgagccaa 
ttttatcgtg 
gcaagctttt 
atttcacatt 
cctaaagatc 
tgacagtttt 
ttcttttatg 
tttattattc 
atgattttgt 
tggcaataaa 
tcccccgtcc 
cggtgacgca 
agagctacta 
agcaggttgg 
gcggcgccac 
gtctgaggga 
tgctcagcta 
acaacaacga 
cggtggcggt 
cggatgcgaa 
tggtgcggcg 



tttcggcgga 
acgattatcc 
actcggcgga 
ggccgccaga 
tagacacatc 
acattttcct 
ctctgctaaa 
tttccaaaaa 
agcacttttc 
caacgttttg 
tgcctgtact 
tacggcacca 
tcctgcttaa 
taaagttcaa 
ttccagttct 
gtcacccacg 
gtcttcacct 
accatttcca 
tgataatcgt 
ggccaaattg 
gcttcgcata 
catgatgatt 
ctgccgaatt 
aatcagcagt 
cctaccatat 
cacgacctag 
ttcttatgct 
tttccttaac 
caagttgcaa 
aaaccgagca 
atttgtttcg 
tcactatttt 
tgaattggat 
cccagcgatg 
ccataagcga 
gtagagggga 
gattgcttgc 
agcctggtca 
tcgagcggcg 
cagaataagt 
tcaaaaaata 
gaagtctgaa 
taagagtiata 
gatgtataac 
tgctgtgctt 
caaatggaaa 
taaacagctg 
cacacttcct 
ctgataaatt 
actttccatc 
ggtggacggc 
ccgcagctcc 
cctgctgcac 
gccgcagttg 
gcaggagcac 
cgacagccag 
gcgagtgtcg 
ggatgcaccg 
tgtggcgatg 
gcagttggag 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
74 4 0 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
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ctccaagtgc 
ggccggagct 
agatcaccgg 
cacaatgggt 
ctacactggg 
acttcaacaa 
aaattgcaat 
tgccaattcg 
gcaatcagag 
acgacttcaa 
ttttcagttt 
gtaaaataga 
tattttgaga 
atgttcaacg 



ttccagtgca 
ggggattcca 
ggcgtgcaac 
gtgcttagaa 
agaaactcgt 
ggatacaaat 
taaatctggt 
ccagcgtgta 
tcagttacaa 
atgagtttct 
ttcattttgt 
aatcatctaa 
agaatattgt 
acacaaggaa 



gtgaggcgac 
gcgattccct 
atgtgcttag 
gtagcgtagg 
ggtcactttg 
ataagagata 
tgtgaaaaaa 
atccgatata 
tttacaatta 
ctcattagca 
tcccccagcc 
aaacagggta 
aaaaaaaaaa 
ttattttgat 



tgccagtgga 
cggtcaacgt 
atagtgctaa 
attagagcta 
catgttctct 
atatctatgt 
atagatttaa 
agcaacaata 
tatgactgaa 
gggcaacttt 
gagtaactca 
aatctaacta 
gattgtaaag 
taatttaaag 



tttgatccgg 
ggggagtggc 
tcgagagtta 
gtccgtacga 
gaatgttcta 
acatcccata 
cttctgaatg 
attacaatta 
taattggact 
gtaatgttaa 
aaaatcgatt 
ttgtaaatac 
tttaaaataa 
tactaaaaa 



ctggctgagt 
gtcgatgatt 
agtcttcaaa 
agtggttgcc 
ctaatacttg 
aaaccataaa 
tttcaaatgt 
taatcaatgt 
atgaatatat 
ttaacagcgt 
ctataattgt 
atttactcat 
atatgagttc 



9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9889 



<210> 197 
<211> 1644 
<212> DNA 
<213> Drosophila 



<400> 197 

atggaggacg 

ttggagacgg 

gtggctggtg 

ggcatggcca 

caactatccc 

ctgctgggca 

tcacggcaca 

ctggtcacct 

gccaatgagc 

acctatgtcc 

tcgctcaaca 

gtgctcagct 

ttttaccagt 

accttcacgc 

tccaccttcc 

tactcaacgg 

atgaagtcgc 

tactacgtca 

ctgcaggacg 

ggtgccttcc 

aacgcctaca 

acgcaggtgg 

tactaccgca 

gttggcctgc 

gccacgccgc 

agggagcagg 

agctacgaca 

aacgagcgag 



agtggggctc 
aaaacgaggt 
ccgctcctga 
cggatatatc 
gatccgggtt 
acctgctgac 
cgtggagtgc 
ggttctgcat 
tcacctgcaa 
tggtcctcat 
tggccaagag 
tgccacagtt 
gcgtcaccca 
tggtcttcac 
gcaccatttc 
ccaaattgcc 
ttcgcatatc 
tgatgattat 
ccatcttctt 
acctgtgtcc 
gcttgaacag 
acggcacagg 
gctcctccaa 
tgcacgtggg 
agttgatccg 
agcaccagca 
gccagcgggg 
tgtcgagcgt 



ctttgatcgt 
ggtcagcaat 
aatcgtcaac 
caatttgagc 
gctgaaagtg 
catctggaat 
tatctactca 
tatcggggag 
gctggtgaag 
cggagtggac 
gtgtcatagg 
cttcatcttc 
cggattctac 
cttcctgctg 
cagcagcgaa 
cacacagacg 
cgtggtgatc 
gttcatgttc 
cttcggcatg 
tggcaaaggg 
gggcgactcg 
tggcagttcc 
cggcacagcc 
tcccggcaat 
taagggatcg 
gcgtttgctg 
cggagtgggc 
gtga 



ctgcccagtg 
tggtccacac 
tatacgctca 
gttccggagc 
tatgtcctgg 
atctacaaaa 
ctgatgttcc 
gccgcgtggt 
ctcttccaga 
cgctggatag 
ctacttggcg 
catgtagcgc 
acggcggatt 
ccgctgtgca 
aagatgttcc 
aatcgccaga 
atcatagcgt 
ctcaatccgg 
tcaaacagcc 
ggcaagtcga 
cagcgcactc 
cgccagatgc 
ggaccgggtg 
gggacgcccg 
gctctgttgg 
ctgcacgaga 
gtgggcgtgg 



ttccgagtgc 
tggccaactt 
acatgatcga 
acgtgatgga 
cggtaatggc 
cccgcatctc 
atctgtccat 
gctacaccgt 
tgtttagcct 
cggtcaagta 
gtacttacat 
gtggcccatt 
ggcaggagca 
tcctgtttgg 
agggatcaaa 
ggctgataca 
ttctcatctg 
acaaaaggct 
tggtcaaccc 
gcggcggggg 
catccatgct 
gggccttccg 
cagctccctt 
gtggctccgt 
cccgacaacc 
agccctcgac 
ccagcggtct 



ctcgatggat 
cacgcgactt 
cgtgggtgtg 
tcacgcacct 
actgttctct 
aagaagaaac 
cgccgatgtc 
ccagtggctg 
ctacctgagc 
tccgatgaag 
cctgtcgctg 
cgtggaggag 
gatgtacgcc 
cacctacatg 
gttggccaac 
caaggccaag 
ctggacgccc 
gggcgacgat 
actcatctac 
cggcaacaac 
aacggcggtg 
ccagcagagc 
taaggagcag 
ctctagcggc 
cagctgtctg 
cctggtgctc 
gctggacaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1644 



<210> 198 
<211> 547 
<212> PRT 
<213> Drosophila 

<400> 198 

Met Glu Asp Glu Trp Gly Ser Phe Asp Arg Leu Pro Ser Val Pro Ser 
15 10 15 
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Ala 


Ser 


Met 


Asp 


Leu 


Glu 


Thr 


Glu 


Asn 








20 










25 


Thr 


Leu 


Ala 


Asn 


Phe 


Thr 


Arg 


Leu 


Val 






35 










40 




Val 


Asn 


Tyr 


Thr 


Leu 


Asn 


Met 


He 


Asp 




50 










55 






Asp 


lie 


Ser 


Asn 


Leu 


Ser 


Val 


Pro 


Glu 


65 










70 








Gin 


Leu 


Ser 


Arg 


Ser 


Gly 


Leu 


Leu 


Lys 










85 










Ala 


Leu 


Phe 


Ser 


Leu 


Leu 


Gly 


Asn 


Leu 








100 










105 


Lys 


Thr 


Arg 


He 


Ser 


Arg 


Arg 


Asn 


Ser 






115 










120 




Tyr 


Ser 


Leu 


Met 


Phe 


His 


Leu 


Ser 


He 




130 










135 






Phe 


Cys 


He 


He 


Gly 


Glu 


Ala 


Ala 


Trp 


145 










150 








Ala 


Asn 


Glu 


Leu 


Thr 


Cys 


Lys 


Leu 


Val 










165 










Leu 


Tyr 


Leu 


Ser 


Thr 


Tyr 


Val 


Leu 


Val 








180 










185 


He 


Ala 


Val 


Lys 


Tyr 


Pro 


Met 


Lys 


Ser 






195 










200 




His 


Arg 


Leu 


Leu 


Gly 


Gly 


Thr 


Tyr 


He 




210 










215 






Pro 


Gin 


Phe 


Phe 


He 


Phe 


His 


Val 


Ala 


225 










230 








Phe 


Tyr 


Gin 


Cys 


Val 


Thr 


His 


Gly 


Phe 










245 










Gin 


Met 


Tyr 


Ala 


Thr 


Phe 


Thr 


Leu 


Val 








260 










265 


Cys 


He 


Leu 


Phe 


Gly 


Thr 


Tyr 


Met 


Ser 






275 










280 




Ser 


Glu 


Lys 


Met 


Phe 


Gin 


Gly 


Ser 


Lys 




290 










295 






Lys 


Leu 


Pro 


Thr 


Gin 


Thr 


Asn 


Arg 


Gin 


305 










310 








Met 


Lys 


Ser 


Leu 


Arg 


He 


Ser 


Val 


Val 










325 










Cys 


Trp 


Thr 


Pro 


Tyr 


Tyr 


Val 


Met 


Met 








340 










345 


Pro 


Asp 


Lys 


Arg 


Leu 


Gly 


Asp 


Asp 


Leu 






355 










360 




Gly 


Met 


Ser 


Asn 


Ser 


Leu 


Val 


Asn 


Pro 




370 










375 






Leu 


Cys 


Pro 


Gly 


Lys 


Gly 


Gly 


Lys 


Ser 


385 










390 








Asn 


Ala 


Tyr 


Ser 


Leu 


Asn 


Arg 


Gly 


Asp 










405 










Leu 


Thr 


Ala 


Val 


Thr 


Gin 


Val 


Asp 


Gly 








420 










425 


Met 


Arg 


Ala 


Phe 


Arg 


Gin 


Gin 


Ser 


Tyr 






435 










440 




Thr 


Ala 


Gly 


Pro 


Gly 


Ala 


Ala 


Pro 


Phe 




450 










455 






His 


Val 


Gly 


Pro 


Gly 


Asn 


Gly 


Thr 


Pro 


465 










470 








Ala 


Thr 


Pro 


Gin 


Leu 


He 


Arg 


Lys 


Gly 



485 
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Glu 


Val 


Val 


Ser 


Asn 


Trp 


Ser 










30 






Ala 


Gly 


Ala 


Ala 


Pro 


Glu 


lie 








45 








Val 


Gly 


Val 


Gly 


Met 


Ala 


Thr 






60 










His 


Val 


Met 


Asp 


His 


Ala 


Pro 




75 










80 


Val 


Tyr 


Val 


Leu 


Ala 


Val 


Met 


90 










95 




Leu 


Thr 


He 


Trp 


Asn 


He 


Tyr 










110 






Arg 


His 


Thr 


Trp 


Ser 


Ala 


He 








125 








Ala 


Asp 


Val 


Leu 


Val 


Thr 


Trp 






140 










Cys 


Tyr 


Thr 


Val 


Gin 


Trp 


Leu 




155 










160 


Lys 


Leu 


Phe 


Gin 


Met 


Phe 


Ser 


170 










175 




Leu 


He 


Gly 


Val 


Asp 


Arg 


Trp 










190 






Leu 


Asn 


Met 


Ala 


Lys 


Arg 


Cys 








205 








Leu 


Ser 


Leu 


Val 


Leu 


Ser 


Leu 






220 










Arg 


Gly 


Pro 


Phe 


Val 


Glu 


Glu 




235 










240 


Tyr 


Thr 


Ala 


Asp 


Trp 


Gin 


Glu 


250 










255 




Phe 


Thr 


Phe 


Leu 


Leu 


Pro 


Leu 










270 






Thr 


Phe 


Arg 


Thr 


He 


Ser 


Ser 








285 








Leu 


Ala 


Asn 


Tyr 


Ser 


Thr 


Ala 






300 










Arg 


Leu 


He 


His 


Lys 


Ala 


Lys 




315 










320 


He 


He 


He 


Ala 


Phe 


Leu 


He 


330 










335 




He 


Met 


Phe 


Met 


Phe 


Leu 


Asn 










350 






Gin 


Asp 


Ala 


He 


Phe 


Phe 


Phe 








365 








Leu 


He 


Tyr 


Gly Ala 


Phe 


His 






380 










Ser 


Gly Gly 


Gly 


Gly 


Asn 


Asn 




395 










400 


Ser 


Gin 


Arg 


Thr 


Pro 


Ser 


Met 


410 










415 




Thr 


Gly 


Gly 


Ser 


Ser 


Arg 


Gin 










430 






Tyr 


Arg 


Ser 


Ser 


Ser 


Asn 


Gly 








445 








Lys 


Glu 


Gin 


Val 


Gly 


Leu 


Leu 






460 










Gly 


Gly 


Ser 


Val 


Ser 


Ser 


Gly 




475 










480 


Ser 


Ala 


Leu 


Leu 


Ala 


Arg 


Gin 


490 










495 
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Pro Ser Cys Leu Arg Glu Gin Glu His Gin Gin Arg Leu Leu Leu His 

500 505 510 

Glu Lys Pro Ser Thr Leu Val Leu Ser Tyr Asp Ser Gin Arg Gly Gly 

515 520 525 

Val Gly Val Gly Val Ala Ser Gly Leu Leu Asp Asn Asn Glu Arg Val 

530 535 540 

Ser Ser Val 
545 
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1/89 



Celera Sequence No. : 142000012783465 

caaaaactaamcaacrflaaagccggcaagcaatggtatacaaaatraatraaflaaaaatrtgacaagagaaaatccacaaftaaafltgaaataaaattgaatgaaaaattta 
ccaaaaaccaagttgtttttattccaagtgtatgtagttggcaatcaaacgattcaactgcagttaacatgatttattcgctacgtaagtgtgcttggctaflgttgaataaa 
tatcattaatatctgtcagctacattaaaggttcaaacattacatatatatatatctatatatttatcttaacattattaaatgtttaagggcttaaaaaaaaatatcctta 
ggtacacgggtacgtatgtattattctgcgtctctttctatatcctctcaggatttgaatatattttgtattataaaatattacgcataaaatcatcggggggaacctgtta 
attcttggcatttgattcccgcattttctctggatatggcatagatatttttctagggttagctgtgtgaaacctaaatatttttaagttaaatttaaacaacgatttatac 
ggctgacacattttaaatttttctgggttcaagcttagattttgtttacatgtttttttcttaatgcttaacactatgctttatatactttttcaaaatatattttaaaact 
tgttcagtaaaaagttttaaagtgataattcttaacagagtttttttagaaactttttttttagaaagatcaagcatacaattatttgtttttatcaataattcatatgact 
agaacctttcttttctaaatgcatttcgggcatagtatcccaggataaaaaatataacaggaacattttaaataattgaaatcagtatgcattttaacaaaattaagttata 
aaaaaaaatattttgctatcatgaaataagtgtgccggactggtcttcaaaatctttgctcggcgatgcggtcaatgttacgatcgaccctcgcgcggtacgccttaaacgc 
ccaactctcggctgggcgtggtcgtggccatctgtccagtattactcccggccggcggcggggtcactaaactgggcaccaacttgttcagcagcgtgcgaatggcaaaggt 
gacatcgtgattggcgaagcagaacaggaaggagaccacaaagccctgcaggctgaccaacaccaccgatagcatctgatagaagtggtccagctgggtgccggcatccgga 
cggtagggcagcaggaagtgttgcagtccgaacagcggtaccagaataatggtggcccgaaccgccttccggatggccaacggtgccggctgggcggactggggatgtaact 
ttcgcacgatcactcgcagtacgttgatcagaaaaactgttgggtaaaatgcatatagctgagcaatattgtacttttaaagaaaattatgcatattactaaagctggccag 
cagcgaaagagttatgggcaccgagaagatccacaagtagagactgtcagttatccagcagctgcaataatgtgaaaagaaatgttaccatggatatagcgtagaatgtaat 
ctcttaataaatgcaatgcaaaaaccaaatcaatcaggattatgtcttgtgtaaaattgtgaaaataacttactgcttattgtccggactactgaaatgcctggccagaccg 
tagacgatggctatgggtatgggtgaaaaccagctgatgacaataaaccagcgcatcacaatcgtgtccttgacaaagacctggcgaatggaaagaagaaggggcgtcagga 
tctgaacggacattcctgacatctcaacttgacgcagacgtaaagcagttggccaacgttgcgatttgctctcagacaaattgctgggggtgaaccattgccatgtcagcgg 
ctgcaggaaatcacatctgccatgcacagaagtcgcccataaagaatgcagacttgtgcaggcaattgccagattgccagacgagtaaggataactcaccacaacgagcacc 
aggtgcaggtgcaggccctcgcaaaacatccagaaatagttgaccaacataaagtagtgcaccaccagatgcagcccaatgcaccacagctgaaagttagttatgtttatgg 
ttttgtataattgcctagctacttgtagggctatgttattttcatagttacttttctcagtgcatcttaccggattttcggctatggtttcggaacgctccaccaccagacg 
ataccataatatccaggccacacaggtgcaggccagcgaggcgaacagatgtacatggatgcggatgcgggtgcagcgcagggatctgcaaacggaatggaacggaacggaa 
agaaacgaaaagggatatccggccgtcagttaccggagggaaaggaaccgtgtgggtcgtagacaaacacttacttgaagcccaagaaaataataatcgatatgagcagggc 
cagcagggagagggcataacccttcacgtacagctcgttgatgaactggcgaaactgcagtggcaactttcttgattaatgcccatgtaccacccgcttgatgccgccgaag 
cacttacctccaggtcctcatagtccacgcagttggtgtaattggaccaggtctgattgctaaccggatgacggtaccaggagccgttctccaggcagctatgaatgcgaaa 
tgcttacgataattgtgactggctaagtgcggactgatgggtaaagggggtatttctaacaaagcgcctgacaatgcacaagttacccaactttaccaggagccagctgagc 
tcagttggcgtgtaaagtgagatgaacgctatccattaaattggagatacatacatgtagtgcaagttttagagtgctgctacagtacacttagggagattatctctgcaag 
agttgaagaaggttttatttttctttcattttccatcatctttttctgagatatgctgcgagtttatgtaatagacacgaatgtaaactctataggattttccctgtttgta 
gcgcactcatacataaacattgtattttatgacacacagaacacattaaagcaaatgccgaatatggcaagatattttcgaatttatg'ctcaaacgaatattacatattgca 
gagcttatttgaccatgcgaaatcctgaacggaatgaa74atctatcctattcgtggtggcgtaatatacaggcatattgggcatttaactcacgtcttgtgggccagaaatt 
ttctgttgaatccttcgacgaaatcgggacaatattgactcagcacagtgccggccggagtccttggccagcagagatatccgtcaaagttcagggggcaaaagagtcggtt 
ctccttctgatttgtggccatcgtggcactgacgacggtggccatcagctcctccttctccgccagctcctcgtccacgtttcgcatgtcaataacgccctcgttcatcgcg 
gcattctccagatatcgcggcactgggctaaagttctccggcaaatcggaatcgtagtctgtgcacggcgacaagtgccattaaaatgggtttactttttaatgtataaaaa 
tatagcatttttaataatagtatctatgggctgcataactgaaagtgatgtaattcagttttcaagttcaattcaattgcatgcttgctttacatatgtaaaataaagtggt 
atttcaaatgtacataacttgcttaagatagttaatagatcatcgttctttatagcagggtataattcataaattctggttgaaaatatatgtacatttggtaaaggtgttg 
agtgctatttaaatttataaatttgtcctgagatgccaatgatttgcattgaaacgaaaacccatgtttaaatttgcatccccaagaagtaagtcaatgattttgtatatat 
ttaaatacattactttgaaatgcattaaatgaaatcagatgggctgtaatttaatttaattctcttctctttcttatctgtcttatgcctctgctattttttcgcggcaaat 
cacgtatacgcctgttggccgccacrcacctgttgctgggccaagtgaaaatgatgggtcgtccgtatcgtatgtgacattctggtatcggaacacgcactccgcatacaaa 
tgctttaaaaatatgcgcagattatcctgcgtttgggcctgcgatgccacatccacattttgcgactccgaatctgaagaggttgtcgccattgtgggttattcaacgtgac 
taactcttgtcaacggctaatgcactaaaaaaaaatataataatataaatataattcaaagaattacgggataaataggataatcaactgcaataatttatcagaggcaatt 
tgaaaggttttaaacaagtgttgcttacccatagaaacttgtatgagctagtttttaaaatttattctgatttaaatatattttcttatatttaacattagttctcactgta 
tttcgccatagcgaagcgaacttgagtcccttgtggccgtccattcgtgtgtgcatttctaatttggttaggcttcaaagagttgcctgagttgggttgatatttgttgtgg 
ggtcctgtctgcggggaaatggtgccaaaggaaatggaatatccacacggggcacacacacacactcaaaatatcgaagcacctgagtgtcgtgtgagccgagtcacctact 
cgtctctcatatgttgtaaatcagtttattgtctaattaatggcggtgcttaactgtgtcgctatttctgctaccgtggttgggcgaacttaattgcatttcgtcgcacaga 
gagacaaagcaaatgccacgggttaaaaactcaaatgaaatgcactttgatcgtgggggcacacaaacacacaaacacataaacacactcacacacccgcaccgaacacgtg 
gcccaaaaggcgggatgagctgcaaagttcttgcctaacttggttatatttttaatcaaaactcatttcacttgattgagtttcgacagaagcctgcaccaggcactggaaa 
atctagcatgcacgatacatacaaagtaagatgattttcccgctcccaagctaaactttaatttgcggcattaatttattacaccgaactgggacagtttcccatgatggat 
atgacgcctggctaaacattcttgcatgttatatatatttttgcaagtgtctcggccaggcaattagccgcaggtaagcggagc 

(SEQ ID NO: 1) 

Exon: 4236.. 4062 
Exon: 3530.. 3230 
Exon: 2674.. 2564 
Exon: 2518. .2427 
Exon: 2325. .1679 
Exon: 1380.. 1001 

Start ATG : 4236 (Reverse strand: CAT) 
Transcript No. : CT4121 

ATGGCGACAACCTCTTCAGATTCGGAGTCGCAAAATGTGGATGTGGCATCGCAGGCCCAAACGCAGGATAATCTGCGCATATTTTTAAAGCATTTGTATGCGGAGTGCGTGT 
TCCGATACCAGAATGTCACATACGATACGGACGACCCATCATTTTCACTTGGCCCAGCAACAGACTACGATTCCGATTTGCCGGAGAACTTTAGCCCAGTGCCGCGATATCT 
GGAGAATGCCGCGATGAACGAGGGCGTTATTGACATGCGAAACGTGGACGAGGAGCTGGCGGAGAAGGAGGAGCTGATGGCCACCGTCGTCAGTGCCACGATGGCCACAAAT 
CAGAAGGAGAACCGACTCTTTTGCCCCCTGAACTTTGACGGATATCTCTGCTGGCCAAGGACTCCGGCCGGCACTGTGCTGAGTCAATATTGTCCCGATTTCGTCGAAGGAT 
TCAACAGAAAATTTCTGGCCCACAAGACCTGCCTGGAGAACGGCTCCTGGTACCGTCATCCGGTTAGCAATCAGACCTGGTCCAATTACACCAACTGCGTGGACTATGAGGA 
CCTGGAGTTTCGCCAGTTCATCAACGAGCTGTACGTGAAGGGTTATGCCCTCTCCCTGCTGGCCCTGCTCATATCGATTATTATTTTCTTGGGCTTCAAATCCCTGCGCTGC 
ACCCGCATCCGCATCCATGTACATCTGTTCGCCTCGCTGGCCTGCACCTGTGTGGCCTGGATATTATGGTATCGTCTGGTGGTGGAGCGTTCCGAAACCATAGCCGAAAATC 
CGGTAAGATGCACTGAGAAAAGTAACTATGAAAATAACATAGCCCTACAAGTAGCTAGGCAATTATACAAAACCATAAACATAACTAACTTTCAGCTGTGGTGCATTGGGCT 
GCATCTGGTGGTGCACTACTTTATGTTGGTCAACTATTTCTGGATGTTTTGCGAGGGCCTGCACCTGCACCTGGTGCTCGTTGTGGTGAGTTATCCTTACTCGTCTGGCAAT 
CTGGCAATTGCCTGCACAAGTCTGCATTCTTTATGGGCGACTTCTGTGCATGGCAGATGTGATTTCCTGCAGCCGCTGACATGGCAATGGTTCACCCCCAGCAATTTGTCTG 
AGAGCAAATCGCAACGTTGGCCAACTGCTTTACGTCTGCGTCAAGTTGAGATGTCAGGAATGTCCGTTCAGATCCTGACGCCCCTTCTTCTTTCCATTCGCCAGGTCTTTGT 
CAAGGACACGATTGTGATGCGCTGGTTTATTGTCATCAGCTGGTTTTCACCCATACCCATAGCCATCGTCTACGTTTTTCTGATCAACGTACTGCGAGTGATCGTGCGAAAG 
TTACATCCCCAGTCCGCCCAGCCGGCACCGTTGGCCATCCGGAAGGCGGTTCGGGCCACCATTATTCTGGTACCGCTGTTCGGACTGCAACACTTCCTGCTGCCCTACCGTC 
CGGATGCCGGCACCCAGCTGGACCACTTCTATCAGATGCTATCGGTGGTGTTGGTCAGCCTGCAGGGCTTTGTGGTCTCCTTCCTGTTCTGCTTCGCCAATCACGATGTCAC 
CTTTGCCATTCGCACGCTGCTGAACAAGTTGCTGCCCAGTTTAGTGACCCCGCCGCCGGCCGGGAGTAATACTGGACAGATGGCCACGACCACGCCCAGCCGAGAGTTGGGC 
GTTTAA 
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Start ATG : 1 (Reverse strand: CAT) 

^rTSSDSESQNVDVASQAQTQDNLRIFLKHLYAECVFRYQNVTYDTDDPSFSLGPATDYDSDLPENFSPVPRyLENAAMNEGVIDMRNVDEELAEKEELMATVVSATMATN 
QKENRLFCPLNFDGYLCWPRTPAGTVLSQYCPDFVEGFNRKFLAHKTCLENGSMYRHPVSNQTWSNYTNCVDYEDLEFRQFINELYVKGYALSLLALLISIIIFLGFKSLRC 
TRI RIHVHLFASLACTCVAWILWYRLVVERSETIAENPVRCTEKSNYENNIALQVARQLYKTINITNFQLWCIGLHLWHYFMLVNYFWMFCEGLHLHLVLVVVSYPYSSGN 
LAIACTSLHSLWATSVHGRCDFLQPLTWQWFTPSNLSESKSQRWPTALRLRQVEMSGMSVQILTPLLLSIRQVFVKDTIVMRWFIVISWFSPIPIAIVYVFLINVLRVIVRK 
LHPQSAQPAPLAIRKAVRATIILVPLFGLQHFLLPYRPDAGTQLDHFYQMLSVVLVSLQGFVVSFLFCFANHDVTFAIRTLLNKLLPSLVTPPPAGSNTGQMATTTPSRELG 
V* 

(SEQ ID NO: 3) 

Name: Calcitonin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384694 

GCCGTAAACTATGGCGATGCCCCACTGTGCCCGCACACAAATTCCGCCACCTCGCTCCGAAATCGTGTGTATTTGGTTAATAATAATCTAGTTGGGCATCGATTGTTCGG 
CCAAAATGAAGCCACTATCCGTGGCGGAGTGTTAATTAATTTGCACTTAAGCGCAGCTGCTGGAAGCACTTGATAGAAGTGGGCATGCCGGTTCTCTTGGGAAGTTGGCT 
TACTATAGTTTAGCGTTAAGGCGTTAACTTGTGGCCTTTAAAGTTCTTAGAGAAGACTTTCTGCAGCTGCTCAGGCTGTTCATTGCGGCATCTAGCATAAAATTTAGAGT 
TTGAAAGTAAGTTTCTATGAAAAAAAAATACACGTATATATAAGTATAACAAGAAAAGGACCAAATCAAAGTTTATATACCATTGCTGTACTTATATATTTGAATATTAA 
ATATACACAAATAAATATATACCGAATAAGAACACTTTATGATTAAAAGCAACAATTATTGATTAATCTACTTCTACCTAAACTGGTGTTGAAACTGGTATAATGAAAAA 
CAACTAAGGTACATTTACTTCCATTTAATTTTCTATTGGCAGCTATATAATAAACAGTTCCGATCCAACGTGAACCGACCTATCACCATTTTTGATAAAGTTTTACGCAG 
AAACGTAAAAATATCTAAAAGCATGCTATTTTGTATTTGTATTTTrGTATTTTGTAGAGAAAAGTTGTAGATCTATATTTAATTTTGCACACATGAGCATTTTAATCAGT 
TTTGTATACATTATTTCGAACTCTTCACCAACAGAATAAAATGAAATTAAAATGGTCCGATGTCGGGGGCCAAACAAAATAAAGGCGTATTGAGTTGTGCCATGTTTCAG 
GTGGAAAAACCAGCTGCAATATAATTGCTTTTGAATAATGCGAATrTCGGGGGGAAGTTTCTTTTACCAGCACAATATAAACAAAACTGCTCAACTATTTCTCGATATTT 
ACATTTTCAGGGTCCATCAGCAACATAGATGACGCGGAGTATCACTGCAATAAGACGCAGGTGCGCAAGGTCATCTCGGGGGTGGTGGGTGCCGCCTCCTCGGTCACATC 
CATACAAGTTGCCAATTTATTGCGGTTATTCCGAATTCCCCAGGTGAGTTTCCATTGTGTGATCACTAACTGTTCGACTCAGTGCGTGCGAAGTCCAAAGGTCATGGCAT 
AAAAGGGCAGTCCTTTTTGTTTTTAGTTCAGATTTTGCTTTATGCTCGCAACTACATGTATATGTGCTAGTTTGGCCAACAAAAGTTTCTCCTATTTCGTAATTAAAATT 
TCCAGGAGCCACATGGTAACTGTTGCAATGGCTTGCAGGTCTTTCGGTTGGCTTTGCCGATTCCTAAGTGTCTGCACGTGCCACTAAflTTAGTCGAGTCCCGCAGGCCAA 
CTAGAGGAACATAGTGCTGCAGTTGCCAAGAAGTGTACAGTAGGGACTCTCAGCAATCATAACAATTAATGTAATTATTTTACTATATGTTCAATTTATCTTTAATATAC 
AATTCCAAAATGATGATTCTTAGCTGCTAAAGAACTGACTTACATATTAATTTCCTACCAAAACCAATAACAAACTGGACAGATCGCTTCAGTTGAGTACTTCGACTATC 
AGATACCCGTCACTTGCATTACCGCTTAAAAAGCTTGTATCTAGCATTGCTCGTATTTCAACTGCAAACGACATTGCAGTTTTTTGGGTGTTCCTCATGTAGTGACTAGA 
GCGTGAAAAACGCCACAAACTAATATCCATTTCTGATTCAGGAAGGAATATAACATTTTCCGAGTGCCGGCATGAATCACGTGCAGGAAGTTGAAGGAGATTGAGTGCCC 
GGTTCGGCTGCTCCAGCTACGTGAGCTGGAGCCAAAAGTCAACAACGCGACAATAACTAAGTAGCCAACTGCCAGATCCCTTCTCGATATATATATATAGTATGTACTCA 
CTACTACTTGGATGCCATTAAACGCACACGAAGTGCCAGTCGAAGGAGCAGCAGCCGTTCCATGCAGGCCCAGCTATCGTCCCGAAAGGACCTCATAAATCAGCAATAAA 
CATTTGATTGCATTTGGTTTGCACGCGTGGATTCTGCGCTGGCTTCGCATAAACAGCACTTTCGAGCAAATTTTGTGTGTTCTGCAATTCAAATTTCTGGTCGATAGCGA 
CTTAGTGAGTTTCGTTTAGGCTTAGCGGAAATTGGCTGCAATGCAAAATGTTCGCAAGGACAC^AATTCAAATGACAATGCAGTTAAAAATTCCGTAAATGCAATATTTTA 
TGGCTTATCGTATTTGGATAACAATGTTTTAAATGCTTGAACAAAGCATTGCTGTCTTTTTTGTTTAAAGTAAAATTTTTGCCGTCAGAAGTTTGTGCATTGAATTAAAT 
GGTTATTCAAACATTACATTTTATTTATCTTGAAAAAAGTACAAAAATATGTATTAAACTCGTAGTATTATATATAAATTAAATATATTTTATTATTTTTTGTTTTTATG 
GACTGAGTAATAGCACTTGAATTTTCCAACTAGTTTTTCACAATTTTTCGTACTTTAATAAAAGTAAATATTTGCATCTCTAGTGTCAATTATCAAAAGTCAACAAAGCC 
AAAGCTACACAATAATTTTGCTATAACTCGACGCTTTAAACGCTCATGAAAATCAAAAGTAATTCTGATTAAATAAAAGTGAGAATGCCATAAACCATAGGTAAATTAGA 
CTCAGAAATAAGGTCTGGTATGAGTTGCAATGCCTGTCTTGCCATGAGCATTTGACAAGTAATCAAGCACTTACGCAGAAGCTGTTGACTAATGGTAATGACACTTAATT 
AGAAGCCAAGTAGTTCATTAAACACTTGCAGTTGACTTGCCACGCCATCGACGATTTGCCCCAGGATGCACCTACTTCCTGCCCCAAGTGTTTGTGGGCGGATCCTTCTC 
TAACCAAACGGTTTATGTGTGCGTGAAATTGGTTTCCCGGCTTGGGCCAGGTCACCACAGTTGGCCTTTTGCTGCAGTTGCCACAGCTTTGGGTTTCGCAAGGTAATGCC 
AGCTGATGTTTTCTTCGACAGCAGCATTGTTTGTGGTTTTTGGAAAACAGCTTAAGCGAGAAAATATGAGCGAAAAACTGGGAAGGAGCTTACGAAAGGTTCTGCAGAAG 
CGATAGCCCAGAACGAGTGCGAAAAAATATGAAAATTAAATGGCGACCTTAAAGTGAAAGGGGTTCTTACATAAAATATTGTAGCACACTTGCCACTTTGTGCCACCCAA 
GCAATAGCGTTTCCAACTAAAATGTTATCCCATCATATTGTAACCCGCCTCACCACATTTTTTCCATCCAGGTGTCGTACTTCTCCACCAGCCCCGAGCTGAGCAACAAA 
CAGCGATTCGAGTACTTCTCCCGCACTATTCCCTCGGATCACTACCAGGTGAAGGCCATGGTGGAGATAGTGAAGCGGATGGGATGGAGCTACGTGTCGATTATCTACGA 
GGAGAGCAATTACGGAATCAAGGTGAGTGTTTATGTATGGAACTTTGATTAAAATGGAAGAAAGGAGTGCATCTATGGCTGAGAAATGGAGTTAAAGAATGAGAAGCTCT 
ATGTTTTAAAAGAGCTTGTTAAAATTCAAATTTAATGTATGTTGCTTCATTTTACTTTGGATACTTCTTGGATAAAATCTATTTTGGATAACCATTAAACATTGAAAAGT 
CAATAATAAGTAAGTTCAATAAGTTTCAGCCACTTAAATTTCCTCTTTTTCAAAGGACTACATAAATTTCACCTAATAGGGCAAATCATTTAACTAGTAATCCTTAAACG 
GTGGAACTTTAAAATAATGCCAAGTGCCGTTTAAAAGCGTTAGCAGCACTCCATTATCATATGTTTATATCCTTAGCCGCTTGCAGTGTAAACGATATACATTTTAAGTG 
AACGCCTTGTATCCTATCCAACTTGTACCTGTGTAAACATTTTACCCATCCCACCACATGCGTTTTTATTGCTCACCCACACTGCACTTGTTTTCAGGCTTTCGAGGAAC 
TGGAGGAGCTCCTCGCACGCCATAACATTTGTATTGCCATCAAGGAGAAGCTGGTGAAGGATTCGGGAGTGGCCGAGGACATCGCCTACGACAATATAGTGCAGAAGCTG 
CTGACGAAGCCGCGGGCTCGAGGTATGTAACTGGAGAGTGGGAAGTTGGGATTGCAGAGAGGGATCGGAGGCGATGTGGCGGAGGAAAAGTGCATTTCCAACCGGAACAG 
CGACTGCAGTCTGCCAGTCAAAATGCAATTTAAAATGAAAATGTCAAAGCGAAGGCGCAGTGCAAATGCGAAAGGAAAGCCCTTCGCAACGCCCGCACAATGAATGCCAT 
CCGGAGTCGAGAACTATGCTAGGATTTTTGGCTGGGCGACGGAAATTGCGTAAATTAAATGAAAATATTTAAAAACTCCCTCGCCAGCGCGCCGTTGTGCCATCTGTGCC 
AGCGGGAAATTAAAAAGTTCACGACAAAACAAACGGAAAAAGTTATTTGCAAATAAATTGAGAGATAGTCGCCCAGAAGTGGGCAAACTGATACGTCCAATACTGTAGTT 
GTGCAAAAAAGTGTGTATTTGTTATGGCCAAAAATGAAGGTCCTGTCTATTGTCCGTGGACAAAAAGCCAGGAGTACTTGGCGTCCCTGAGGTCTGATGAATGACACCAC 
CTATTCGACATGGCAAGCAATTAAAAATTCCAGTCAGGGTTGGCCAAGTTCAAATTTATCGGAACGATTTTTCATCCTAAATGAAAACGATTTTTCCACATGGCATTGCC 
CCAATCTGCGCCAATCATGCGATTCAGCAGCGAGATAAACAAAAAATAAATGTCAATCAATTTGACTTTTCACATTTTGTGTTCCCCTGTCGAATATCAGCTGCCATTCG 
CAGACGCCGAAAAGAAATATAAATTAAAAATAATTACCGCCCACAGCTTTGCGCATATCATTAGTCAGTGCCTCGAAAGGGTGAAGAATATCAGGGTCGTGCCCGCCAAT 
AAAAAGCACACGGGTCGACCAATAACAATTGACTTTTGTCTGGCAAACAATTTTTGATTTACATAAATGCGCCATCAGGAGTCGTAAAGCAGTCGGTCGCCCAAGAAAAG 
GGTTCCTCCGCCACCATTATTTATTGTGATTTGTGATTTGACGCCGCTTACAGGGCGCTAGCCGCCTCGCAACGATTAATCAATTTGACAAAGTATTTTCCCTGGCATCA 
GTTGTTTAATTTTGCATATCGAGGGTTCTTTCACTGCTCCTGGTCCTGGTCGTGGTTCTGGTGGGGGCTCCAGTTGGGTCCCTGAAAAATGATGCCCTTCCGGATCCGTA 
ACTGCGGCTGGTAACTGCCGCCAGGGGTGAACTTTACGCTTGTTCGGATCCCCACACATGTCACTCCGGAGCTAAATGGAATGTGGGCGGGGGGCCGACATGTTTTTCCT 
TTTTATCGCCAATGTGGCAAGCTGATGCGATTTCAGCCAGGACATGACCGGCCCCCACCCGCTTTAGATCACTTTCTGGCCCGGCTTTAAAGTTTCCCCCTAATCTGCAT 
AAATATCCTGCCGAGTGGGCTCTTAAATTAAATTTTATATCCCTGCTAAGAATAACCGTGGTTCTCGACTGGTATAAACAGGCTACTGTGTACGAGAAAGCCGCTCAAGT 
GCATGTCTAAGGCAAAGTGAAAATCCAAGGAAAATATCCTTAACTTCACAAAAGTATCGACAAGTTTTCCCGTTGTCGCCGCCGCTTCCTTTGAATAAGATTCCCTCCTC 
TTACTTCGAGTGGCGGAGCGGCAGTGGCAAATCGATACCGGGACATGTTGAAGGGGCAAAAAATTATGGCTAAGACAAAGGACCTCTTGGGCGGACCGTTAAGAAGTGAC 
AACTGCAGCTCACGTATCCCATACAAGTTTGTCCAGAGCGAAACAATAAGGGCTGAAAGCGTTGGAAAATTTGGTAAGCACCTAACCAAAGGATAGTTTTAGAAGCTCCT 
GATTTCCATCCGTAGAACACAAGCGACTTAGGTGTCGGCTTTACCAATTGGCCATTGGGAACGTTGTTTGCTTGGAGTTTCTAATGATATCTTCACACTTTTTGTGGACT 
TTCTGTTCGCTTCTTGGGACCAATTAAAATATTGTTTTCCTTGCAAATTGAAGAACAATACCTTGGCAGAAAATCTAAATATAGTAAAGATACGTCGCCAAATGGCGAAC 
ATCTGAAAAAGTGGTTGCATTACATCAATATATACCCTTCGTCATGTGTACCCCTTGTACCCCCCATACCCTTGTTTTATAGACTTGTTTTACGCCTAACGGCTATAGTA 
ATCCAACTTAATTTATATAGCATTTGCGTAATTAATATTCAAAGCGAAATAATGGGCTGTCATTTAATACTATAAAAACACTTTAATCCGCCCTCGGAAAGGAGAACACA 
AGAGCAGTTGGCAGCTGCAGATTGTTTTCTTTTGCGCACAAAAGCCGCTCATTCACTATTGATTGCAATTAAAATTAAATTCTTTCTTCCTCCTTTCGCATCCTTTTAAT 
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ACGCCGCTGCCCAGGGGCCflTTATCTTCGGTTCGGATCAAGAGGTGCGCCAAGTGATGCGGGCGGTGCGCCGGGCGAATGCAACTGGTTCCTTCTCCTGGATCGGATCCG 
ATGGCTGGAGTGCCCGGAATTTGGTGTCCGACGACTACGAGCCGGAGGTAGGTGAACTATTCCGTCAGACCGGATGACGGGCCAGTTACCGTCAGCCGGAAATTATGAGC 
CTAAATGGGCGGAACAACTCGGTTTGATTAATGACTCGGATTTCGGTCACATACTCACGCACGCACATACGBGGGCCCAACCCCATTACATGATAACCAAAAAATTATAA 
ATGATGGCGGCCGAAAGAACACTTTCACACGCCCTTTCCCATGATGAATTATGGTTCAATTTGGATTAATCGTGCCTTTGTCTGTGCGTGGCTAATTAATTTGTCTAAGC 
CCTTCAAGCCCAGAAATGGTCCTCCTCTCTGATCAAGCTCATTAAGTATACGCCGCGTGTGTCTTGGTGTCATCAACAGGTGGAAGGCACGCTGTCTGTGCAGCCGCAGG 
CGAATCCGGTGCGCGGGTTCGAGGAGTACTTCCTCAGTCTGACGGTGGAGAACAATCAACGCAATCCCTGGTTTGTGGGTAAGTGAATTACCGATTAAAATTAAGCTGGT 
GTTCATTAATGCTCTGATATAAGTAGTCCTTTATTAGTATCCCCTTTCGCTAAATGGTTATCTTGATTTCAGAATTCTGGGAGGATCACTTTCAGTGCCGATATCCCGGC 
AGTACGAGCACTCCGTACAATAACTACACCAAACAGTGCACCACAAAGGAGCGACTTTCACGCCAGAACACTGACTTCGAGGACCAGTTGCAGTTTGTCAGCGATGCGGT 
GATGGCATTTGCATACGCCCTAAGGTACGATTAAGCTTATTTTGGAAGCAACCCACTATCATTTTAAACGTATTAAGAAAATAATAGTTCTTTTGATCAACCCTTTTAGT 
TTTTATCAACCTATGGAGTAGCAATAAATAAAATGTAATGTGATTTCCTCTCCTTAGGGATATGCACCGCGACCTGTGCGGCGGAGGTCCTTCGCTCTGCGAGGCCATGA 
AGCCAACGAAGGGTGCAGACTTGCTGAAATATTTGCGAAAAGTGGAGTTTGAGGGTGAGTGAGTAATAGAAGCCGTGTATCCACTTTTGCCGGATAAAAGCAAGTTAAAT 
TAACGGAGTAACTATTCCCGGGAATTTTCAAAACAGGATGTCCGACTGACTCAGATCCCTTTGCTCTCGCTTAGTTCCTTCAGTCGGCAAGTGATGCGCACATAGCTTAA 
GTGGAAATTTCCTGCCAACCCGTCCTTGCAGTTGGGCCAAAAGCCCCGCACACACACTTACACACACAATGGCATATCTCAGCCTGCGAGAGTCCTACAAGGCGCATTGG 
CCATGCTTTCGAGTGGAAAATTGACTTGCTTGTCATCGTCATTGAACTCGATTGAAATTCAATAAATTACACAGACAGACGGACGGGAAGGAGCCAAAGGCCCTGACTGA 
CTAGCGCAGATTGTCAATTTTCTGATAGGGCGAGCGGGCAAGCGGCAAAGTCGGCGAAGTCGTCTCTGGCAGGCGTGAAACCATAAACCCATTTAACTCCTATTCGATCA 
ATCCCTCCGCAGGCCTCAGCGGCGACGAGTTCCGCTTCGACGGGAACGGCGACGGTCCCGCCCGGTACAACATCATCCACTTCAAGCAGTCCCAGGCGGGACAGTACCAC 
TGGGTCAAGGTGGGCGAGTACACCGAGGGCGAGCTGCGGCTTAATATGACGGGTAAGATAAAGCCCCCAGTAGGAAATACTTAAAAGTATAATACCAGGAACCCTTCAAC 
ATTAAGTTTACCTGACTGAATTCGTGCATCCTTAGTCTCCTGTAACCTAATGTATTCGTGCTCACTGCTCCTCATCAATACTAATTTTGATCTCATTATCCCGCTAACCC 
AGAGGTGAAGTTTAAGCGGCTGAGTCCCAAGCCGCCGGAGTCTGTCTGCAGCCTGCCATGTCTTGTGGGTCAGGCCAAGAAGTACGTGGAAGGCGAGAGCTGCTGTTGGC 
ACTGCTTCAATTGCACAACCTATCAAGTAAGTGAAGCCCATCTGAGGCCAGTAATTTATTTGTTGGCCCTTCGAATAGATTGAAATGTAGCGCCAAGTGAATTATTCCGG 
AACTAACGCGTTGACGGGAGATGGGAAAAATGGGGCAAAAACAATTAAGCCATCAGGCCTGAAAGGGGCTGCAAGTGAATGGCAAACATGCCGCCTTTGTGCTCGCCCCA 
TCCATTTATCAAGTGTTCCTTTGTGCGGTGCGTCCATTGCAGATCCGCCACCCGGATGACGAGACCCATTGCAAGCTGTGCAAGCTGGGCACTCTTCCCGACGCCCACAA 
GCAGTACTGCCGCCCCATCCCGGAGATATATTTGCGTCCGGAGTCCGCTTGGGCCATCGGTGCTATGGCGTTCAGTGCGACCGGGATCTTAGTAACGCTCTTCGTAATGG 
GCGTATTTGTCAGGTGAGCAGCGCATACACCGTGATAAAAACGGGTATAGGAAATTACTTTTAATAATATTTAATAGTTATTAATGGCTTTTATAAAACCATTTGAAACA 
ATTCAGTCTTAAGATCGAACATTTAACAATGAAATTTGCTTTCTAAATCCAAAAACCCATCAGGATGTTATATTCCTTTTCTATTAAGACCATTTTTAATAATACATAGT 
CCAATATTATCATAAAACCTTGTTAATGTATATATTTTCTCAACTTCAAATAATTTAGACAATATATAATACTTTAACAATAATTAAATAATTTGAAGATAAGGTAACGG 
GTATATTATAGTCAATGCATTCGTTCTTGTGTTTATAAGTATAGTTATTGTTATACTAGTGACAAGCTAAAGGATCTAGATCAAGCTGATCTGAGTGCCAAATATTTCAT 
GAGGGCCTGTTTTCTAGCTATGTTTTTCTCTGACTGTATTTATTGGCACACAATTGTCCGCTTGCATGGATATGGAAAGCAATTAAAAAGTTTCACCCTTTCCGGGCAAA 
CTTCGCCACAAAGTTAACCGGCAGACACATAAAAATGATGCATTAATTGGCGCAAGAAAACGTCGTCGCAAGTTTGAAACTGAAGTAACGCCTTTCCCTCCTTCACCTTC 
CACCCAACAGGCACAACGACACGCCCATAGTGCGTGCCTCTGGTCGGGAGCTGAGCTACATCCTGCTGGCGGGGATTTTCATGTGCTACGGTGTCACCTTTGCCCTGGTC 
CTGAAGCCGACCAACATAGTGTGTGCCATTCAGCGGTGAGTGTGCAAATTGCCGAGATGATGCGTGTGAATTTTCCTCCGAAACTCCCTCTTTTCCGCTTTTCCGCTTCC 
ACACAGGTTTGGTGTCGGCTTCTGCTTCACGGTCGTTTATGCAGCACTGTTGACCAAAACGAATCGCATTGCACGGATCTTCAAAGCGGGCAAACAATCCGCCAAGAGGC 
CCTCCTTCATTAGTCCCAAGTCCCAGCTGGTAATCTGTGCGTGCCTGGTTAGTGTGCAGGTGAGTGTGTGGAGTGGTCTGGTGGCGCTCCTGGCGGTGACCGACGGGGCG 
GATGTGGAATCGAGATGCCCTGCGATGACTCCGTGTCATTCTTTCCATTCACTCCATTCGCAGATACTCATCAACGGCGTTTGGATGGTAATTGCGCCATCACATGCCAT 
GCACCATTATCCCACACGCGAGGATAATCTGCTCGTCTGCGACTCGTACATCGACGCCTCCTACATGATTGCGTTCTCCTACCCAATCTTCCTGATTGTTATTTGCACGG 
TCTACGCGGTGCTCACGCGAAAGATTCCGGAGGCTTTCAACGAATCGAAGCATATTGGTGAGTAATTGAGCTTTTTATTTCCAGAAGAGCTATAATTCGGCGTAATTGGC 
ATTGTATTGGAAGGGTCTGTACAAAAAGCCATGCATGGGTTTGCATTGTCTATAACCTGGTTCATTATAGCAGCAATGCTATTAACTGGTTTATTTAGTGCATAAAATAG 
CACAACATGTATGAATGAACTCCTTATAATTTAGTTATGTGTTTTGTATTTGGGTAATTTGGTTATTTCGAAACAATACACGCATGAAATATTTATAAGTCTATATTTTA 
TGCATTGCAACCTCAGAGTTCTAGTGCAGAAATGTTTTATAATTATGAAGCCATAATGCCCATAATTGGCCACTTTGATTGATTACTTAACCCTATTGTTACAGGATTCA 
CCATGTACACGACTTGTGTGATTTGGCTGGCTTTTGTACCGCTCTACTTCGGAACCGCCAACCACGTTCCGCTGCGCATCACCAGCATGTCGGTGACCATCAGCTTATCC 
GCCAGCGTGACCATCGCCTGTCTCTTTTCGCCCAAGGTGAGTGAGTCCGGATAGCTGGGAGTGGGTTGGGCTTCGGCTTGGGGGGATTCATATCCACACGGGTCCTCTGT 
AAATTGATTGGTAATTGCGTTATGCATTAATTATTGCGCATTTATCTGGCGACCGTTGAATGATTTTTGGTGCCACAGTATGCAGAAGCCATCGGCGATTTTATTCCATT 
TCATTCCGTTTATTCAGGACGCGTGGACGAGTGTTTCATTAAAATGCAATAAGACGGAAGATGAGGACAGGCGGAAGCCAAGGGTTTGGCAATTCAACAACCGGCTGCAT 
CGATTGCGATGTGTTGCTTCCAACCATATTGTTTTTAATGCCTACCCTTCCTCTTGGTCCATATTCCTTGGTCCTTGCCCATTTCATGATGTTTCCCCGAGATGAAGCTT 
AAATGTCAACAAGCCATGTTCTCCCACGGAGCTGAACTGGTCATGATCTCCTCCCAAAGGCGCGCGTCCTTTATGTTGGCATACTCGGTCCCCATGCCAACATATTTTTT 
AATACGCCCTGCTATTAGCTATTAAGATGCAAGTAAACTGCCAGACAAGCCAAATATAGCCAGCCAGTTGAGTTCATGGACTAGATGATGCAGCATGGCAAAGAAGCTGT 
TTACCCCGATATTAAACGTCAAGGAGGAAGCTTACATTTAAGCACTGATATCTTCATTACACGGAAACACGACCTTTGACAGCAATGTGCAATGTGAAACGCAGGAAAGG 
AATTTTATAAGAAAATGTCGAATAGCTTGTTTGATTATTTTTTTTGTATAAGTATTCCAAGCATGAAAGTTAGATCAGAAAATCCAAGTTATATTGGCAAATAATGAAAG 
TCTCTTAATTGTATGGCTATTAAAGGGGAAACACTTTTCTAGTTAGTTTCCAAAGGCTTCCATATAACACTTAAAGATGCCAGAAAGGACCTTCGCCAGTGTCTATGCCA 
TTGAAGAGCAAGGAGGTAGAAATAGTAAATCAGCCAGAAACTTGATTAGCAACGGAGCGGAGCTCCCACAACCGCACACGTCATAGTTCTTGGTCCCGGAGCGGAGCAAG 
CTACAGTTTATGGTCTGTCTGCATATTCCGGGTGCAAAGTAGAACAAAACCCGGGGAGGCGAAGGGCCGTAAACACTCAGCAAAAGGCTATTATTAAATGTCGGAGTGGA 
GCATACTTCTTGGCCCAGTGGCTGCTAACCGGGCCAACTGTACGCTATCAGATGTCACCTAATGCACGGTTGGTTTGCGACGAGGAAGGAACACTGTACTGCGCACACGG 
CAAATACAGTGTGCACAAGCACACGAACCGCACCGTACACCTTGAGCCATAAAAATGCAATTTACAACTGGTTGTGTCTGGCAAAAGGAAAACAATAAAAGGCTGCCATG 
AATGCGGCATGCCACACAACAACTGGGGATCACCGGACCAAGAGGCAACAAACTAAGGCCCTGGCGCAAAGGTGCCGGCAGCCACGCCTCCTTCCGCAGGACAAAGGCTG 
GCAACCGAGGTGGCAAGCCATTCCGACTTCCGACTGCCACGCGCCCTTTCTGCTGTCCGTCCGGCTCTGCCGCAGTTTGTTTTCGAACTTGCCGCGTTTGTTTGCGGTGA 
AAACAATAACAAGGGGCGTCCAAATGGCAGCCAAACAATGTCAAAGCAAAGCGGAAATGACCCACCGTGCAGGCTACGTAACAGTACGCAGCGTATCGCAAGTTGAAACC 
TGGAGCACTGCTCGCAATTTGCGCTAAATTTTCACTAAGCGTTTTATAGATGGTGCCATTCTAGTAAGGAGTATGTGTGCTCTGAACCAGGAACTCGCTCACGTGGCCAA 
AAACAAATTCGATTTGGCTTTAAATATATATTAGAAAGTCAAGACACCTCAAGAAATTGGTGGGAAAGTGTTTTCATGTATTGAATAGTAATCTTATTAGCGGTATTTTT 
TACACCTCACTAGTAGAGTGAAAGGGTAGACTAGTTTCGTTGAAAAGTATGTAACAGGTAAAAACAAGCATTTAAAGTATAAAGATTATTGATTGGAATCAATAGCAGAT 
ACGATCTGACCATGTCCGTCCATTCGTATTAACGTCGAAACTAATAAGTTGAGATAAAGCATACTGATTCCAGAGACGTAGACGCAGCACAAGTTTCCGTATTCAATATC 
CGTTCCGGCAGATCTTTCTATAGTCGGGGAACTCGACTATAGCGTTCCCTAAATTATATTTAAATTAATCAAACAAAAATGAACGAATGCGTGAACGTGTTATGATTTAA 
TCCGTCTATTACATTGTTCATTGTTATATTATTCTTAAAGTGTAAGTATGGTAATATGGGGCTCAGGCGTGCACTTCTGCTCCCTGTAAGTGGCCCAAGTCGTTGGCCTT 
TGCTTTGACCACATTACTGCTGGCCGTAGAAGCAGGAAATGGCTAATGACACTTGAATTTGGATTAGGAATGGGACTTGGTGGTGGGCGTGGCAGGTCACGCACTGACTT 
CTAATCAAAGCCGTAAGATAAGCCATCTTGGGGAACAACGAAATTGGATATCGCCTCCAAATAGCTATTACACACAATGGGTCGTTCGATTAATGCCTCGAACGGGAAAT 
TAACCTAATTGGAAAGACGCACTGCTTTTCTGACTCGACCCTACGCTTGTGTCCTTGCAGCTGTACATAATACTCATACGCCCCGAGCGCAACGTGCGCCAGAGTATGAT 
GCCGCCTCGCTACGGCAACATGCACCGCACCGCTGGCACAGGACCCTCGTCCATGATGGCCGCCGCCGTGGTGACCGCCGCCACCTGTGCCCAGGAGGAGAAGATCCAGA 
AGCATATCACTCCCACAAACACAGGTCAGTAATTAAGGGATATCGGTTAACGAGTCCCGGGTCAGTAAGCGTAGCGCTCAAAATACACACACTTCGTGTCGGGCAAATGT 
ATTATTGACATATGTAATTTGCTGTTGAATAGTGCAATTAAGAGGTATAACAGTTGCTTGCCTCGTTAATCATTATCGATCAATTAATCAAATTATTGAAAGTGAGTCGA 
TTTTATCTCCTGACAATGTATTAATCGTATTTTGCAATGAATTTATATCAATTAAATTATATTACTTTATTAACCCATTGCACCAGCTGTACCTATCTCTCGTATTCATT 
TTTAATTTTTGTGGAAATGCTTCAATTAATTGAGAAAAGTTTATCATAAATTGGGTAATATAGCAAGTCTTAAAATAACTCAAGCTGTAAAAGCTGAACAGTTTTATCAA 
AGTAATAGTTAGGCTGACAATCAAAAGGTATATCCTTCTTAATTTAATTGATTTTAAAATGACTAGGATAAAGAATAAAACTGAATTTTTTCCATTCACTACCCATAACT 
TGCAGAAAACTCGCTAAAGACGAAGAAATTGTGCGAAATGGCCACTCAGACAATTTCCAGTATAATCACGTCGCTGGACATCAATGCGTACAATCAAATACCATATGCCG 
ATCAATATGTGCCAAACAATGCCACAACGGCCACGACCACGCCCACAGATAAGGGCAGTGGCAACGGCAACAGCAACAGCAATAGCAATACGAATGGTAGTAGCAATAGC 
AACAGCAACAGCAATGGCGATGGCGATGTGGCAACAGAGCAAGTCGTGGCCAACAATAATAAAATCAATCAGCGACAACATGGCCAGCCGGCCGTGGCATTTGCCATCAC 
ATCGACTAACAATCACATAAGCGGCCCAGCAGCAACCATAACAACAACAGCGGCAACTGGCGAAACAACAACTGAGGCGCTGGTGGCTACAATAACAACTCCATTGGCAA 
CGGTGGATGGCAGCGAAACCATGTCCGTGGCTCCCAACAATGGGCATGGAAACAACGGGAGCCAACGTCCGCCGGTCCTGCAAACAAACTTATAGCTAAACGCGCAACAA 
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TTGCAGCAACAGCTGCTGAAGGAGCAGCAGCGRCAACAAGCCGAGCTGCAAACGCAACATCCGCAGCAGCAACATGGAACACAAAGCTCTTCCAGCGACACCGCCAGCflC 

GGGCAGCGGCAGTGCGGCGAAGAAGCGGGCCGCAATTCCCGTTTAGCTGCCCCCGGAGATTCGAGATTCTCGCAACCATACGTAAGTAGAGGAAAGTGTCTGTTGACCCA 

AGCGATTCCCGGCATCAGTCTAGGGATGCGGAAAAGCAGACAAGAGGAGGGCAGGAGGTCGTGCTAGGACGGGACTAAATTGGGAGCAGCTGGGCCACAGAAATATCTTC 

AGATCTTAGAAGGCGAAATAATTCATTTACCACTTTTGTTCGCTTATTAATTTGAGTGAATTCTGGTTATACCTTACCCCCATAGACACAAATTATCTTAACGTTCTTTT 

ATATAAGTTTATCGATAGCTGTACAACAATGTATCTCAATAATGAAGATATTCCTGTGGCACTTAAGGCTTTCTTCAGATTTTAACATGCACGCAAGAGTCATAAGTAAG 

TAAGAGTCGGCTGATCAAAAACCTTCGCAACTCTTCCCAAAAACAAATGTGTTTCTCCATTT 

(SEQ ID NO: 4) 

Exon: 1001.. 1143 
Exon: 3372. .3542 
Exon: 4058.. 4202 
Exon: 6615.. 6757 
Exon: 7120. .7228 
Exon: 7333.. 7504 
Exon: 7648.-7754 
Exon: 8263.. 8412 
Exon: 8583.. 8716 
Exon: 8953.. 9143 
Exon: 9801.. 9935 
Exon: 10017.. 10179 
Exon: 10294.. 10507 
Exon: 10885.. 11036 
Exon: 13701.. 13884 
Exon: 14430.. 14572 
Start ATG: 3468 

Transcript No. : CT5032 

GGTCCATCAGCAACATAGATGACGCGGAGTATCACTGCAATAAGACGCAGGTGCGCAAGGTCATCTCGGGGGTGGTGGGTGCCGCCTCCTCGGTCACATCCATACAAGTTGC 
CAATTTATTGCGGTTATTCCGAATTCCCCAGGTGTCGTACTTCTCCACCAGCCCCGAGCTGAGCAACAAACAGCGATTCGAGTACTTCTCCCGCACTATTCCCTCGGATCAC 
TACCAGGTGAAGGCCATGGTGGAGATAGTGAAGCGGATGGGATGGAGCTACGTGTCGATTATCTACGAGGAGAGCAATTACGGAATCAAGGCTTTCGAGGAACTGGAGGAGC 
TCCTCGCACGCCATAACATTTGTATTGCCATCAAGGAGAAGCTGGTGAAGGATTCGGGAGTGGCCGAGGACATCGCCTACGACAATATAGTGCAGAAGCTGCTGACGAAGCC 
GCGGGCTCGAGGGGCCATTATCTTCGGTTCGGATCAAGAGGTGCGCCAAGTGATGCGGGCGGTGCGCCGGGCGAATGCAACTGGTTCCTTCTCCTGGATCGGATCCGATGGC 
TGGAGTGCCCGGAATTTGGTGTCCGACGACTACGAGCCGGAGGTGGAAGGCACGCTGTCTGTGCAGCCGCAGGCGAATCCGGTGCGCGGGTTCGAGGAGTACTTCCTCAGTC 
TGACGGTGGAGAACAATCAACGCAATCCCTGGTTTGTGGAATTCTGGGAGGATCACTTTCAGTGCCGATATCCCGGCAGTACGAGCACTCCGTACAATAACTACACCAAACA 
GTGCACCACAAAGGAGCGACTTTCACGCCAGAACACTGACTTCGAGGACCAGTTGCAGTTTGTCAGCGATGCGGTGATGGCATTTGCATACGCCCTAAGGGATATGCACCGC 
GACCTGTGCGGCGGAGGTCCTTCGCTCTGCGAGGCCATGAAGCCAACGAAGGGTGCAGACTTGCTGAAATATTTGCGAAAAGTGGAGTTTGAGGGCCTCAGCGGCGACGAGT 
TCCGCTTCGACGGGAACGGCGACGGTCCCGCCCGGTACAACATCATCCACTTCAAGCAGTCCCAGGCGGGACAGTACCACTGGGTCAAGGTGGGCGAGTACACCGAGGGCGA 
GCTGCGGCTTAATATGACGGAGGTGAAGTTTAAGCGGCTGAGTCCCAAGCCGCCGGAGTCTGTCTGCAGCCTGCCATGTCTTGTGGGTCAGGCCAAGAAGTACGTGGAAGGC 
GAGAGCTGCTGTTGGCACTGCTTCAATTGCACAACCTATCAAATCCGCCACCCGGATGACGAGACCCATTGCAAGCTGTGCAAGCTGGGCACTCTTCCCGACGCCCACAAGC 
AGTACTGCCGCCCCATCCCGGAGATATATTTGCGTCCGGAGTCCGCTTGGGCCATCGGTGCTATGGCGTTCAGTGCGACCGGGATCTTAGTAACGCTCTTCGTAATGGGCGT 
ATTTGTCAGGCACAACGACACGCCCATAGTGCGTGCCTCTGGTCGGGAGCTGAGCTACATCCTGCTGGCGGGGATTTTCATGTGCTACGGTGTCACCTTTGCCCTGGTCCTG 
AAGCCGACCAACATAGTGTGTGCCATTCAGCGGTTTGGTGTCGGCTTCTGCTTCACGGTCGTTTATGCAGCACTGTTGACCAAAACGAATCGCATTGCACGGATCTTCAAAG 
CGGGCAAACAATCCGCCAAGAGGCCCTCCTTCATTAGTCCCAAGTCCCAGCTGGTAATCTGTGCGTGCCTGGTTAGTGTGCAGATACTCATCAACGGCGTTTGGATGGTAAT 
TGCGCCATCACATGCCATGCACCATTATCCCACACGCGAGGATAATCTGCTCGTCTGCGACTCGTACATCGACGCCTCCTACATGATTGCGTTCTCCTACCCAATCTTCCTG 
ATTGTTATTTGCACGGTCTACGCGGTGCTCACGCGAAAGATTCCGGAGGCTTTCAACGAATCGAAGCATATTGGATTCACCATGTACACGACp'TGTGTGATTTGGCTGGCTT 
TTGTACCGCTCTACTTCGGAACCGCCAACCACGTTCCGCTGCGCATCACCAGCATGTCGGTGACCATCAGCTTATCCGCCAGCGTGACCATCGCCTGTCTCTTTTCGCCCAA 
GCTGTACATAATACTCATACGCCCCGAGCGCAACGTGCGCCAGAGTATGATGCCGCCTCGCTACGGCAACATGCACCGCACCGCTGGCACAGGACCCTCGTCCATGATGGCC 
GCCGCCGTGGTGACCGCCGCCACCTGTGCCCAGGAGGAGAAGATCCAGAAGCATATCACTCCCACAAACACAGACGAAGAAATTGTGCGAAATGGCCACTCAGACAATTTCC 
AGTATAATCACGTCGCTGGACATCAATGCGTACAATCAAATACCATATGCCGATCAATATGTGCCAAACAATGCCACAACGGCCACGACCACGCCCACAGATAA 



MVEI VKRMGWSYVSII VEESNYGIKAFEELEELLARHNICIAIKEKLVKDSGVAEDIAYDNIVQKLLTKPRARGAI IFGSDQEVRQVMRAVRRANATGSFSWIGSDGWSARN 

LVSDDYEPEVEGTLSVQPQANPVRGFEEYFLSLTVENNQRNPWFVEETtfEDHFQCRYPGSTSTPYNNYTKQCTTKERLSRQNTDFEDQLQFVSDAVMAFAYALRDMHRDLCGG 

GPSLCEAMKPTKGADLLKYLRKVEFEGLSGDEFRFDGNGDGPARYNIIHFKQSQAGQYHWVKVGEYTEGELRLNMTEVKFKRLSPKPPESVCSLPCLVGQAKKYVEGESCCW 

HCFNCTTYQIRHPDDETHCKLCKLGTLPDAHKQYCRPIPEI YLRPESAWAIGAMAFSATGILVTLFVMGVFVRHNDTPIVRASGRELSYILLAGIFMCYGVTFALVLKPTNI 

VCAIQRFGVGFCFTVVYAALLTKTNRIARIFKAGKQSAKRPSFISPKSQLVICACLVSVQILINGVWMVIAPSHAMHHYPTREDNLLVCDSYIDASYMIAFSYPIFLIVICT 

VYAVLTRK1PEAFNESKHIGFTMYTTCVIWLAFVPLYFGTANHVPLRITSMSVTISLSASVTIACLFSPKLYIILIRPERNVRQSMMPPRYGNMHRTAGTGPSSMMAAAVVT 

AATCAQEEKIQKHITPTNTDEEIVRNGHSDNFQYNHVAGHQCVQSNTICRSICAKQCHNGHDHAHR* 

(SEQ ID NO: 6) 

Name: metabotropic glutamate receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384693 

CAGTTAGTTCCTAGTTAACAAGTGAGGTTTGGTTGAAATCTTTCTTTGCTATATAGGTATTTCATTTCCGTATGCTGTATCGGTTTTGTAAAATAATAAAGCATTTCGCGAG 
CTCTCTCTCTCTCTGTATCTCAAAAATATAAAGTAAATGCCCCTATGTGTGTGCTTAAAGCATCTAGATTGATGAAAATGTACACTTCAAATGCTTCGCTCTGGTTTGCTTA 
CAACGTAGCTGAAATGCAGCAACAAAATGGCAATGTATTATTTCCCTTTACTTCTTATATTCCTCTTGATTCGCAGTTGGAAGGAATTTTAGTTGTATAGGTGGAAAACGGC 
AGCTGGATAGCAAGAAATGACAATCTTTCAAAACGTAGGATATCTTTTTGGTTTATGAAACACAAAACAAATCTTAAGCCTGTGTGTATATATATATATATCTTACGAGCAT 
AATCAGCGTAAATCAAAAACAATTGATTTCTTACAAAATACATAAAACATAATTCGCCTGAATTCTATATATCATGTACATACAGTTCCTAGATCTTATACATATACCTACA 
TCCATATATATATACTTGATTTTCTGGTTGTTTAAATTAGTTTTTAAGTATTTAAAAATGGCATGTGGTGACATGTATTACAATACATCATACATATATGGTACATACATTT 
ATATTTAATATCGATTTCCAAGTTGGTATAAAATGCATGCAAGTCGAAGGGCCCGATTCCAATTCCAATTCCAACCCCCCATCGCTCGAGTTTTGGTTAAGCTTGCACGGCA 
CCGCAGGGGTATGGATGTGCCCTATGGTCTAGGTTTTAGTTATTTCATATACATTTATAAAGTTTTGGGATTTTCGAATCACTGCACGTATAGCATGACAGCATGTTGTTTC 
ACACAATGCTCCATATTTCGACTAACGACTAATACTAGGTATAGCTAGAGATCCTAAGCTACAATTTTTGCCTAAGACTGTTCCGGGGGAGTTGCGTTAACTGATCACTTAG 
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CCAGTGGTTCCfiGATAACATCGCACCGTGTTGTTCTCATCGAGCATGGCCGflCAGGTGGCGCTGCTGATGGTGCGGAGGATGCTGATGGGGflTGCGGATGGGTTAGCTGGTG 
GGGATGCGGATTGGGATTGCTGTGTGATGTATGCGACGACGATGATGTGGTGGACTGTGGTGACAGTTTCTGTAGCTGCTGGCGCTTCTGCGCGGGCGTCTGATTCGCCGGC 
GGCAGCGGTGGTCCCGAGTTGCCGTGACCGAATAGCTCATTTACCGCCAAACGGCCGCCTGGCGACTTGGCATTCAGTGCTTGCTGCTGCTGCTGCCGGAATGCGCGACGCA 
GGGAGCCCTCACCAACGCCGCCGCTAACCCCAAAAACCGCACCGTTGTGCTCGGGTTGCGGTCCTGCTCCTGGTGGCAAAGCCCCGTAGTCAAGAGGCGCTCCATGTTTGCG 
CAGCTGACGTCTGTAAAGGGGATAAATGTAATTTTTTCGTAACGTTGATATAGTCAAACATTAGAAGTACTTACTGAAACCGCCGCTTAAAATCCTCGTCCTGCATCGCCAT 
CAGACGACGCATCCGCTCATCGCTCTGCGGTGGCAATCCATGAGCATCATCCTCCAAGTCGCTCTCGTCCTCCTCCTCATCCTCATCACAGACGCTGTCGTGCAATTGCTCC 
GCGGCCGTTGCCTCCTCATCCTCCTCCTCGTCTTCGTCGTCATCGTCATCCGCCGTGCAGCGTCGAGCCGATCTTTGCTGAGCCAGCGACTGTTGTTGCTGCTGCTTCTTGA 
GCTGCTGCTGTTGGCTCCTGGACGAGACACTGCCGCCACTCAGGCTTCCGCCGCCGTACCGACTACCGCCGGCGGCCACCGCCGCTCTCATGTTGTTGAAGACGCCAGTGCC 
AACGCTATTGTTGTACACCTCGTCCACATCGAAGTCACCCACCTCGTAATGCTGCACATCCGAGTTGCGGTTGCTGAGGATCTCCTCGTAGACCGGCCCACCATCCGCGCCG 
CCGACGCTGCCTTGGTTGTAGTAGGCATCGCCCATGCCCAGATCACAGTTGGAATGCTTATTTCTATACTGGCGCTGCATAAACTCCTGCTGGTACTGCGCCATGTGTGGCG 
GTGGAGCAGGGAGCTGCTGCTGGCTACCAGCTCGTCGTTGCAGGTGCAACAGGTGAGCATGTTGCTCGGCCTGCTCCTCGTCCGAGGAGAGTTGCTGCTGCTGCTGCTGTTG 
CTGCTGCTGATGTAGCTGCTGGCGACGCAACTGTTGCTGCTGCTGGTAGTAGGCAGCCGCATTGCCAGTGGCTAGTGGTGGGTCCAACGCCTCCGCGTATTGCGGATAGCCT 
GTTGGCTGGCCAGGCTTGGTCTTCTTGTACGATGGATAGAAAAGCACCTGGGAGGCGGCGGCCGTGCGACTGTGCAGAGATCCACTAGGCGAGGCGGGAACTCCTCCGGCGC 
CCCTCTTCTTTTTGCCATTGTGGCCGCTGTTCGGCTTTTGAGTCCAGTAGAATACCTCGTGCGCCTGTTGCGGCGATGATGCAGCAGCACCCGCATTATTGTAGTACTCCCG 
CGGTGTTTTCAATCCTCCTGTTCCAGTTGATGGTTGCTTCCAGCTGTAGTAGTCCGGCTGATGGTAGTAAGACTCGCCCAAAGGTCCCTGTTGCTGCTGTTGTTGTTGTTGC 
TTGTGCGCAAAAATCAGGTGTGTGGACCCGGCAGAGCTGGTGGGTGAGGCATGACCAGTGGAGCGGAGAGTGCTACCTCTTTCGAGCACAACATGGCCACCAAGGGTCTGCG 
GTGGAGCCTGTAGTGTGCCCGTCATTAGGCCACTTTTTAGGGTTCTTCTTTGCTGGGCCTCGTTCAGTTCACAATCAGAGGTGGCTCCCATAATGGCATCCTCTGTGATGGG 
CACTGGTGTCTGCTGTTGCTGCTGGGGATCGTCATGCGCCTCTTCGCTCACTCCTAACGGCAGCTTGGGCTTCTTGGACGTGGAGACGCTGCAAAGGGTTCCGGCTGTGGGT 
CCATTGCCGGTAACAATGCCCGAGGAAATTGATGTTTTCGAGCAGCGCAAGCACTTGGGCAGCCAAGCGTGCTGTCTCACATACTTCCGATACTCCCGCCGGATCTTTTCGT 
TTTGAATGCAATGGAACACGAAGATATAGAGCCCCAGCAATGTGTTGAAGCAGATGAAGCAGTATCCATACACATCAGCCGTGTCATCGTCCATTTTGGCTCCTCGCAAATA 
GAAGTAGGCCGAGCACCAAACAGCGCTGAGCAGCAAGAGGAACACAAAGGAGCAGCGTATGTCGAACCTAGGATTTGAGGTACATGGGCATCGAAGGATCTTTAAAGTGTAT 
ATGGATTTTCAATATCTTACCGCACGCTAGCGAGGCGAGTATGTTCCTTGGTCTTTAGACCCGTGCGACTTTTGCGGCACATTATAATCCAGGAGAGGAATGTGTAACCAAT 
GGCAGCCTGTAAAAGTGAATTAATTTAAAATGGTAACTGCGCTTTTGAAGCTTCATCTCATATTATGTAACTTACCACAAAGAAGACAAGCACTGGTATTACAAAGGTGGCA 
TAAAACAAGGCATTCGCCTCCATCAGCACGCAATAATCGTTTTGGGTATAGGTGCTGGGATCGATGACCAGCGAGATGGCCACCACGCTCAGCGACAGTCCGTAGGACAAGA 
GGTAGTAGCAGTTCACCTTGGGCGTCTGGTCCACCTCCAGCAGGAGCTCGTCCGAGGTGAGCGTTGAGTACGAATGGAAGGCTGCAGAGGCAAAGGGTTAGGGCTGCAAAGT 
GTAGATGAAATTCGGCCTTCTTACCTTCGTAACAGAACCAGGCGGTGCCCGATAGGATGGCACAGTGTAGGAAAATAGTAATGAAGCCGCAGAAAATGCTTGTTTCGGTCTG 
TTCAATGCCCAGGAGAAAGAGCAGCTCGATGGCCAGGAGGCAAAGGTAAATGCTGGTATAGATCGAGGTGCGCGCGGACTGGAGTGGACAGTTAAAGAGAAGACAGAGTGAA 
AAGTTAATATAAGTTCGAGAAGTCAGTGCAAATTTTTTCTTTAAGCAGCACCAGCAGAAGGACCCTAAAGATTCGATCATCAGGGAGCTTTTAGACATTTTAAGGAAATTGA 
TAATCTTAAACGTTTTGAAGAATAGGTGTCAGATTTGATAGTCAAACTTAATTTAGAACTTGATTTACAATAATTTTGGTGGGTGGTGGTGAAAATCCAAAGACTAAGAGTA 
ATACGAATACGAATAAAACAATGAACAGAACACGAAACAAGCACAACACATAGGTAATATACATACAAACTAAATGAACAAACACAAAATGCAATATTTACAGGACAAAATT 
AGCAACATTTGTATATTTTATTAATTAATGAAATACAAAAGAAAGTCAGAAGTGTTCTAGGTATTTAGTTGTACGATGTCTATGTTAAATTTGTTAAACAATCAATGCACAT 
AAAAGGGGTTATGGTATTTCAAAAATGATTAAACGATTAAGTAAAACATGCCAGAGTGATGATTTGTAACAAAAAGAAAAATGAGTCTATCATATGAAGATATTTCATTATT 
GGTTTTGTGTGCTAATCTTAAGCTATGACAAATTACTACGATACATAAATATAAATGGGAGTCTTCGCGTGCAAGCTGTTACTGGGACGCAACAGTGATCATCGCTGCCCAG 
CACAGCTTTTGAGTCTTATCTTTGAATCCTGCCAGAGAGAGGTTCATTAACAATTTAAGTTACTTAAGCAITATATCACATGTGTCTCTCGGTCATAAATACATATGTCTAG 
AAAGCCTAGCCTAGCCTAAAATTGTGTTTATCTGTTTTTTCACCACCACCACCACTTATATTGTAAGTCGTCCATGTTGGCCTGCATAATGATGTTGTTCGTGTTATCCAAT 
CCATTGCCGTCCACATTAAGGTTGTGATTGCGGTGCGGATGTGGCTGCGGCTGCTGGCCACGCTGCTGCTGCTGCTGCTGCTGGACATGCTGCTGCTGCAGATAGTTGTGGC 
TGATGTTGAGGTTATTGCTGGCGTTGCTGGCCTGAACTGCATGTTGCTGCAGTGCAGCGGCTATTGAGGCAGCAGCAGCTGCAGCAGCAGCCGTCTGCTGACCGTGCTGATG 
CAGATTCAGATTCAAGTTGTTCATGGCATGTTGCATATTTCGCTGCGGCTGATTGGACAGCACAACAGCAGCAGCAGCTGCCGCAACAACTTGTTGTTGCTGTTGCTGCTGC 
TGCTGTTGGACATTGTAGTTCAGATTGTTGCGGTGTGAGTTGCGGATGGAATTGTGTTGGATAAAGTTCGTGTTCCCATGACTGGGCGGCGGCACATTGGACTGACTGGTTG 
GCGTGGTCAGGGTCAAGGACTCGTGGGTCTGGTCGCGAATATTGTTTTGCTGGCGTCTGCTGCCCGACCGCTGATGCGGCAAGGGAtTGGAGCCGTTTCTTACCTTCACAAA 
GACCCCATTGAACAGCTTCAGCGTTAGCAGGGCGATAACTATGAAGACCACGCAGATGCCGATGCTTATGTAGATGAATATGCGCATGTTTCCATCGAACATGGTGAACAAC 
GAATGCTGGTGCTCATCCACAACGTCCATTAGTATGGCAAAGTTTGTCAGGTGGTTGCAACTGCAGACGCTGTGCGTGCGGTTAGTGGACTCCAGACTGCATCCGTTGGCAG 
ACCACGCACTAAAACAAAAACATTAATTCGTATAAATTATTAGGCATATGAGCATATGAGTCTTACTGGTCAATATAGTTCCAGAACACGCAGGTGGGATTCGTTACATTCT 
CGGTCTTCAGATGTTTCAGTGTCAGGGTTATGGGCTGGGAGAGTTGTATGTGACGCCCCTTGCCCAAGCTGGCCGAGATCACCTTGCTGTTCAGGATGCGTAGGCGCTGTTG 
GATCTCCCCCGCGTTGACATCGCTGTCGTTGCTCAGGATGGCTGTATTGCGAACTAAGAAAACATTTAGATTATACTATTCTAACGTTTTTCAGTTCTCTGTGGACTTACCG 
TAACTGCGGGAGCTCTTGAGATCGAAGTGATCATAGCTGGGCTTTAGAATGGATTCCAGGCGATCGAAGGCGGCGAATACAATTCGCACCAGACCGCCTTCACTATTATCTA 
TTAGAGCAGCTCGTGGCAGCTCAATACGATCCGAACTCAAGGGCCACTGATCCGAATCTGGGAAGACCACGCTGGACTGGATAGTCTTGGTTTCCAGAACTCGAACGGAGAG 
CACTATGGTGATATACAGTTATAAGTATTTTGAAATGCAAACAATGCAAACATTTAGAACTCACATATATTTTTGACTTTTTGCACCACGCTGCGCTCCCTGATGATCGTAT 
CCGCCAGCAGAAAGGCATTGTATTCCAGGCCAGTTAGCAAGGATGTGGCTACACGCATTTGGTCGTCCGGATTGAGATCCAACCACGAGGACAGCTGCGATTCGTCCAGCAG 
GTTGGAGCCGGTTTTGACCACACAATGCAACAACTCCATGATCATAGCCTCGCGCTGTCGCTGATCCGGGAAGGTCTCCTTGTCGTGCATCATCTTCTCGGACACTGTTTGG 
ATAATCTTCGTGGTGACCAACATGTCGCCGCCGTAGAGCGTTTTGCTACTGGTCACTTCGGACATATCGTTGGCGATGGAGATCAAGGAGGAGTCCCGCTGGTTTACTCGCA 
TTTCCAGATTGTTGAGCCAAAGGCTGCGGCATTGCGTCAGATCAGGTGTCGCGGGATGCCAGGTGGGCTCAAAGTTGCGCAGACGCTGGTTTACCTTGTGGGCCATGCTCAC 
CGGCGGCTCGCAGCTGCTGCTGTTGTGCAGACAGTCGCCACCGCTGGGTGCCGGAGTTGTCGAACTGATGTCATCATCGTACTCATCGTAGCCGGAGTCGGGTATCCTCTTC 
ATTAGAACGCAACGCCACTTGGCGATGCCGGCTGCTCCGCCAGGACAGGGCTGAACATTCACATCACCCACTCGAGTCATGTTCCAGTACAAATTGCGGGCATGAGTGGGGC 
CGCAAAACAACTTGGACTCATCATCGCCTCCGGACATCCCGGCTGCGTTCAAATTAATATTGTTGATGGTGCGCACCACGCTGCCGTTCCCGGGGACAACGGATCCACTGGC 
CGCAGTGCCCGGTGGGCGGTTGGGTGAGCTCTTTGTGGTCAGTAGGGTTCCGTCATCAGTTCCGGAACCTCCGACGCCCGTGAGTATCCTCGTGTTGGATGGTGAAGTGGCA 
TTACCGCCAGCCGGCAGCTTGCTGCTCGGCTTGGTGGGTGAGGCAGTGTCTTCACCGTGGTGGTTGTGGTGGTGGTGCAGTTGCGGCGGCGGTGGCAGACCATCATGGCGGC 
CAGCCGGATGCTTGGTGGTCGTCGTGGAGGTGGCCCCACCCTTCAAACGTCCACCGCCCACGGCTGTCGTGCTGCTCGAGGGCGTGGAGTGCGTGACGGCGGTGTGTTGCGG 
TGGTACGTTGAACAGGCCGGGATTCCCACTGATTCCCACCACTCCGGGAAGTGTGGGAAGTCTCGGCGGGGGCGGCGCACCCGCTCCAACCCCGGGCGGATGGATCAGTCCA 
CTGCCGTTGCCAAAGATCGGCGGACCATTGCTCAGCACCCATGGCGGCGGACTGGGCCTGTTGGTCGTCGTCGAAGTTTGGGCTGCACTTATGCACTGGTAGTGTGCCTCCA 
GATACTTGTGGGTACCGGGACAGGGATCCCCGAACATGCTCGTGGCTGCCAACACGCCGCAGCTCTGCTTGTGGGCACATCTGGAAAATGGGAAAAATACGCGTTCATCAGT 
AAACAGAGTTGGGTAATTTTCACATATTTAAGATGCTTGATTGATTGACTTGTGTTGAAATAGTTTATTGATATTCGCAACCAGAGTATGAAATTTCCAAATGACATGTTTA 
ATGATTTGAAGATCCAATAAACTTTAAAAGATGTATTACCTTGAGTTCAGTACGCTGAGTGACTTGGGAAACATGCAGTTAACACTCCACTCCACATTCCCGTGGTCATTGC 
AGATGGTAATCGAGAAGCGGCCATAGTTGGCCCGAATGAGGTTGATCACATCGCCGGGATCGCACTCGATGGTCAGTTTCTTACCTTCGCAGGCGTAGGCGGTTTGGTACTT 
GGACACTGAAAGGCGAAGGCTTCATAAGTCACAGGGTGCTGGGCAGGGTATGGGTTTATATGGACATGGTATATATATATGGCATATAAAAAGAAATTTTTTGGTTAGCGGC 
GATGATTGTTGTAGTTTGGTTTGGTTTGGATTGGATTGTTGGTGGGCCTGCAAAGACACTTACGATCTAACGATATATGCTCATAGGATATTGACAATATGGTTGGTAGCAT 
GATACGCTTTTACACTGCCTGGGATATACTTAGTAATGCTGAGGTTGATGGTGGAGTATGTGGGTCGTTAGCTTGGTAGCAAGTGGAATAGTAGTCATAAATACCAAATCCT 
GCCGGCCCAGATCACGAATCATCGATCGTCCATCATTGGTTCTTATCGGCAATGCTCATTCGGCAGTGTTTCACTCGGTTTGAAGTATTCAAACTGAAAGGGAAGATAACAT 
TTATAATGTTTGGTATTATTATTATTATTATTTTAAGGCACATAACTAATAACTTAAGACATGTTGTATTTCGTTAGAAAGGAAAAGGAGAACTATCCTTTTCTTTTATTGA 
TATTGGTTCCATTTAATTTACTTGTGCAATATCACATCAATGTTGAGTCAATCTAATGACCTGGTTACAGTAAATATCAAAAGTGTAGATTCTAATAAATTTAAGTTATTTT 
AATATTCAGAACCTCTTTGTATAACTCCTATTAAAACACTATATAAGCAATGCAAATATCAAGAAAACATAATTAATTGTAAGAATTATAAATTGCATTTATATATCTTTAA 
TAATTAAAATGGTCTTATATTTATGTATGATGATATCCAGGCCAAGCACACCTATGTACATATGTACATACCTCTTATGTATTTCGGTTIGGAGTCATATTCATGAAAGTTG 
GACTCAAAAGTACCATTCAACTTATGTATCTGAAAGGTAGAGAATCCATTAGATAATTGTTTACATATTGACTCCTTGGGCAAACACGAACAAGCAGGCAAAAGACAAATAG 
AAGATAAGAAGATGGCATGGGGCAGGGGAGGGGGTAGGGTAGGGAGCTAAGAATACACACCCTTCTGTGTACACCCACACTGTGGTTGTAGTTGTTGCATGGCCATCGATTC 
GTGGACAGAACAAAGACCCGTTTTCTTTGTACACCCGATGCACACTATTCACGAACATATATGCATGTGGATGTAAATGTAGAGAAAAGTTGCTGCAGCTGCGC 
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Start ATG: 8400 (Reverse strand: CAT) 
Transcript No. : CT8755 

ATGCTACCAACCATATTGTCAATATCCTATGAGCATATATCGTTAGATCTGTCCAAGTACCAAACCGCCTACGCCTGCGAAGGTAAGAAACTGACCATCGAGTGCGATCCCG 
GCGATGTGATCAACCTCATTCGGGCCAACTATGGCCGCTTCTCGATTACCATCTGCAATGACCACGGGAATGTGGAGTGGAGTGTTAACTGCATGTTTCCCAAGTCACTCAG 
CGTACTGAACTCAAGATGTGCCCACAAGCAGAGCTGCGGCGTGTTGGCAGCCACGAGCATGTTCGGGGATCCCTGTCCCGGTACCCACAAGTATCTGGAGGCACACTACCAG 
TGCATAAGTGCAGCCCAAACTTCGACGACGACCAACAGGCCCAGTCCGCCGCCATGGGTGCTGAGCAATGGTCCGCCGATCTTTGGCAACGGCAGTGGACTGATCCATCCGC 
CCGGGGTTGGAGCGGGTGCGCCGCCCCCGCCGAGACTTCCCACACTTCCCGGAGTGGTGGGAATCAGTGGGAATCCCGGCCTGTTCAACGTACCACCGCAACACACCGCCGT 
CACGCACTCCACGCCCTCGAGCAGCACGACAGCCGTGGGCGGTGGACGTTTGAAGGGTGGGGCCACCTCCACGACGACCACCAAGCATCCGGCTGGCCGCCATGATGGTCTG 
CCACCGCCGCCGCAACTGCACCACCACCACAACCACCACGGTGAAGACACTGCCTCACCCACCAAGCCGAGCAGCAAGCTGCCGGCTGGCGGTAATGCCACTTCACCATCCA 
ACACGAGGATACTCACGGGCGTCGGAGGTTCCGGAACTGATGACGGAACCCTACTGACCACAAAGAGCTCACCCAACCGCCCACCGGGCACTGCGGCCAGTGGATCCGTTGT 
CCCCGGGAACGGCAGCGTGGTGCGCACCATCAACAATATTAATTTGAACGCAGCCGGGATGTCCGGAGGCGATGATGAGTCCAAGTTGTTTTGCGGCCCCACTCATGCCCGC 
AATTTGTACTGGAACATGACTCGAGTGGGTGATGTGAATGTTCAGCCCTGTCCTGGCGGAGCAGCCGGCATCGCCAAGTGGCGTTGCGTTCTAATGAAGAGGATACCCGACT 
CCGGCTACGATGAGTACGATGATGACATCAGTTCGACAACTCCGGCACCCAGCGGTGGCGACTGTCTGCACAACAGCAGCAGCTGCGAGCCGCCGGTGAGCATGGCCCACAA 
GGTAAACCAGCGTCTGCGCAACTTTGAGCCCACCTGGCATCCCGCGACACCTGATCTGACGCAATGCCGCAGCCTTTGGCTCAACAATCTGGAAATGCGAGTAAACCAGCGG 
GACTCCTCCTTGATCTCCATCGCCAACGATATGTCCGAAGTGACCAGTAGCAAAACGCTCTACGGCGGCGACATGTTGGTCACCACGAAGATTATCCAAACAGTGTCCGAGA 
AGATGATGCACGACAAGGAGACCTTCCCGGATCAGCGACAGCGCGAGGCTATGATCATGGAGTTGTTGCATTGTGTGGTCAAAACCGGCTCCAACCTGCTGGACGAATCGCA 
GCTGTCCTCGTGGTTGGATCTCAATCCGGAGGACCAAATGCGTGTAGCCACATCCTTGCTAACTGGCCTGGAATACAATGCCTTTCTGCTGGCGGATACGATCATCAGGGAG 
CGCAGCGTGGTGCAAAAAGTCAAAAATATATTGCTCTCCGTTCGAGTTCTGGAAACCAAGACTATCCAGTCCAGCGTGGTCTTCCCAGATTCGGATCAGTGGCCCTTGAGTT 
CGGATCGTATTGAGCTGCCACGAGCTGCTCTAATAGATAATAGTGAAGGCGGTCTGGTGCGAATTGTATTCGCCGCCTTCGATCGCCTGGAATCCATTCTAAAGCCCAGCTA 
TGATCACTTCGATCTCAAGAGCTCCCGCAGTTACGCCATCCTGAGCAACGACAGCGATGTCAACGCGGGGGAGATCCAACAGCGCCTACGCATCCTGAACAGCAAGGTGATC 
TCGGCCAGCTTGGGCAAGGGGCGTCACATACAACTCTCCCAGCCCATAACCCTGACACTGAAACATCTGAAGACCGAGAATGTAACGAATCCCACCTGCGTGTTCTGGAACT 
ATATTGACCATGCGTGGTCTGCCAACGGATGCAGTCTGGAGTCCACTAACCGCACGCACAGCGTCTGCAGTTGCAACCACCTGACAAACTTTGCCATACTAATGGACGTTGT 
GGATGAGCACCAGCATTCGTTGTTCACCATGTTCGATGGAAACATGCGCATATTCATCTACATAAGCATCGGCATCTGCGTGGTCTTCATAGTTATCGCCCTGCTAACGCTG 
AAGCTGTTCAATGGGGTCTTTGTGAAGGTAAGAAACGGCTCCAATCCCTTGCCGCATCAGCGGTCGGGCAGCAGACGCCAGCAAAACAATATTCGCGACCAGACCCACGAGT 
CCTTGACCCTGACCACGCCAACCAGTCAGTCCAATGTGCCGCCGCCCAGTCATGGGAACACGAACTTTATCCAACACAATTCCATCCGCAACTCACACCGCAACAATCTGAA 
CTACAATGTCCAACAGCAGCAGCAGCAACAGCAACAACAAGTTGTTGCGGCAGCTGCTGCTGCTGTTGTGCTGTCCAATCAGCCGCAGCGAAATATGCAACATGCCATGAAC 
AACTTGAATCTGAATCTGCATCAGCACGGTCAGCAGACGGCTGCTGCTGCAGCTGCTGCTGCCTCAATAGCCGCTGCACTGCAGCAACATGCAGTTCAGGCCAGCAACGCCA 
GCAATAACCTCAACATCAGCCACAACTATCTGCAGCAGCAGCATGTCCAGCAGCAGCAGCAGCAGCAGCGTGGCCAGCAGCCGCAGCCACATCCGCACCGCAATCACAACCT 
TAATGTGGACGGCAATGGATTGGATAACACGAACAACATCATTATGCAGGCCAACATGGACGACTTACAATATAAGTGGTGGTGGTGCATTTACCTTTGCCTCCTGGCCATC 
GAGCTGCTCTTTCTCCTGGGCATTGAACAGACCGAAACAAGCATTTTCTGCGGCTTCATTACTATTTTCCTACACTGTGCCATCCTATCGGGCACCGCCTGGTTCTGTTACG 
AAGCCTTCCATTCGTACTCAACGCTCACCTCGGACGAGCTCCTGCTGGAGGTGGACCAGACGCCCAAGGTGAACTGCTACTACCTCTTGTCCTACGGACTGTCGCTGAGCGT 
GGTGGCCATCTCGCTGGTCATCGATCCCAGCACCTATACCCAAAACGATTATTGCGTGCTGATGGAGGCGAATGCCTTGTTTTATGCCACCTTTGTAATACCAGTGCTTGTC 
TTCTTTGTGGCTGCCATTGGTTACACATTCCTCTCCTGGATTATAATGTGCCGCAAAAGTCGCACGGGTCTAAAGACCAAGGAACATACTCGCCTCGCTAGCGTGCGGTTCG 
ACATACGCTGCTCCTTTGTGTTCCTCTTGCTGCTCAGCGCTGTTTGGTGCTCGGCCTACTTCTATTTGCGAGGAGCCAAAATGGACGATGACACGGCTGATGTGTATGGATA 
CTGCTTCATCTGCTTCAACACATTGCTGGGGCTCTATATCTTCGTGTTCCATTGCATTCAAAACGAAAAGATCCGGCGGGAGTATCGGAAGTATGTGAGACAGCACGCTTGG 
CTGCCCAAGTGCTTGCGCTGCTCGAAAACATCAATTTCCTCGGGCATTGTTACCGGCAATGGACCCACAGCCGGAACCCTTTGCAGCGTCTCCACGTCCAAGAAGCCCAAGC 
TGCCGTTAGGAGTGAGCGAAGAGGCGCATGACGATCCCCAGCAGCAACAGCAGACACCAGTGCCCATCACAGAGGATGCCATTATGGGAGCCACCTCTGATTGTGAACTGAA 
CGAGGCCCAGCAAAGAAGAACCCTAAAAAGTGGCCTAATGACGGGCACACTACAGGCTCCACCGCAGACCCTTGGTGGCCATGTTGTGCTCGAAAGAGGTAGCACTCTCCGC 
TCCACTGGTCATGCCTCACCCACCAGCTCTGCCGGGTCCACACACCTGATTTTTGCGCACAAGCAACAACAACAACAGCAGCAACftGGGACCTTTGGGCGAGTCTTACTACC 
ATCAGCCGGACTACTACAGCTGGAAGCAACCATCAACTGGAACAGGAGGATTGAAAACACCGCGGGAGTACTACAATAATGCGGGTGCTGCTGCATCATCGCCGCAACAGGC 
GCACGAGGTATTCTACTGGACTCAAAAGCCGAACAGCGGCCACAATGGCAAAAAGAAGAGGGGCGCCGGAGGAGTTCCCGCCTCGCCTAGTGGATCTCTGCACAGTCGCACG 
GCCGCCGCCTCCCAGGTGCTTTTCTATCCATCGTACAAGAAGACCAAGCCTGGCCAGCCAACAGGCTATCCGCAATACGCGGAGGCGTTGGACCCACCACTAGCCACTGGCA 
ATGCCGCTGCCTACTACCAGCAGCAGCAACAGTTGCGTCGCCAGCAGCTACATCAGCAGCAGCAACAGCAGCAGCAGCAGCAACTCTCCTCGGACGAGGAGCAGGCCGAGCA 
ACATGCTCACCTGTTGCACCTGCAACGACGAGCTGGTAGCCAGCAGCAGCTCCCTGCTCCACCGCCACACATGGCGCAGTACCAGCAGGAGTTTATGCAGCGCCAGTATAGA 
AATAAGCATTCCAACTGTGATCTGGGCATGGGCGATGCCTACTACAACCAAGGCAGCGTCGGCGGCGCGGATGGTGGGCCGGTCTACGAGGAGATCCTCAGCAACCGCAACT 
CGGATGTGCAGCATTACGAGGTGGGTGACTTCGATGTGGACGAGGTGTACAACAATAGCGTTGGCACTGGCGTCTTCAACAACATGAGAGCGGCGGTGGCCGCCGGCGGTAG 
TCGGTACGGCGGCGGAAGCCTGAGTGGCGGCAGTGTCTCGTCCAGGAGCCAACAGCAGCAGCTCAAGAAGCAGCAGCAACAACAGTCGCTGGCTCAGCAAAGATCGGCTCGA 
CGCTGCACGGCGGATGACGATGACGACGAAGACGAGGAGGAGGATGAGGAGGCAACGGCCGCGGAGCAATTGCACGACAGCGTCTGTGATGAGGATGAGGAGGAGGACGAGA 
GCGACTTGGAGGATGATGCTCATGGATTGCCACCGCAGAGCGATGAGCGGATGCGTCGTCTGATGGCGATGCAGGACGAGGATTTTAAGCGGCGGTTTCAACGTCAGCTGCG 
CAAACATGGAGCGCCTCTTGACTACGGGGCTTTGCCACCAGGAGCAGGACCGCAACCCGAGCACAACGGTGCGGTTTTTGGGGTTAGCGGCGGCGTTGGTGAGGGCTCCCTG 
CGTGGCGCATTCCGGCAGCAGCAGCAGCAAGCACTGAATGCCAAGTCGCCAGGCGGCCGTTTGGCGGTAAATGAGCTATTCGGTCACGGCAACTCGGGACCACCGCTGCCGC 
CGGCGAATCAGACGCCCGCGCAGAAGCGCCAGCAGCTACAGAAACTGTCACCACAGTCCACCACATCATCGTCGTCGCATACATCACACAGCAATCCCAATCCGCATCCCCA 
CCAGCTAACCCATCCGCATCCCCATCAGCATCCTCCGCACCATCAGCAGCGCCACCTGTCGGCCATGCTCGATGAGAACAACACGGTGCGATGTTATCTGGAACCACTGGCT 
AAGTGA 

(SEQ ID NO: 8) 

Start ATG: 1 (Reverse strand: CAT) 

MLPTILSISYEHISLDLSKYOTAYACEGKKLTIECDPGDV1NLIRANYGRFSITICNDHGNVEWSVNCMFPKSLSVLNSRCAHKQSCGVLAATSMFGDPCPGTHKYLEAHYQ 
CISAAQTSTTTNRPSPPPWVLSNGPPIFGNGSGLIHPPGVGAGAPPPPRLPTLPGVVGISGNPGLFNVPPQHTAVTHSTPSSSTTAVGGGRLKGGATSTTTTKHPAGRHDGL 
PPPPQLHHHHNHHGEDTASPTKPSSKLPAGGNATSPSNTRILTGVGGSGTDDGTLLTTKSSPNRPPGTAASGSVVPGNGSVVRTINNINLNAAGMSGGDDESKLFCGPTHAR 
NLYWNMTRVGDVNVQPCPGGAAGIAKWRCVLMKRIPDSGYDEYDDDISSTTPAPSGGDCLHNSSSCEPPVSMAHKVNQRLRNFEPTWHPATPDLTQCRSLWLNNLEMRVNQR 
DSSLISIANDMSEVTSSKTLYGGDMLVTTKI IQTVSEKMMHDKETFPDQRQREAMIMELLHCVVKTGSNLLDESQLSSWLDLNPEDQMRVATSLLTGLEYNAFLIiADTIIRE 
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RSWQKVKNILLSVRVLETKTIQSSVVFPDSDQWPLSSDRIELPRAALIDNSEGGLVRIVFAAFDRLESILKPSYDHFDLKSSRSYAILSNDSDVNAGEIQQRLRILNSKVI 
SASLGKGRHIQLSQPITLTLKHLKTENVTNPTCVFWNYIDHAWSANGCSLESTNRTHSVCSCNHLTNFAILMDVVDEHQHSLFTMFDGNMRIFI YTSIGICVVFT VI ALLTL 
KLFNGVFVKVRNGSNPLPHQRSGSRRQQNNIRDQTHESLTLTTPTSQSNVPPPSHGNTNFIQHNSIRNSHRNNLNYNVG^QQQQQQQQVVAAAAAAVVLSNQPQRNMQHAMN 
NLNLNLHQHGQQTAAAAAAAASIAAALQQHAVQASNASNNLNISHNYLQQQHVQQQQQQQRGQQPQPHPHRNHNLKVDGNGLDNTNNIIMQANMDDLQYKWWWCI YLCLLAI 
ELLFLLGIEOTETSIFCGFITIFLHCAILSGTAWFCYEAFHSYSTLTSDELLLEVDQTPKVNCYYLLSYGLSLSVVAISLVIDPSTYTQNDYCVLMEANALFYATFVIPVLV 
FFVAAIGYTFLSWIIMCRKSRTGLKTKEHTRIASVRFDIRCSFVFLLLLSAVWCSAYFYLRGAKMDDDTADVYGYCFICFNTLLGLYIFVFHCIQNEKIRREYRKYVRQHAW 
LPKCLRCSKTSISSGIVTGNGPTAGTLCSVSTSKKPKLPLGVSEEAHDDPQQQQQTPVPITEDAIMGATSDCELNEAQQRRTLKSGLMTGTLQAPPQTLGGHVVLERGSTLR 
STGHASPTSSAGSTHLIFAHKQQQQQQQQGPLGESYYHQPDYYSWKQPSTGTGGLKTPREYYNNAGAAASSPQQAHEVFYWTQKPNSGHNGKKKRGAGGVPASPSGSLHSRT 
AAASQVLFYPSYKKTKPGQPTGYPQYAEALDPPLATGNAAAYYQQQQQLRRQQLHQQQQQQQQQQLSSDEEQAEQHAHLLHLQRRAGSQQQLPAPPPHMAQYQQEFMQROYR 
NKHSNCDLGMGDAYTOQGSVGGADGGPVYEEILSNRNSDVQHYEVGDFDVDEVYNNSVGTGVFNNMRAAVAAGGSRYGGGSLSGGSVSSRSQQQQLKKQQQQQSLAQQRSAR 
RCTADDDDDEDEEEDEEATAAEQLHDSVCDEDEEEDESDLEDDAHGLPPQSDERMRRLMAMQDEDFKRRFQRQLRKHGAPLDYGALPPGAGPQPEHNGAVFGVSGGVGEGSL 
RGAFRQQQQQALNAKSPGGRLAVNELFGHGNSGPPLPPANQTPAQKRQQLQKLSPQSTTSSSSHTSHSNPNPHPHQLTHPHPHQHPPHHQQRHLSAJ4LDENNTVRCYLEPLA 
K* 

(SEQ ID NO: 9) 

Name: Latrophilin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384743 

CGCGGGCTCCATACGATTCGTATTAGCCCGCAATTGTATAAAGGTTAGTTAGTTTCTTAGCTAGTTAGTTGGACATTTAGAAGTAGGACTTTTTCTGTGCGAAAATCAGCAG 
AAGTTGAAGGAAAACCCTCAGTAATTCCGCCAAAGTTCGTTGCTGGAAAAGGACTCTTTTAAGGTAATTTCATGATATTGACTCGTTTCAGTTAAGTCCCTAAAGCTAATGG 
CACACTGAATACAATTAGGCATTACGGCTACGATTGCAATTGCGTTTTTTTTTTTCGATTGTTAAGTACACTTGTTTCGGCATCGGGAGAAGACAGCAAAGTTGGAAACCTA 
GTTTCTGGTTCCCATTTTTATTTTTTTGTTTTTGAAGACATGAACATGCGTATCACAGTATTTGGTTGCTGCGCGTTTCTCTCTCTCTCTAACTCAAAAAAAAAGAATTTAT 
ATAAAAATGCTAAATATATATATGTATATATGTGTTATGTACGCAAAGGCGACACACATTAAGTCCTAATGTCTAAAAGGCAAGAGCGATTATTGAATTTGGTTACACCAAA 
ATGGTAAAAAGGCAACGCAAGGGGCAACTAATTAGGAGCAACATTAAGTACACACAGGAAAATAGGATATGAAGCCCATCTAGGGCAATTACAAATACATTGTAGATGACCT 
ATGACCTCGCGATAAAGGCCCAAACTAAAGCACTAAGAGCACATTTCCGGGGTTATATACACAGTACATAGATCGGTTTACCCCATTGTGTTCTATGGACAGGCAAAGCAAC 
TAAAATCGAAATAACGATATCGTTATCTCACGATCGTTTTATGACAAACAAATTGGACCGTTTCCCCACCCATTTAAGTTGCTTTGCCTCTTTCGATACCCGAAACCGCATT 
AAGTTATAGTAGAGGATTGACTTTGGCACAGCAGCTGCTGCAGGAACATTGACTGGAAGCAGGAAGCATTTCCTGCCCTGAAATGGTACCTAATCCAATTCCGTCTAGAGTA 
GCGTCTCATTGCCATACTGCTGCAAGTGGGGATAGTAGCTGCCGTGCTGGCTGTACTTTTGCGAGGCATAGCTACCTGGCGGATAATACAGCCGATGCGAGGATGTGGAGTG 
ATGCTGGTGATTGTGGTGGGGATGGAACTGATGATGCTGCTGCTGCTGGTGATGATGATGATGCGGCGAATGGCGCTGCTGGTGATGCACGCTCCTGCCCAGCGAATAGTTG 
GACATCTGATTCAGCGATATAGAGCCGGTACCAGGCGAAGTCTTCAGTTTGTACGGACTTGGCGGCAACGGAACACTGGCCATAGAAGTTCGCACCGAATGGTAGGAGACCA 
CCGTGCTATTGCAAGCCGCATGCGATCCACGCGAGCTACTATGCCCGTCCAGGTGTCCAGTTGCTGCTCCGGCAGACGTATGCTTCACCATTTTGCCATTGCTGACTTGCCT 
CTTGTAGAGAGTGGAAATGCTTGGTCCAAGTACCCAGGCAAAACAGCAAATGCCCAGCATCAGTTCCATGAAGGCGCGCAACAAAAATGGCCAAAGTGGCGTATTGGTATCA 
CCCGATCCCAGCCAGACATCGATGTTGGCAAACTCGTAAATGACAGCCAACAGAAGCATGGCACATGCAAGTCCATAGATGAAAAGGAATATGCCAATGCCGGAACTACTGT 
GTGCGCTTAGCTGTTGCAGTTGCTGTTGCACGCTGAGTGCATTGCTATTCCTCAGGATCTCCTTGGTGCGACAGTGTACCAAATAGCCGGAGATCAGATTCATCGAGCCAAA 
GATCCAGTAGCAAAAAACGGGCGTAGCCACCAATATCTGTAGCGCCTTATCCGACTGATTGCCTACGAAACAGGCGCCTAGAAGAGCGTATGGTTGGGTTACTCTCCGAATT 
CTCCGGCGGATCTATAGCATATCTCACCAAGCAACTCGTCAGCATCCACGAAACGGGCAACGATCACTGCTGACGTTTGAAAGGCCGGCAATCCCCAGGCCACGAAGCACAC 
AAAGTTGTTTTGCGTTAGATCCTGCTGAGCCGTCTTGGCACTGTTCCTCTTCGCCGGACTGCCTCCAAAGTTCGAGGGAATGACCACATGGCCCTTGGAGTCCGGACTGTGC 
CGACGAflTGTCGCGGTGCCAGCCCAAACAGAGCACAGCCCACCTACAAATAGGATATGGGAATATAGTTTCTATAATATTAAGTATATATTATAATCTAATACCAAGCGCAG 
GCGGCCATTCCAAAATAGTAGCGCATAAGGAAGACACTAGCGCAGGAGGCATTGGACAGTCCGTCCACGGTGAGTAGGGATTCATTGGGCGCCTGGGGATCCGTGCCGCACG 
CAGTTCCCGTTCTTCCCACCATAAAGCGCACTGCCCAGCCCAAGGTGACCATGTTGTGGCACCAAATCAGCGGCGACAACAATCTGGACCACTTGGCGCTGGCCAAACGACT 
GCCATCGCTGGCCAAAAGACACACGGTGGCTACCAGGGCCAAACCCAAAGCGGCATAGGCCCATGTGGAGACCCAGATCTCTGCCAGATGCTTCTCCGCCGGTGTGAACAGT 
ATGTCCGCCTCGCAGAGTGGTGCACAGCGTCCAGATCTGGGTAGCCTTACGTACAGATGCGATTTGGCCAGTCCCGAGCAATCCATGGGCAGTTTACCGCCCAATCCACCGG 
GTATTCCTTGGCCCGGTAGCCCATACAAATCCTGCTCCTGCTGCGGCTGATGCAATTCACCGGGACCTTCCATACACATCGTCTCGTGGTTGTTCTCTCGCGGAAATTTATC 
GCAGTCGAGAGCCGGTGGCCATGGAAATCCGAATCCCTGCAAAACTGGATGACAACGTATTCGCACGGACTCACAGAGACTCCGGCACGGACCGATGGCATGGACTGGAGCT 
TTGGGCGTGCACATGGGCACGTAGGCGGCGCACAGGAACAGCTTCAACTGGGAACTGCAGTCGTATTCGATCAGTGGTGCAAAGGTTTGCAGTGTGTACTCCACGTCCGTTT 
GCATTTCGTTACCCACCAGATTTGGCATGGAGGTCTCATTGTAGCCGATCTTGCGGCACATCTCAATGCGAATGGTTTCGCATTGCCGGAAGGCGGGGATTTCTGGCGGAGA 
ACTGGAACTGGAACTGGCACTGGGGTTGCCGCTCGTTGAGGATTTTGAGATGCGCGGATGCAGGAGGATCACCACCAGGAGACACAATATGCAGGTTGGCTTCATTGCGGCT 
GATTTGTTGTTCTTTCGTTCCGAAGAATACGAATTCGATACGTTTCTTCGCTTGGTGTTCCGATTTCCGTTTCTCTAAAGGAATCGAAATGTGGAAAAGATCAGATTTCGGG 
GTTTTCTTCGCGCTGGGTTCCCTCTTTTCTTTCCAAAGAATCCGAACTATATGCAGATTCTTCGTTCTTCGGTGGAACGAACGCTGGAAGGGGTAACCTGTTCCAGGTGCGT 
TCGCCCTCTCTCGCTCGCACCCTCTCACTCGTTCCAACAACTTCCAGCTCTCCTTCTTTCCAGCTTGTTATTGTTGCTTTTGGCTTGTCAACTCGTTAACTACCTCTTTTTT 
TTCGGCGTCGTTCTTGTTATCGCTTCTCCTCCTTCCCCCACTCGTTCTTCAGCCCCAACCACTCCCCTTTCTCCACCGCACGCCGCCCAACCCGCTGATCGACGTCGATAAA 
CGGGTTTTTCCTCAGGGATTAACCCGAAAACGATCTAGCAAACCATCGAATCTTTCACACATCTCAGTGAACGAATAGTGTATTTGTTTCGATGTGCTTAAACGAATGGCGC 
AGTCAGTTTGTAGCCTTAGCGAATGTATTCTAATCTAATTGGTATTAACATCGAACGTTTATATATTTTATTTTGTTTTCGTGGTAAAATTCAACGCATACTGGTCACTCTG 
AGCTAGAGATGGCTCTGCGTCACGTCACCTCGCATCGATTGCAATTGCTCAACGGATCTGCTATCGATATCAACAAGTGTGTTGGTTACTAAATGCTTTTAGATCAAAAGGC 
GCCCCCTGGCGGAGTTATGCGTAATCAAAGAACATTGAGAAGGGCGCGCTTTTCAAAAACAAATTTGTCCAAATAATAAGAAACATTTAAGCATAATAAGAACATAAATGTT 
CAATAAATTCTATAGGTGCAGTAAAATACGTGTTTTGCTAGCACAGCTATTTTAAGCTGAGAACTTACAATATTTTAGTACATTGTGCTTAAGGTTTAAAGATACATATATT 
GAAAAAGTTCTGGAAATGCTTTTGAAGAACAAAGCTTTAAGAGGCGCGCTTTCCGAGTATGTGAGCTTTTTTCAAATGGCACACTTTCTGCGGAATATAAAATAAAAAAACC 
TAATAATATGCCGCAAGACCACTTAAGATAAAATATTAAAAATATTAAAATTGATTATATTTTGTGCACCAAAAGTGGCGGTAGCTTAATATCGCAAAACCAAATTAATAGC 
AATTCTGAGCCAAACCGCGATTGCCATTTCTGCATGTTGGTCCTCCGTCCGTTTCGCCATCTCCCTTTGGCCACCGCTTGGCCTCTCTTTCGCACACACATGGCAAACACAA 
GCCACGCCCATGTATTTCAACCCCCTTGACAAAAATACGTAACATGCCACTTTTATGTATGAAACGTGACAAATACGCACATGTGTGAGAAAGAGATCCACCGAGCTGGCGA 
GAAAGAGAGAGAGAGACGCTGGACGTTTATGTGTTTGGCAGGACTCGCGTATGCAGTGTGTGTGTGCGTGTGTGTGTGAGTGTGTGTGAGTGTTTGTTGTTGGTTAACGCAC 
ACGCGCAGACGTAGCAACACATACACATATGTGGAAAGGATATGTACATACATACATACATACATACATATGCCCGCTGACTTTTTCGCCACTGCCAA 
(SEQ ID NO: 10) 

Exon: 3914. .3576 
Exon: 3322.-2232 
Exon: 2170.. 1932 
Exon: 1854.. 1001 

Start ATG: 3241 (Reverse strand: CAT) 
Transcript No. : CT9057 

GACCAGTATCCGTTGAATTTTACCACGAAAACAAAATAAAATATATAAACCTTCGATGTTAATACCAATTAGATTAGAATACATTCGCTAAGGCTACAAACTGACTGCGCCA 
TTCGTTTAAGCACATCGAAACAAATACACTATTCGTTCACTGAGATGTGTGAAAGATTCGATGGTTTGCTAGATCGTTTTCGGGTTAATCCCTGAGGAAAAACCCGTTTATC 
GACGTCGATCAGCGGGTTGGGCGGCGTGCGGTGGAGAAAGGGGAGTGGTTGGGGCTGAAGAACGAGTGGGGGAAGGAGGAGAAGCGATAACAAGAACGACGCCGAAAAAAAA 
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GAGAGAAACGGAAATCGGAACACCAAGCGAAGAAACGTATCGAATTCGTATTCTTCGGAACGAAAGAACAACAAATCAGCCGCAATGAAGCCAACCTGCATATTGTGTCTCC 

TGGTGGTGATCCTCCTGCATCCGCGCATCTCAAAATCCTCAACGAGCGGCAACCCCAGTGCCAGTTCCAGTTCCAGTTCTCCGCCAGAAATCCCCGCCTTCCGGCAATGCGA 

AACCATTCGCATTGAGATGTGCCGCAAGATCGGCTACAATGAGACCTCCATGCCAAATCTGGTGGGTAACGAAATGCAAACGGACGTGGAGTACACACTGCAAACCTTTGCA 

CCACTGATCGAATACGACTGCAGTTCCCAGTTGAAGCTGTTCCTGTGCGCCGCCTACGTGCCCATGTGCACGCCCAAAGCTCCAGTCCATGCCATCGGTCCGTGCCGGAGTC 

TCTGTGAGTCCGTGCGAATACGTTGTCATCCAGTTTTGCAGGGATTCGGATTTCCATGGCCACCGGCTCTCGACTGCGATAAATTTCCGCGAGAGAACAACCACGAGACGAT 

GTGTATGGAAGGTCCCGGTGAATTGCATCAGCCGCAGCAGGAGCAGGATTTGTATGGGCTACCGGGCCAAGGAATACCCGGTGGATTGGGCGGTAAACTGCCCATGGATTGC 

TCGGGACTGGCCAAATCGCATCTGTACGTAAGGCTACCCAGATCTGGACGCTGTGCACCACTCTGCGAGGCGGACATACTGTTCACACCGGCGGAGAAGCATCTGGCAGAGA 

TCTGGGTCTCCACATGGGCCTATGCCGCTTTGGGTTTGGCCCTGGTAGCCACCGTGTGTCTTTTGGCCAGCGATGGCAGTCGTTTGGCCAGCGCCAAGTGGTCCAGATTGTT 

GTCGCCGCTGATTTGGTGCCACAACATGGTCACCTrGGGCTGGGCAGTGCGCTTTATGGTGGGAAGAACGGGAACTGCGTGCGGCACGGATCCCCAGGCGCCCAATGAATCC 

CTACTCACCGTGGACGGACTGTCCAATGCCTCCTGCGCTAGTGTCTTCCTTATGCGCTACTATTTTGGAATGGCCGCCTGCGCTTGGTGGGCTGTGCTCTGTTTGGGCTGGC 

ACCGCGACATTCGTCGGCACAGTCCGGACTCCAAGGGCCATGTGGTCATTCCCTCGAACTTTGGAGGCAGTCCGGCGAAGAGGAACAGTGCCAAGACGGCTCAGCAGGATCT 

AACGCAAAACAACTTTGTGTGCTTCGTGGCCTGGGGATTGCCGGCCTTTCAAACGTCAGCAGTGATCGTTGCCCGTTTCGTGGATGCTGACGAGTTGCTTGGCAATCAGTCG 

GATAAGGCGCTACAGATATTGGTGGCTACGCCCGTrTTTTGCTACTGGATCTTTGGCTCGATGAATCTGATCTCCGGCTATTTGGTACACTGrCGCACCAAGGAGATCCTGA 

GGAATAGCAATGCACTCAGCGTGCAACAGCAACTGCAACAGCTAAGCGCACACAGTAGTTCCGGCATTGGCATATTCCTTTTCATCTATGGACTTGCATGTGCCATGCTTCT 

GTTGGCTGTCATTTACGAGTTTGCCAACATCGATGTCTGGCTGGGATCGGGTGATACCAATACGCCACTTTGGCCATTTTTGTTGCGCGCCTTCATGGAACTGATGCTGGGC 

ATTTGCTGTTTTGCCTGGGTACTTGGACCAAGCATTTCCACTCTCTACAAGAGGCAAGTCAGCAATGGCAAAATGGTGAAGCATACGTCTGCCGGAGCAGCAACTGGACACC 

TGGACGGGCATAGTAGCTCGCGTGGATCGCATGCGGCTTGCAATAGCACGGTGGTCTCCTACCATTCGGTGCGAACTTCTATGGCCAGTGTTCCGTTGCCGCCAAGTCCGTA 

CAAACTGAAGACTTCGCCTGGTACCGGCTCTATATCGCTGAATCAGATGTCCAACTATTCGCTGGGCAGGAGCGTGCATCACCAGCAGCGCCATTCGCCGCATCATCATCAT 

CACCAGCAGCAGCAGCATCATCAGTTCCATCCCCACCACAATCACCAGCATCACTCCACATCCTCGCATCGGCTGTATTATCCGCCAGGTAGCTATGCCTCGCAAAAGTACA 

GCCAGCACGGCAGCTACTATCCCCACTTGCAGCAGTATGGCAATGAGACGCTACTCTAG 

(SEQ ID NO: 11) 

Start ATG: 421 (Reverse strand: CAT) 

MKPTCILCLLVVILLHPRISKSSTSGNPSASSSSSSPPEIPAFRQCETIRIEMCRKIGYNETSMPNLVGNEMQTDVEYTLQTFAPLIEyDCSSQLKLFLCAAYVPMCTPKAP 

VHAIGPCRSLCESVRIRCHPVLQGFGFPWPPALDCDKFPRENNHETMCMEGPGELHQPQQEQDLlGLPGQGIPGGLGGKLPMDCSGLAKSHLVVRLPRSGRCAPLCEADILF 

TPAEKHLAEIWVSTWAYAALGLALVATVCLLASDGSRLASAKWSRLLSPLIWCHNMVTLGWAVRFMVGRTGTACGTDPQAPNESLLTVDGLSNASCASVFLMRYYFGMAACA 

WWAVLCLGWHRDIRRHSPDSKGHVVTPSNFGGSPAKRNSAKTAQQDLTQNNFVC FVAWGLPAFQTSAVIVARFVDADELLGNQSDKALQILVATPVFCYWIFGSMNLISGYL 

VHCRTKEILRNSNALSVQQQLQQLSAHSSSGIGIFLFIYGLACAMLLLAVIYEFANIDVWLGSGDTNTPLWPFLLRAFMELMLGICCFAWVLGPSISTLYKRQVSNGKMVKH 

TSAGAATGHLDGHSSSRGSHAACNSTVVSYHSVRTSMASVPLPPSPYKLKTSPGTGSISLNQMSNYSLGRSVHHQQRHSPHHHHHQQQQHHQFHPHHNHQHHSTSSHRLYYP 

PGSYASQKYSQHGSYYPHLQQYGNETLL* 

(SEQ ID NO: 12) 

Name: Frizzled 4 -like 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384825 

TCATACTCACTGGCrGCCAACCCCTACTCTACTCTGACTACTTTGGTTCTGGTTTTGGATCTTGGCTGATGCTAGGTTCTAACTTTAAGTACGATTCTTAGTATTGCTCACT 
GGATACACTGTAGATACACCGCAGATACAGCCGTCCTGCGGCGGCACAACAACAAGAAAACAAACGCTAATGCAGCTACTTACGACTAGGAGGATGATTAGTGGCAAGGGGT 
TAAGGAGCGGGTTAATCACAACTGGCTGGGAAACTGAAAGTGGCCGACAAGCGCATCTACTTAACCTAGATGATGGTCCTACGCGGCCCAGGGTCAATCACATTGCGGAGTA 
GGTTCTACACAGAGAGATGCGATTTGGTCAAAAACTAAGCTTAAACGACCGCGATGTTATCTTArCAATTAAAACGAAAAAAATGGAATCAACATTTrTATAGACCCATTTA 
AAGAGCTATCATCTTTTTTCACCGAAATACTTAAGGAAATAATATGCATGCTTAGTAAAGGAACACTACACTAGGTCTTAAGAAATGATTATCAAGAAGTAAACATTTGGCA 
CCCCGAGGCGAATACTGATACTGGTAGTTTTTAGAAAACTTTGCTGATACCGAAAACTCGCGAATTAAGATAAATTCTAACATTGCACAAATTTCTTAGTTTACATTGTAAA 
TTTTGTCTACTTCTAGGCAGAGCACAATGTGCCTGCAATTTTCTCCGTGTTGTTGATTCTGCTAAGTTCAGGAAAATGACTAGCTATGTGAACTTGACTrGAGGAGGGAGGG 
GGAGGTTCGTATCGTGACTTATCTAGCGTTTTACCCTAATGCATTTGATTGCTTTAAGCTGATGATTGGTCAGTTGCACACGCGCTATTCTGGGGCAGATTTTGTGGGCGTG 
TCGTAAAATAATTGCTTGGTGTTTCGCTTCTAATTGACGAGGACTTGGACTTGGATTTGAAGTGTGGACCGTGTACAGAGTGGACAATGGACAGGGCATCTTATTTATTCGG 
TGCCATTCGCAGGCGCCTGCTGCTCCTCTTCCTGGGAATCGCCGAGGGAAAGCCTCGACTTGCCCGCTGTTTCCATCGATTTCCAGGTGCCTCCACTGGTGGCCTGCTGCAG 
CGAAATGCGAGGTATGGACTTGCAGCTGCCCAAGATAGAGGTGCTATCTCCGGGACGGTCCTCGGATTCGGCCAGGGCGCTGAGACTGCAAGTGGGCGGCAGGATGCGGTGT 
GGAAAGCGTGGCGAACCCCTTACCCTTTCCCTTAAACTGGGTGGCTGCTGCACCACCGAAGGTCTCCGGTTGCTGGGCAATTGGAAGTTCACCTTGGTCTCCAGCAGTTTCA 
CATACTTTCCACCCAACGTTTGGTCCCCCTGTTCACCGCCAGAAATAGTGGGGATAGGCAGGAAATTCGGATCCTTGGTAAGGTGATCCAGGTGAGTACCGCTTTTGGTCAA 
TGAACCCCTATGGATGGGCTTTAGGTGGTTACCATTCGGGTGGGAGGACGCCTCTGAGGGAATGGAATCCTGATTTGGATTATTGTATATGGCATTGCAGGACGCCGTTTGA 
GAGGAGGGAATCAGGCGGGTGAACAGTCCCATGTGCGATTTAAAGGCCGCCTTCAGACCCTTTTGGTCCGCCAAGCTGGCGGTGGATCCCTTGTTGCTCGAAGTGGTGCGAG 
AAAATATCCCTCGTAATCGGTTTATCAAGCCCGTCTTAGCCTCCGTGGCCTGCTGGTGGGCTAAACGGAGGTTAATCAAGTCCTGTGCAGGCAGGCAGGCCTCCTCTAATTG 
GGGCTGGGAGGCAAAGGTAACAGAACGCCTCATCGGGATCTGCATGTACTCTGCTGACGGCCAACTGGCCCGTGGAATCACAGGAGGCAGCACGGAAAGCAGGAGATTGGGC 
AGTATTCCCGAGATACTTGGAGTTCTGTTTTTACTTGCAGCCGGAGTCTGCGAGATGCCACTGGTTACCGTCAGTTCTGGCTTTAGATGACCTCCCCCAGTTAAGAGTGATG 
TGGACGACGAAGTTGTGGTGGAGGTGGTGGTTAGTCCCGACTCGAGTAGCCTCTCCAGCTTTCGAATCTCGGCGTCCAGAGCCTGGATTTCCTTCTTATAGACCCTGTTTTG 
CACCTCCACTCGATACTGCAGTTCTCGGCGATCATCGACCACGAATCGGCGTGTGTTGCACTCCATCTTAAGGCCCATACTGTGGATAACCGGATCGATAATATCGTCTGAG 
GGAGAAGGAGAGTTCCCAAAGGATTAGGACACAACTTTATAAAGGTTCTAGAAATTAGGAGACTAGAACTTAAAAGTRTTTCCTAAGTGAGCAAGATCTTATTGCACCTTCA 
AGCGCTACTTACTTCTTGCCCAAATATCATGGAGTTTTGGGATGAAAAGCAGACAAAGGGTTGCAGTGGTGCTGGTTAAAATCAGAGCTGTGATTGTGATGAAGGCCAGGGT 
GACTCGCTCCGAAATCAAGTTGGCCAGCACCACGACGATGGCGCTGGTGATGACCACACTGTATACAGACACTCCGATGTACTGCGAGTCATTGAGAGCAGGTATTTTTACG 
TGGCGCGTCTCCCAGGCCATATAGACACCCACCACAAGAAGAAGGCCTTTGTAGGCGTACAGGACACTCAACCACGTTTGCGTGTGCTGCGAACGGCAAACTTCAACCTAAA 
ATAGATAGATAGATAGATATTCATAGATTCTAAATGAATTAATACTTTTTCAACATAATTAAAAGTAATCTGATAATTTGAAGCATATTAATATTTTTCTTACCTGAGGCTG 
GTAAACGACACTTCTATCAGTCGCACTGATCTCGAGCGTCAGGTTGTGAAGATGGCGCTCCATTGGATCGGTGACCACCCAAAGGGTTACGAGCAGCGCATCCACCAGAAGC 
AATCCGCCGACGAGCAAGATCAGTTGAATGTCCTGCAGCATCTTGTCCTTGAAAACGCTGCCGGTACGAGTGAATATCCGATGCACTCTGTAGGTCTTGGCAAACATCGATC 
CAAAGGCCAACGAGAATCCGGCGGAGAGCAGATAGACGCGGGCCTGGTGAGCATGATAAGCGGATGACGGGGAAGTTCAGGTTAGGGAGAAAGTACTGTAGTAGCAGCAGCA 
AAGTAACACACTAAGTGTCATGTCAGGCAGACCCAAAGAAAACAAGCCGGCGGCAACAATGGGGGAAACAAGTCATCAATGGACCACAATTCTCAGAGTGCTTGAGACTTTG 
TTGTGCGTTTCAAATTTATGCAAATAAAGGAATATCACAAAAGTTGTAAATTTCCGTGTCATGTGTTTGCATTTCCCTGCCACAAGCCTCAACAAAACAAAAACAAGCAGGC 
AACTGGAAAAGCGGAACGACAGTAGAAGTGGTTACAGAAACAGAAACAACAGCGACCAGCAAATTTCCCAGGAGTGGCGTTCGAGGCAGTTGTTAAGGGGGTCAAAGTGGAA 
AGGCGCAGATCGGGATAGAAGAAAGCTCGTCAAACTTGACTTGACTTCTGGCTTAAAGAGCAGCAGATTTGCAGCGAAACCGAACGGCTATTTATAGCCAGCAGGGGCAGTC 
GTGTATCTTGAAAAGACACGCGGAATAATCAATATTCAGTTTATATAATCCACTTTAAAATAATAGAAATTAACAAAAAATTGCCACAATTATCAGTAArTTTTATACTTCG 
CAGGCTATTTTATCGCCATCTTCGTGTATACGAATACGACAATTAGTTCAAGCAATATTTCTTTCAGTGAATAAATAAAGCAAACACATGAGCGCGACTCGAAAGTTACCAG 
CCTGAAACTGAAAGTCATATTTGATGAGGACCCCACATGCTTGCTTACACACAGAAGCCCGTCAAAGTCAAACAAAGACGGCTGAGAGGAAAGGCGGATGAGAATGCGGATT 
CGGATAGGAATGGCGATAGGGATGGGGATGTCGGAGCCGAAGGATAACAACTCACCGTGCAGACCGTTGCGAAAGAGTCCTCCGCCGAGGGCAGCGTCGAGTGGTCCAAGCC 
CAAAAGGATGACGGTGGCGTACACAAAGATGCAGCCCACTGCGGTGATGTTGCTCAGCTTCGGGCTGGAAAGTTTAATTGCCCTGTGGAGTGAGAAGGGGGATGGTCATGGT 
TATAAGCTGAGGCGCCGCAATCCTTTGCAGAGAAAGTGCAGAGTAACCGTAGAATGCGAAACATATATACGAACTCAACTAACGATGGGTCAGAAATACAGCTGGAAAGCCA 
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AGCAATTACTTCRCTTCAAAAACCGAAACAAGTCACATGTGTAATGGGTTCAATGGGTTGAGCACCGAAAGCAGTTAACTCATCATGGAATftTAAAGAAACRGCCCGCACAA 

ATTCTCTTTTATATGGCCACTAAATATTTTATAGTCGTTTAATGAAAAATAATATATTCAAACAATCAGGGCCCATAAAAAGAGTAAGCCAATAATTTATGTACTTATCGCC 

TTACTTCAGCTTCCGAAAGTGCAGATTGAACGCCAGGAAGGCGATGGCCAGAGCGATTCCCACGCTGGAGAGGGTGGCGATGGTGTAGAAGGCCAGTGGAGCGATGGTCGCC 

ACCCGCAGCTTGAACACCCGCTTGGCGATGGGTACCTGCCCGCTGTGCCACTTCACCGGCCGGCAGCGGGGACACCGGAAGTCCAGGGCATCCGTGGCCGGATAGTAGAGGG 

CCACCGGTTCCAGCAAACCGCGCTGGATTTGATAGAAGGCAGTGGTGCCAACGCGATCTGGGCCGCTGAAGGAAACGGGGCCCTACCAAAAAGAGAAATACTCGTGATGAAC 

TAGATTTAAGAACCTTAGGATCAAAGTTAAATGGGTTGGATGATTTCAGTTGAAAGAGATAACTCACCGACACTCCCAGGAAGTGGAGCTTGCCCATTTGCTGCAGGAACTC 

CCAGGCCATGTCGCTGCGGGTGTAATCGAATCCGTCCAGCTTCGACTGCTCCTCGTTCCGCCGCCAGTGCTCCTCAGCGGCTCTCAAGGCCAGGGCGATGGCCCACACGGCG 

TCGTAGGTTTGCGGCGCGTACTGCGAGATCGCCTCCGGAAAGAGGTGCCCTCCGCTGGTGCCTCCCACGCCCCTTCTCCTCCGCCGACGAGAGTCAGATTGCGTTGCAGCGA 

TACTGATCCTGGAACCATAACCGGAGCCAAATCCATCCTGGCCGTGGAACTGGGCGGATTGCTTGCGCAGCTGGGAGTTGAACATGTGATTGTTCTGCGAAGTGGAAACGCG 

ATTAAACAAGCCTCGAATGCCGCAGAAATGTCGCCAGTAATGAGATTGCCTATGGCAGCTACCACTCCTGCCCCAATTTCATGACCAGCACTAAAAGAGCCATAAAAACGCG 

TCTGCGATGGTGAATGCGAATGCGATGACAGGCCGGACTTAGGGGGCTTCGCACTTGGCAATCACCATCGCCATCGCCATTGCTGAAACCATCGCGAAACGCAATCGCAGCC 

TCGGGATTGTCTAATCAAATAAATCATTTCATTTCGGACACGCACAGCAAGACGGACAAGACAGGACAACGAGAAGGACAAGATAGTCGTCTCTGAATAAAAAAAAAAAAAA 

AAAAAAACGGGGAATAAAAGTTTCGGCAATTCTGGAAAATGTTGCTTGAAGCTGCCTTTGAGAGTTGGGTTCAATCCAATTTGGATCCCGAAATTTATATGCATGTATAACT 

TAACAACAATCTATTCTTGAAATTCATCTTTACGTTAATCTCATACTGTCATGAAATTAATTTATTTATTTTCCCTTTAATATTGAATGATATTTTCTCGCAGTGCACCTCC 

TCAACTTACCAGTCCACTATAGCTGACGTTATTTCCAACGATGCTGTTGTGCGTTGAGACCACGATGAGGTTCTCGACGGCCAGCTGCAGTTCGTGGTTAGAGCAGGCGGTG 

CGCTGGTCCGGCCACCACGGAGCCCCCATGCTCTCGTGGAGGATCCAGGCGTAGTCCGCCCCGAACATTCGAAGCCTGTAGGCCTCGCACAGGATCTGGGGGGCCAGCTCCT 

GCGAGAAGCTGCCGATGATGATGCGCGTGTCCGTCTCCTGTGGCGATACGAACGAAACGGAACGGCATGTGCCCATTTGTCATTTGTCATCCTTCGATTAGCACAAGTGCCA 

ATCCACACTCAAAAAAAATGGGTAACAATCATCATGCTGATAGAGGGTCTCTAGGCGATGTGGTCTGCTAATGAAACATTTATCCATGCGGACTGGACCAACAAAAAACTTG 

AAGATAAACTCAGAGAAATAAGTTAACGTAAAAAACGTAATTATAATCATACATATGAACCTTTTTTTTAAAAACTTACTAAATTAAGTTGCCAGGCAAATAAATTGGTCTA 

TTTAAATATTCCCCAAATACGTTCATTTTCATATTTGTTCATGCATAAGCTTTTAATATAAACAGAACAGCGCTAGCACCATTTTAATTTAAACATTTTCGAAAATTGAAAT 

TTAGTTAAATTTAATGTTGGTTTTTCAAACAAATATTCATAAATGTTTACGTTTATACGTTTGCGTATACGAGTATAAAATAAAACGAAAATGTGTATTTAAATGGCCATTT 

GTTGCTCACCCTAAGTAGCAGCAGCTGCTCCTTGAAGTCGGTGGCCGCAAAGGTGATGGTGGCGGCACAGGATATGTTGGCCGCCTCCAGTTCGGTGACCAGGTTGTTCACC 

GCCAGCGAGTGAACCTCCTCGTTCTGCGAGAAAGTGGTCACCGTGCCCCAGCCAAACTTCCGAATGAAAGCGATGCGCGCCGGATTGTGTGAGGAGTCCGGGGCCACCGTCC 

TGTAGAAGTAGGGGAACTCCCGCCTGTCGCTCAACGCCGGCGATGTGGAACCGAAGGATACCTTTCATCGAACGAAATCGTTAGACTATCTCTACCGTACATGGGGTCGCTT 

AAAATGCAAGACACGCATCAAGAAATCCAGCCCTCTAGGTCATGTGTCTTCCCAGTTTCCCCACTCATTTAACACACGCACCGCCGAAATGAAACTTCAAAGGCCAAATGCA 

AATGCTCTAAGTGAATTTGTGGACCTCATAATTTGTGGGCTGTGGAAATGTTTATGGCTCAAAGGTTTATGGGGTGCTGGAATCGGCTGGATTCCGCAGCATTTATGCTAAT 

CGAGCTACGCACGTGAGTTGATTAGATTTTTGAGAAATTGCCCGATACGAGGAGTTGGTTGGATGAACCGAGCGAGCTGGATTATTTGCATTTTAATCAGTCGCAGATGACT 

CACCTGCACGATGTTCCAGTAGGGCACCACCTTCGCCAGGCTCTCGGTGACCTCCGAGCAGGCCGATCCCAGCAGCATCACCATCCTCGTCGAGGGCTGTGTGTAGATGGCG 

TGGAAGAAGCGATCCACGCCCACTCCAGGATCACACTGATTGTCGGAGAAGAAATGTGTGAAGAAACGGAGGAGCAAAGGACACCACATCAGAGATGAGTAATAGAGGGGGC 

AGGGAAAATAGCAGTGGCAGAGGCCGTTGCTCGCTGACTCGTTTCGACGCAATAAAAAGGCAACATGATGGCGCAATAAACATTTTTAAAATGTCGACTTTTAGAGAGGCAC 

AGCCGGAGATGCGGCTCCAGATATGACAGTCTGTTTAAGGTCGATGTGCATATGCATACACTTAAAAAAATCGTTTGAAAACCTAGTTCACTAAGTGCTGACATCACAGTCG 

AGGTATTACGTATTGACGGTTTAATTTATCAGAAACTAAACTACTATTATGCATTTTCTTTTTTTTACCATCAATTAATTGTAGCTTTAATACACTACTGCACCAAGTTAAA 

AATGATTGCAGGTAAATAGTTTACATTGTrGGTAACTTAGTCAGTGGCATTGCTTTTCTTCAGTGACAGTGTTCTGTACATTTGCGTGCCCCATCAAAACCGGCAACGACTC 

ATCGCAATAAAAGCGCGAAAGACGGAAGGTTGATGACATATTTGCCTCTCTTTTAGGGCCACAGCCGCAGCTGCTATCCGCACTGTTCTTGTTGTTGCAGATTTTCTTATTA 

AGTTTTAATGTTGCACGCGCTACTGAATGAAAATTCAATTTCGTAATGGAACAGCGACGCCTAGAAGTGGCGCCCAACGTGGGGCATACGGTTACGCCACGTAGAAATGACA 

TTCTCACCTGAGTATCGTTGGTCACGAGCTCGAGGGTGTAGCCCGGCAGCAGGCGCTTGCGGTTGATGTGCTCCACGGCCATGGTGGCAGCTCCCAATTCGCTCAGACCATC 

CGGACGTGGTCCCCGCGATGTGGACAGCTCAAAGAGTCCGAGCAGCACGATTTTGCCATGATTGCGCTGCATTTCGGCCGGCGACATGCGGCGTTCGCTTTCGAACACCTGG 

TTGACCTTCAAGTATCGCTGCCTGGGGGGCAGATTGAGGAGGTGGTGGGCGTGGGAGGCATTGGTGCTTCCGGGCAGATTGCTGTGCGAATTGAGGCGCTGCTGCCGGCGTT 

TGACTTTCGTCCGTTTGTACTTCCAATCGGAGCGCGGTGGAATCGGAGTCCCCCCTGCAGTCACAGTCGATGCCGATTCTCCGGGTGGCGAGCTGGATGGGCTGGATGTGCT 

AATTGAGGCAGCACCTGGCCTAATTGCCTCGTAGCCCAGGGCCTGCAGCTCGGCTGATGACTCCATGGCAGCCGCTGCCGACGTTGCCCAGGCAAGGGCCAGGACTAGCCTC 

AGTCCCACGGCCGGCCATGGACCGTATCTGGTCCCTTGGACCGGTTGAATTATGCGCATCCCGTCGGCAATGGCCAATGGCCCATTAATCACCAAGGCATTTGAATCGTCGG 

CTATCAGAGGCTGCAGCAACAAACAAACACGAGCCACACAGCACAGGACACCGGAATATATATCGCCTTTTTATGCACTCTGCCGACAGGATACGCTTATCGTTTCTCTTCC 

GACGAGCGGCTCTTCCTTCTGCTCCCTCCTCTTGTATCGGCGTCAATTTTATCGCCTTTTTATTGGCCTCACTTTTGCTCGCATTTTGTATCTTTTTGCACATTCACTCTGC 

ACAGCAGTCGACCATAAAAGTATGCAATGCAAAGGACTTTCTCGATAGGTCCGATTGACAGGATTGGGTAACACGGGAAAAAACTACACGCCATTCGCTAACTTTCAGTCAC 

TTTTCGTAGAAAAAAATGCTTATTTTTATTATGCACTGTTTATTAAGGATTAACACAATTTAGGTGAATATTTATTAGTATTTTGCTGTAGTATGTGATTTTATTAAAATGT 

AATTTGAATCTACATATGTATATATATGTATATATGTGAAAACATTGATTTTTTCATGAGAATGTGAAAGGGAATGTTAATGGAAATACCGAAACAAATGTCTAGGGAGGCA 

TAAAAAGGCATTTCACACCAATTTGGCAGTCGAAAGGATTATGACGAAGGCTTCGATGATATGGACACACCGCGCCGCTCACGTCTCGTCCCATGACTTGTTATGTGCCTGC 

CACTTGGCCATTATTTTTTATGCTAATGGCCAAACATTTTGCGCTTTCATTGCGGGCTTTTGGACTACTTTTTGTCGAGTCTCACGCGCCACAGATTTCTGTGGCGAACGTT 

TTTCTCTGTATGCGCGCGTATGCATAATTGTTTCAAGATTACCAACTTGCAGTCCCTTTGCCGCTTTCCCGCTTTTCCACCGCCCACTTGCCTGCACCTAAGCCCACTTCAT 

AGACGCTTTCCTTCGCGCTTTCTGCTTTTCCCACAGCACGCTTTTCGCTGT 

(SEQ ID NO: 13) 

Exon: 8795.. B184 
Exon: 7427.-7284 
Exon: 6893.. 6618 
Exon: 6085.. 5834 
Exon: 5246.. 4884 
Exon: 4786.. 4485 
Exon: 4114.. 3976 
Exon: 3067. .2792 
Exon: 2683.-2365 
Exon: 2235.. 1001 

Start ATG: 8795 (Reverse strand: CAT) 
Transcript No. : CT9836 

ATGCGCATAATTCAACCGGTCCAAGGGACCAGATACGGTCCATGGCCGGCCGTGGGACTGAGGCTAGTCCTGGCCCTTGCCTGGGCAACGTCGGCAGCGGCTGCCATGGAGT 
CATCAGCCGAGCTGCAGGCCCTGGGCTACGAGGCAATTAGGCCAGGTGCTGCCTCAATTAGCACATCCAGCCCATCCAGCTCGCCACCCGGAGAATCGGCATCGACTGTGAC 
TGCAGGGGGGACTCCGATTCCACCGCGCTCCGATTGGAAGTACAAACGGACGAAAGTCAAACGCCGGCAGCAGCGCCTCAATTCGCACAGCAATCTGCCCGGAAGCACCAAT 
GCCTCCCACGCCCACCACCTCCTCAATCTGCCCCCCAGGCAGCGATACTTGAAGGTCAACCAGGTGTTCGAAAGCGAACGCCGCATGTCGCCGGCCGAAATGCAGCGCAATC 
ATGGCAAAATCGTGCTGCTCGGACTCTTTGAGCTGTCCACATCGCGGGGACCACGTCCGGATGGTCTGAGCGAATTGGGAGCTGCCACCATGGCCGTGGAGCACATCAACCG 
CAAGCGCCTGCTGCCGGGCTACACCCTCGAGCTCGTGACCAACGATACTCAGTGTGATCCTGGAGTGGGCGTGGATCGCTTCTTCCACGCCATCTACACACAGCCCTCGACG 
AGGATGGTGATGCTGCTGGGATCGGCCTGCTCGGAGGTCACCGAGAGCCTGGCGAAGGTGGTGCCCTACTGGAACATCGTGCAGGTATCCTTCGGTTCCACATCGCCGGCGT 
TGAGCGACAGGCGGGAGTTCCCCTACTTCTACAGGACGGTGGCCCCGGACTCCTCACACAATCCGGCGCGCATCGCTTTCATTCGGAAGTTTGGCTGGGGCACGGTGACCAC 
TTTCTCGCAGAACGAGGAGGTTCACTCGCTGGCGGTGAACAACCTGGTCACCGAACTGGAGGCGGCCAACATATCCTGTGCCGCCACCATCACCTTTGCGGCCACCGACTTC 
AAGGAGCAGCTGCTGCTACTTAGGGAGACGGACACGCGCATCATCATCGGCAGCTTCTCGCAGGAGCTGGCCCCCCAGATCCTGTGCGAGGCCTACAGGCTTCGAATGTTCG 
GGGCGGACTACGCCTGGATCCTCCACGAGAGCATGGGGGCTCCGTGGTGGCCGGACCAGCGCACCGCCTGCTCTAACCACGAACTGCAGCTGGCCGTCGAGAACCTCATCGT 
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GGTCTCAACGCACAACAGCATCGTTGGAAATAACGTCAGCTATAGTGGACTGAACAATCACATGTTCAACTCCCAGCTGCGCAAGCAATCCGCCCAGTTCCACGGCCAGGAT 

GGATTTGGCTCCGGTTATGGTTCCAGGATCAGTATCGCTGCAACGCAATCTGACTCTCGTCGGCGGAGGAGAAGGGGCGTGGGAGGCACCAGCGGAGGGCACCTCTTTCCGG 

AGGCGATCTCGCAGTACGCGCCGCAAACCTACGACGCCGTGTGGGCCATCGCCCTGGCCTTGAGAGCCGCTGAGGAGCACTGGCGGCGGAACGAGGAGCAGTCGAAGCTGGA 

CGGATTCGATTACACCCGCAGCGACATGGCCTGGGAGTTCCTGCAGCAAATGGGCAAGCTCCACTTCCTGGGAGTGTCGGGCCCCGTTTCCTTCAGCGGCCCAGATCGCGTT 

GGCACCACTGCCTTCTATCAAATCCAGCGCGGTTTGCTGGAACCGGTGGCCCTCTACTATCCGGCCACGGATGCCCTGGACTTCCGGTGTCCCCGCTGCCGGCCGGTGAAGT 

GGCACAGCGGGCAGGTACCCATCGCCAAGCGGGTGTTCAAGCTGCGGGTGGCGACCATCGCTCCACTGGCCTTCTACACCATCGCCACCCTCTCCAGCGTGGGAATCGCTCT 

GGCCATCGCCTTCCTGGCGTTCAATCTGCACTTTCGGAAGCTGAAGGCAATTAAACTTTCCAGCCCGAAGCTGAGCAACATCACCGCAGTGGGCTGCATCTTTGTGTACGCC 

ACCGTCATCCTTTTGGGCTTGGACCACTCGACGCTGCCCTCGGCGGAGGACTCTTTCGCAACGGTCTGCACGGCCCGCGTCTATCTGCTCTCCGCCGGATTCTCGTTGGCCT 

TTGGATCGATGTTTGCCAAGACCTACAGAGTGCATCGGATATTCACTCGTACCGGCAGCGTTTTCAAGGACAAGATGCTGCAGGACATTCAACTGATCTTGCTCGTCGGCGG 

ATTGCTTCTGGTGGATGCGCTGCTCGTAACCCTTTGGGTGGTCACCGATCCAATGGAGCGCCATCTTCACAACCTGACGCTCGAGATCAGTGCGACTGATAGAAGTGTCGTT 

TACCAGCCTCAGGTTGAAGTTTGCCGTTCGCAGCACACGCAAACGTGGTTGAGTGTCCTGTACGCCTACAMGGCCTTCTTCTTGTGGTGGGTGTCTATATGGCCTGGGAGA 

CGCGCCACGTAAAAATACCTGCTCTCAATGACTCGCAGTACATCGGAGTGTCTGTATACAGTGTGGTCATCACCAGCGCCATCGTCGTGGTGCTGGCCAACTTGATTTCGGA 

GCGAGTCACCCTGGCCTTCATCACAATCACAGCTCTGATTTTAACCAGCACCACTGCAACCCTTTGTCTGCTTTTCATCCCAAAACTCCATGATATTTGGGCAAGAAACGAT 

ATTATCGATCCGGTTATCCACAGTATGGGCCTTAAGATGGAGTGCAACACACGGCGATTCGTGGTCGATGATCGCCGAGAACTGCAGTATCGAGTGGAGGTGCAAAACAGGG 

TCTATAAGAAGGAAATCCAGGCTCTGGACGCCGAGATTCGAAAGCTGGAGAGGCTACTCGAGTCGGGACTAACCACCACCTCCACCACAACTTCGTCGTCCACATCACTCTT 

AACTGGGGGAGGTCATCTAAAGCCAGAACTGACGGTAACCAGTGGCATCTCGCAGACTCCGGCTGCAAGTAAAAACAGAACTCCAAGTATCTCGGGAATACTGCCCAATCTC 

CTGCTTTCCGTGCTGCCTCCTGTGATTCCACGGGCCAGTTGGCCGTCAGCAGAGTACATGCAGATCCCGATGAGGCGTTCTGTTACCTTTGCCTCCCAGCCCCAATTAGAGG 

AGGCCTGCCTGCCTGCACAGGACTTGATTAACCTCCGTTTAGCCCACCAGCAGGCCACGGAGGCTAAGACGGGCTTGATAAACCGATTACGAGGGATATTTTCTCGCACCAC 

TTCGAGCAACAAGGGATCCACCGCCAGCTTGGCGGACCAAAAGGGTCTGAAGGCGGCCTTTAAATCGCACATGGGACTGTTCACCCGCCTGATTCCCTCCTCTCAAACGGCG 

TCCTGCAATGCCATATACAATAATCCAAATCAGGATTCCATTCCCTCAGAGGCGTCCTCCCACCCGAATGGTAACCACCTAAAGCCCATCCATAGGGGTTCATTGACCAAAA 

GCGGTACTCACCTGGATCACCTTACCAAGGATCCGAATTTCCTGCCTATCCCCACTATTTCTGGCGGTGAACAGGGGGACCAAACGTTGGGTGGAAAGTATGTGAAACTGCT 

GGAGACCAAGGTGAACTTCCAATTGCCCAGCAACCGGAGACCTTCGGTGGTGCAGCAGCCACCCAGTTTAAGGGAAAGGGTAAGGGGTTCGCCACGCTTTCCACACCGCATC 

CTGCCGCCCACTTGCAGTCTCAGCGCCCTGGCCGAATCCGAGGACCGTCCCGGAGATAGCACGTCTATCTTGGGCAGCTGCAAGTCCATACCTCGCATTTCGCTGCAGCAGG 

CCACCAGTGGAGGCACCTGGAAATCGATGGAAACAGCGGGCAAGTCGAGGCTTTCCCTCGGCGATTCCCAGGAAGAGGAGCAGCAGGCGCCTGCGAATGGCACCGAATAA 

(SEQ ID NO: 14) 

Start ATG: 1 (Reverse strand: CAT) 

MRIIQPVQGTRVGPWPAVGLRLVLALAWATSAAAAMESSAELQALGYEAIRPGAASISTSSPSSSPPGESASTVTAGGTPIPPRSDWKYKRTKVKRRQQRLNSHSNLPGSTN 

ASHAHHLLNLPPRQRVLKVNQVFESERRMSPAEMQRNHGKIVLLGLFELSTSRGPRPDGLSELGAATMAVEHINRKRLLPGYTLELVTNDTQCDPGVGVDRFFHAIYTQPST 

RMVMLLGSACSEVTESLAKWPYWNIVQVSFGSTSPALSDRREFPYFYRTVAPDSSHNPARIAFIRKFGWGTVTTFSQNEEVHSLAVNNLVTELEAANISCAATITFAATDF 

KEQLLLLRETDTRIIIGSFSQELAPQILCEAYRLRMFGADYAWILHESMGAPWWPDQRTACSNHELQLAVENLIVVSTHNSIVGNNVSYSGLNNHMFNSQLRKQSAQFHGQD 

GFGSGYGSRISIAATQSDSRRRRRRGVGGTSGGHLFPEAISQYAPQTYDAVWAIALALRAAEEHWRRNEEQSKLDGFDYTRSDMAWEFLQQMGKLHFLGVSGPVSFSGPDRV 

GTTAFYQIQRGLLEPVALYYPATDALDFRCPRCRPVKWHSGQVPIAKRVFKLRVATIAPLAFYTIATLSSVGIALAIAFLAFNLHFRKLKAIKLSSPKLSNITAVGCIFVYA 

TVILLGLDHSTLPSAEDSFATVCTARVYLLSAGFSLAFGSMFAKTYRVHRIFTRTGSVFKDKMLQDIQLILLVGGLLLVDALLVTLWVVTDPMERiJLHNLTLEISATDRSVV 

YQPQVEVCRSQHTQTWLSVLYAYKGLLLVVGVYMAWETRHVKIPALNDSQYIGVSVYSVVITSAIVVVLANLISERVTLAFITITALILTSTTATLCLLFI PKLHDIWARMD 

IIDPVIHSMGLKMECNTRRFVVDDRRELQYRVEVQNRVYKKEIQALDAEIRKLERLLESGLTTTSTTTSSSTSLLTGGGHLKPELTVTSGISQTPAASKNRTPSISGILPNL 

LLSVLPPVIPRASWPSAEYMQIPMRRSVTFASQPQLEEACLPAQDLINLRLAHQQATEAKTGLINRLRGIFSRTTSSNKGSTASLADQKGLKAAFKSHMGLFTRLIPSSQTA 

SCNAIYNNPNQDSIPSEASSHPNGNHLKPIHRGSLTKSGTHLDHLTKDPNFLPIPTISGGEQGDQTLGGKYVKLLETKVNFQLPSNRRPSVVQQPPSLRERVRGSPRFPHRI 

LPPTCSLSALAESEDRPGDSTSILGSCKSIPRISLQQATSGGTWKSMETAGKSRLSLGDSQEEEQQAPANGTE* 

(SEQ ID NO: 15) 

Name: GABA B-like 

Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000012808751 

TTTTCTGCCCGTTTGTTTGTGCTCCCGTCCAGGTGAAGGTTAAGTGCTCCGAGCTGAGGATCAAGGACAAGAAGGAACTCACCAAGCAATTGGATGAGCTCAAGAATGAGTT 
GCTCAGCCTGCGCGTGGCCAAGGTGACCGGCGGAGCTCCCTCCAAGCTCTCCAAGATGTGAGTATGCCGCGGCACAACGACAAGCGTCACGAACTAATTCCGGTGTTTTGTG 
CATTTATGTGCGTTCTTATCCATACAGCCGCGTTGTCCGCAAGGCCATCGCTCGCGTCTACATTGTGATGCACCAGAAGCAGAAGGAGAATCTGCGCAAGGTCTTCAAGAAC 
AAGAAGTACAAGCCCCTGGATCTGCGCAAGAAGAAGACCCGCGCTATCCGCAAGGCCCTGTCTCCGCGCGACGCCAACCGCAAGACCCTCAAGGAGATCCGCAAGCGCTCCG 
TCTTCCCCCAGAGGAAGTTCGCCGTCAAGGCCTAGAATGCATCCAGATACTCCGACTGTCTATAGTGCGCGTTAAGGGTTCTTCGGTACAATGAAGAAGAAAAACTAAATAA 
AATCTTTTTTTTAAAGTGAAAATAGCGATGCGTGTTTGATTTTGTGACCGAAAAAACGCGAGTGGCACATGGCTTTGGCCATGGGTTCATGGAAATTAGTTAAAAAACAAAA 
AAACAAGTGCCGCATTCCCTGTTCTACATATGTTAGTCGGATCCAATGGGATCTACTTAACTACTTGTGGTCCTTAAGATCATTATAGAGCAAAAACGTTGAGTTAGCGCCT 
TGTAGAAGTAATGGCAAATCTTAAGTTCAATCTTTAATTCAGCCTTCTTTGAATATGCAAACTTGAAAAACACCAGAATTCTCAATTGGTCTACGTATATGTATTTAATTCG 
ATTGTCTTTGATGTAAGCACGGCCATACATCGTAAATAATTATGAACTAACAAGTAATAAGTAGGTAATTGATATGAATTACAAGCTCTCGACTGCGAGCTCGATCATACTG 
AATATGTTTGGCACATTTGGTTAATACTATTCTTCTGCACAGCGTGAAACAGCGGCGTGCTGCGGATGACGCCCGACAGCTCCTTGGAGTTCTTGCTTTGGTGATAGTTTCT 
ACTCGGCATCGGCGACAACGGTTCGCCCCAGAACTTGAGCAGCGCAAAGATCTTGTAGTAGGCCACTCTGAAATAGAAGATAGATTCGCTATTAGCAGCTAAACCCGATAGC 
AGCCAAAACTGCCAGCAAACTTACCTGTAGTTGCGATTGCTGGCCGCATAGATGATCGGGTTAATGACACTGGAAGCCCAGGCCATCACGGAGGCGATGATGTGCAGCCAGG 
GGTACGAGGTATTGCGCTCGTCGTCCACCACATTGGCCAGCATTAGCGGCAGGAAGCAGACGAGGAAGCACAGAAAGATGGTGACCATCATGACGGTCAGTCGATTGTCTTC 
GCGCGCCTTGCGCGTGCATGTGGTGGTCATGTAGGATCCGCCGCCGGACGACGAGGAGCCCTTGGCCGCTGCGATCTGGAAGTTATCGTGGTTGCGAATCTTCTTTTTCTGG 
TGCAGCACCGTAATGTAGATGCACGAGTACGAAACGATGATCACCAGGCAAGGCAGCAGGAAGCCGATCACGAACAGCGTCTTCTTGATCGATCGCCCCTCCTTCTTGAGTA 
TTGTGCAGGAGAAGGTGGCCTCGTCCAGGCCCATCTCGCCCCAGATGCCCAATATGGGCGGCAGCTGGGAAGATAAAAACCGATTAGTTGAATGTGTGCACACAAATCGAAG 
ACGCCTTACTTACCAACAGCAGAAAGGAGACGGCCCAAACGAACAACAGCTGCAGGGTTATGAACTTAGGCTTATATATCTGCGAGTAGCGGCTGTGGCAAGCGATGAGTAT 
ATATCTGCCAGGGGATTAGGGTGACGGTTGCGGATTAGCTTGCGACAATGCGGACTAGACTTGGGCCTACCTGTTCAGGGTGATGCCCACCATGCTGAGGAGTGAAACAGCC 
ACATTGCCATAGAAGATCACCGGAAAGATCTTGCACAATGTGGTACCAAAAGTCCAGCTCTGTCGTTGGAATCGAGCAAATTATAATCATTCACTAATTGTATATTACATTC 
ACAGGTATAATGACTCACCTCCTGGAAGAATCGCACTGCGGTCAGTGGCAGGCTGAAGGAGCAGAAGAGCAGGTCGGAGATGCTTAGCGAAATGACGAAGGCGGTTGTGGCA 
TGCTCCCGTATCGTGGGGCTCTTGAGCAGCGCCAGCAGGGTAATCAGGTTGCCTGTGCGGCAATTCAGCATAGAGAAACAGGTTAGCTAATCGAGGAGCTGAATGTGGGATG 
CCAATTACTCACCAAGAACGCCGATCGTCACAAAGACACAGGCACTAATGGCCGCGAATAATGTCGCCGAATGCGGGTAGATCGATTGCGTGGCGGCCGCCGGTTCATCCAT 
CTGCATGTCTGCGTCCTGAAAGTAGCCCGTTGCCATGCCCATGTCCTGATCCATTCTGCGGGGGAAACAAGCAAACGATGTGATTAGCCAATTTCAATTACTTATTTTTGAA 
CAGGAAAAGGAGCTGTATCTTTTAAATTGGTTGCATTCAAATCGCTTATGCTAATCCTATTGTTTTTTTTTTTTTAATTTGTTAGCCCGTATCATGAGTAATTAAAAAGGAA 
GTTAATAGCCGGATGTAGACAACTCAATATGCATCCGCTTGGTTATTTGAAAACAATTAACTTATGAAGAGAAATGAATGAAATTACCATTCCTAATTGCCCAGAAGCATGA 
TGCTTTTGTTAAGAAGGGATTGTTAAGAATTTAACTATCTAAACGAACTAATGTGTACATTTCTGTGCTCCGCGATAAAGGAAATGTAATCAGCAAAATCAAGTTTCAGGCG 
CGTGGAAAATCTGTGAATGAATGAACGTACGAGTAATACAAAAAGATAACTAATCACCGCTTTGCGGATGCAATAGCCCAGTTTTGTCGTCGGTGACACCGAGTAAAAACCC 
GATCCGATCGACGGGATTAGCGAGGAAGTCTGCGTACTCCCACCAGCTTATCGCATCCGAGACATTTGATTTATGTAAACGGATTTTGTTATAAGATTTCTGCCGAGCCAAG 
CTGGCCAACTGGCGAATCGTAATGATCGAGCCGGTTTGCACGCCATTTGAATTCACATTTGTACGAATTTTTTTTTTATCAAAAGTTCGAGTTTTTCACCAATTTCCTCATC 
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AACCGAGCRRGGCRAACGGCTTTGAATAATATGGTGTTATATATACATATATCAAATCGCTGCTGACTGCGTGflTTGflTGGCCCCAAGATTACATATTflTCGAflTCRGGRTT 
CAGAAGGAGATCAATGTCAAATGCGGACAGGAACATGAGAGACGCCTGTTATGCGCAATTAAAAATTTGGGTTTAATTGCTGTGGAAACTGTTGTTGGTGGCATCTTAAGTT 
CCTGTTTAACAACATCAACTACTTATGTACGTAGAAGCGTTTAAGCCATTTGCATACAGATGAGAACTGGCTTTTGTGCTAATCAGCCAAGATGACTCCGATGATGATCACT 
CATTACCTGACCAGTTTTCGCTGCTTTCTTTTCAACAACTACTTGTATATGTATTGTATCCAATAGCAATACATTGGATTTCCATGGTCTAGTCACGTATTATCATTTAATT 
GACGCCAAG'1'CGTGTTATTGTTGAGCTATCGAGTTCAGCTCAAACATTTCTTATTCCCATGAATAAGCCGGCAAAAATATGCAATCTATGAAAGTTAATATAAGCAAACCTT 
ACTTTGACTCAATACCAATGCACTTTGTGTCGATAGGTTCACGCAATTGAGGCGATTATTCCGATAACCCAAGCGATTGACTGTTCCCGTTTCGATTCCAATTGAAATTTGG 
AAATGTACAATAGTTTTGCTATATGCTGTCAAGTACGCTCTTATCTTCTCTGGGTTTTCTTCAGAGTTTCGAAACGCTTCTTCTTTTTTTTGTTTTTTTTTTTTTGGAATCT 
CGTATTTTGGAAGGGGCTCCCCTCTGGAATTTGTTACACTGTCGTTATCATTGCGAACAAGCGGCCCGAAGCTATCAGCGACTTTAACATTTACAATGCACTTTTTTACGAC 
CAATTAAATGTACATTTTCCTTTCTTCGCCCGTTGATAAGCGAACGCGATGTGGCGCAGGCAATGTGTTGCTCTTGCGACACAAACGCAATCAAAATGGATTCAATTTCGCT 
TTTTCCCAGTGAAACGAAGAACGAACCGACCATCATGATATGCTGCTCTGCATGTTGCGTATTGAATCAATGACAATTTCAATTAAGCCGCCCGTTCGTCATGCGTTTTCGT 
GGGCTTCGAAATGCTGATAACGCTGCTGTCCTCCAACTGCTTTGCATGTGGACACAATTCCATTTATTTAATTCTTTTATTTGGATCGGTTAAATTAAAAAGCGCCTTGTTA 
CGCATTTAACGTTGTTTCCGGTGCGTGGTGGTTTCATGCTTCTGGGAACGGCAAATGGGTTTAGGATTGGGAACCCCTCATCATCCGTCCAAATATGGGGAGTTCAAGGAAT 
GGAGGGGGGAGGGGGGGAGGGGGCGGAGCAGCGGTCCGCTTTCATGCATGATCAACATGTGTGTTTGTTTTCCAACTCTATGTGAGTTGTTTCAGTTCTTTTCGGTCTCCTC 
GGTTCTACTCAGTTCGTTCTTACTTTCTGGCTTTTGTTTCCGAATAGGGGGAGCGCATCGGGCCGCAACTGGAAATGGAAATCTGAATATGCATCCCGGAGCATCCCGTCCG 
AAATGAGCACCATGCGCCATGCCGTTTCGTTTAATTCCCTACCCATAAATGCGTGTCCAAACAGCAGAGGGCATTTCAAGAGGTCCAACTATTAGCTATGTGCATATATAAT 
TCTCTAGATATATTAATTCAGCTAAAGTGCTAACCGCGAGTGTGCCGCCAGTTGCCCCTCAACAGTTTGCAACGATGATGATGGCACAGATATCAGAGATAACACTCAGATC 
GCGCCCCTCACAAGTATGCTCATTATGTACATTCATATGTACATATGTACACACAGCTAAGCACATAAGGAATACGATATGCGCTATGTACATBTATACATATTTACATCAA 
AGTCTTATCAGCCTCGGGTTCCCACCGTCCAAAAAAAAAGGCGGAACGAAGCTTTATAAATAATCAATCAACAAATATTTACAATCAAGTATTTATGTAGCGGGCGGAAAGT 
CGAATGCAAATAAGGTTAGACGCATTTGTACATATGTATTTTCTTAAAGGGTTCCTCTGATCAATATCTGCCCGTTTTTATCTAGCCCCGTTTATAAGGCTGTGAATGCCCC 
GATTGTGTTGGTCGTGATATCAATTAAATCGGCATAATCTCATGTCATCTTTTTAAACATTATGATAGATAAAATGTATTATAACGACTCTAATTACATTGGGATTAAATGG 
GTTTTCGGAAAATACCTATGCAGTAAATCAAATTTAAGATCAATGTGCAGTTATTTGGGGTTTCTTGAGTTAGGACCTCTGTTTTAATATTGAGAATTACTCGATATTATCT 
TCCGTTTTATTTTATTTTCTGATCGTGTCGGTGCACGAGAAATACATATTTCCCACTTTTGTTGCATTTCGCTGACTAATCGAAAGCTAAATACCAAATAAGTTTCCGATAT 
GGGGGGTATGTATGTACATACATACATATATAGTACGAGTATATATAATATACACTGAATAAGCCCGAGCATACATATGTACATATGTAGATAGTGCACATGTGTATTGTGG 
ATCGTTTTGGAGCTGGGTTTTTCTTTGCTTTTCGTCGTTITTAAGAGACTTGAGCTGAATGAGTCGCTACCAACTGTTGCGGTGGGTTAAAGGGTGGGGGGAGGGGGCGAAT 
GGGGATGGGGGATGAAGATGATGATTCATGCAGCTGATGTTTGGACAGGGAAGAAAGAAAGAGCGAAAGAGAAAGCAAATCGGAAGCGGCGGTATCAAACAATCTGTTTAAA 
TAACAACAAAAGACTTTATAAGCCTGACTTTCATTTCCATTTCCAATTTATTTTTCCCAAGTCTGGCTCATTTTGTGCAAATAAACAAATGACTCATTTGTTTAAAAATTCG 
ATGAGACTCAAGGAATGTCACATTCACAAAAATTTTCAATTTGAAATCTGAAAAGATTCAACAGCATAAGCAAATGAATTTAAGATTGCGATATTTGGGTATTATCCTTAAT 
TGGCTGGCTGCCTAATGTAGATTTGTTTTTCTCGGTGTACCGATGCATCCTCTTTGCCCATCCACTTCTTTCTCCCCTCTTTGCTGATTTTTTTTTTTTTTGTTTTGTGAGA 
GGGTGGAATTCGGGATTTGGGATGTGAGCGAGAAGGATGGCCATATTTTGAATTGTGGAAACAGCCGTTCCATTATCGTTCCATTACAAATGCACACACACACACACACACA 
CGCGCGCGTGCAGTTTGCGCTTCTCTGTGTGTGCGTTTGGACCAAATGCAATAACAATAAAAACGCGCAACTCAAATTGTTAACAAAGGTAAGGCCGTTTTTTTGGCCGGAT 
TTGGATTCTGGTCTTGAAAGAGAAACACACACACACACACACGCACGCAAAAAACCAACGAAAATAATAACAAAACAACAACTCACCCGAACTCGAACCCGTTTCACTGTTT 
TCCGCTTATTATAAATTTATACTTCGTTTGGCGTTTCTTAAATGCTTTCGAATTGCATTTTGGATTTGACATTAAATCGCTCCGCGCAGCACAGTAACAAAAATAAACGGAA 
CAAAATAAATACAACCAGCCGATTCTGGCCTTCTCACAATTCCGACTTTTGAATTATCGAAACAGAAGCGCTGCTTGGGCCAACTAAGCCAACTTAACTAACCGATCGCAGA 
GCGCAAAGTGTCGTATTTTCACTGTCGTCAATTAACAACAGCGGCAACAAGAACAACAAACGAAATGAGCGGCTGGGCTACAGTTTTTTAGTCCGAGCGCGATCGAACATGT 
TGGCTGCCTGGCAACAAGTTCGATGCGAACTTTTCTATTGATCTAAGAAATCGGCGTTCTATCGATTGACGCGCTGCGCAGGCGCCTATCGATTTTTGAAATTCTAAGCCCG 
CCTTAAAGGATATTTGTAGAAAAGTAGCTTGCAAAGAAATTAAATTCCTTGAAATAAAAAAGAAATTTAGTACTCTCGTTCTATACTCACTTCTATTCACTTGCGGTTAGTT 
AGTTAGTTTTTAATGCCACATTTTTATGATGATGACTTTTTTTCAGCTCATCTATGCGCAATCAATTTTGCGCCAAGCGACAGCTGCACATAGTTTCTTTTGACCAAGCAAT 
CATTTATCATATAATTTTCCAAGTAGTAGTGCCAAGTTCATAACTGATGGAGTGTGCTGCCAATTTGCATGTTGTGGAGCCATTCAGTTTCAAGGTCGCGCTAATTAAATAC 
TCGTTACAAGTGATGCCAAATGTATAAATATTTGAACAGAACGTATTAAATTAATATTATCCACTAAGAGCACTATTTTAAATTCTTAAAAACCAGATGCACGTCAAATAGA 
TGTACATAAAAAGTATATTTACTTACAAATATCATACACGGACCTGTTTTATTCCTCATCACTGGCCATTAGTTGATAACATGTGGACTTCTATGATATCCTGCTGCATTTT 
TGCCGTTTAGTCAGGATTTAGAGCTTGTACACAATGCGACAGCTGAAAGGGCGAAATCGGTGCAATCGTGCTGTTCGGCATCTGAAAGTTCAGGGCAAGATGTGGTTGAAAA 
ATCTTAAAAGCGGTTTGGAACAAATTCGAGGTGAATCTATCCAAATCGTTAGAATCCCATGAGTACGTTCATC 
(SEQ ID NO: 16) 

Exon: 6913.. 6695 
Exon: 2519.. 2365 
Exon: 2292.. 2147 
Exon: 2075.. 1975 
Exon: 1908.. 1806 
Exon: 1744.. 1257 
Exon: 1187.. 1001 

Start ATG : 2500 (Reverse strand: CAT) 
Transcript No. : CT10621 

GGCCCAAGCAGCGCTTCTGTTTCGATAATTCAAAAGTCGGAATTGTGAGAAGGCCAGAATCGGCTGGTTGTATTTATTTTGTTCCGTTTATTTTTGTTACTGTGCTGCGCGG 
AGCGATTTAATGTCAAATCCAAAATGCAATTCGAAAGCATTTAAGAAACGCCAAACGAAGTATAAATTTATAATAAGCGGAAAACAGTGAAACGGGTTCGAGTTCGGAATGG 
ATCAGGACATGGGCATGGCAACGGGCTACTTTCAGGACGCAGACATGCAGATGGATGAACCGGCGGCCGCCACGCAATCGATCTACCCGCATTCGGCGACATTATTCGCGGC 
CATTAGTGCCTGTGTCTTTGTGACGATCGGCGTTCTTGGCAACCTGATTACCCTGCTGGCGCTGCTCAAGAGCCCCACGATACGGGAGCATGCCACAACCGCCTTCGTCATT 
TCGCTAAGCATCTCCGACCTGCTCTTCTGCTCCTTCAGCCTGCCACTGACCGCAGTGCGATTCTTCCAGGAGAGCTGGACTTTTGGTACCACATTGTGCAAGATCTTTCCGG 
TGATCTTCTATGGCAATGTGGCTGTTTCACTCCTCAGCATGGTGGGCATCACCCTGAACAGATATATACTCATCGCTTGCCACAGCCGCTACTCGCAGATATATAAGCCTAA 
GTTCATAACCCTGCAGCTGTTGTTCGTTTGGGCCGTCTCCTTTCTGCTGTTGCTGCCGCCCATATTGGGCATCTGGGGCGAGATGGGCCTGGACGAGGCCACCTTCTCCTGC 
ACAATACTCAAGAAGGAGGGGCGATCGATCAAGAAGACGCTGTTCGTGATCGGCTTCCTGCTGCCTTGCCTGGTGATCATCGTTTCGTACTCGTGCATCTACATTACGGTGC 
TGCACCAGAAAAAGAAGATTCGCAACCACGATAACTTCCAGATCGCAGCGGCCAAGGGCTCCTCGTCGTCCGGCGGCGGATCCTACATGACCACCACATGCACGCGCAAGGC 
GCGCGAAGACAATCGACTGACCGTCATGATGGTCACCATCTTTCTGTGCTTCCTCGTCTGCTTCCTGCCGCTAATGCTGGCCAATGTGGTGGACGACGAGCGCAATACCTCG 
TACCCCTGGCTGCACATCATCGCCTCCGTGATGGCCTGGGCTTCCAGTGTCATTAACCCGATCATCTATGCGGCCAGCAATCGCAACTACAGAGTGGCCTACTACAAGATCT 
TTGCGCTGCTCAAGTTCTGGGGCGAACCGTTGTCGCCGATGCCGAGTAGAAACTATCACCAAAGCAAGAACTCCAAGGAGCTGTCGGGCGTCATCCGCAGCACGCCGCTGTT 
TCACGCTGTGCAGAAGAATAGTATTAACCAAATGTGCCAAACATATTCAGTATGA 
(SEQ ID NO: 17) 

Start ATG: 239 (Reverse strand: CAT) 

MATGVFQDADMQMDEPAAATQSIYPHSATLFAAISACVFVTIGVLGNLITLLAIjLKSPTIREHATTAFVISLSISDLLFCSFSLPLTAVRFFQESWTFGTTLCKIFPVI fyg 

nvavsllsmvgitlnryiliachsrysqi ykpkfitlqllfvwavsfllllppilgiwgemgldeatfsctilkkegrsikktlfvigfllpclvii vsysci yitvlhqkk 
kirnhdnfqiaaakgssssgggsymtttctrkarednrltvmmvtiflcflvcflplmlanvvdderntsypwlhiiasvmawassvinpiiyaasnrnyrvayykifallk 
fwgeplspmpsrnyhqsknskelsgvi rstplfhavqknsinqmcqtysv* 

(SEQ ID NO: 18) 
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Name: Melatonin receptor-like 2 
Classification: G_protein_linked_recept6r 



Celera Sequence No. : 142000013384305 

GTTCGTTTGRTCGGTTGACCTCGATTTATTTACTGCTATGCTkAAGCTAAAATTCAAAAATCCCTACTCATAGATGAAGTTTTTTGCACACTTTAGGTGTAACTTGGTGCTTT 

AACTTGGCGTTTCTAGTCTTAAATAATTGGCAAGATTATCTGCCTAACTTTTGGTGACCCCAACTTTTAATCACACACCAAGCGGAATTATACTAAGGTTTATTTCCCTTGG 

CATCAGACTGTCTTCTTCTTCGCTCTTCCAGCTCTTGTCTTTATTAAACCTGGCCAAGGACCTGTAACTTCGTTTAATTAAGTAATTCAATGAAATTGACAGCTGTTTGCTT 

GGGTCAAGCATAATTGATGACTACTCATAATCAAAACAGAAGTGCTGCCAATGAATGGGAAGCGGAAGTCTGGCATAAATTACAGGTCTGTCCATCCGAAGCGCAATCATCA 

TATAATTGTCATGGAACCACACTTGGCACTCACTTTTGCCTGGCTTCTGCTCCTGCGAACGCTGATGTTGCTTTGTGCTTTGTGCTTTCTGCTTGATGCTTTTTGCTTTGTG 

CCCAAGTCTCGTGACTCCACTCCACTCGGAGTGGCATCCATGGTTAGGAGAACAGCAATAATGAGGCACACCAACCGGCCAATTGGCAATTGTGCGAGGTCAAACACAACCG 

GTGGAGGAAACTGGTGTCGGCCGCACAGGGAAGACTTAAATTTTCTTTAATTGCTCGAATTTATGGGTTGGTAGTGGCATAACGACAATTGTCCAAGTCAAGGTCACAAATG 

ATGCTCCTGCCTTTTTAGTTCGATACACACAAAGAAAAGGGGACCATACATATACAAGCAACCAACTAGATGAAACCTATTTTAAGGACGGTTGAATTGTTGGCTTTTAACA 

TAGTAGATTAACTTCAAAATTAGCAGTTACTACTACTAATAACGAGCTAAGCCACTCATTTGCGTAAACACTTTCTATCTGTGCACTTTTATCTGCTTGAATAGTGTCTGGG 

TGATGGCCTCGATTGCCGTGGTTGGCAACCTATTGGTCCTGCTCGGTCGCTACTTCTACAAATCACGGAGCAACGTGGAGCACTCGCTCTACCTTCGCCATTTGGCCGCCAG 

TGATTTCCTCATGGGCATTTACCTCACATTGATTGCCTGTGCGGATATCAGTTTTCGCGGCGAGTATATCAAATACGAGGAGACCTGGCGGCATAGCGGCGTTTGTGCCTTC 

GCAGGTGCGTATCCTTCGACATCCTAGCCATTTCCATTTCGTAATGGAATGCTAATGTGTATTACGTATTCGCCGTGTAGGCTTCCTCAGCACCTTCAGCTGCCAGTCGTCG 

ACGCTGCTGCTCACATTGGTCACCTGGGATCGTCTGATGTCGGTGACGAGGCCCCTCAAGCCACGGGATACGGAAAAAGTTCGGTAAGTGTGAACTACAATAGATATGTAAT 

AAAGTGCTGGGCAAAGTTATTGTTTATTCTATTACATTAATTACTTTTGCTGCGGACATTTGCTGATTTGTTGACTGCTCAAATATCAATGCACAGAATTTGTTTACCCTAC 

TTGAATTTTGTTCGTAAATGCAATTATGTTTGTTTTGACCGCCGCCTTACTATGTACAATATATATTTTCTGTTGCTTCATTTTAGTTTGCAAAAGCTATGGTTTTGTATGT 

GTAAGCTATGATAAATTATTGAAATGACAACTTCCACAGCATTGTCCTACGCCTTCTGCTGTTGTGGGGCATAAGTTTCGGATTAGCTGCAGCTCCACTTCTTCCCAATCCG 

TACTTTGGTAGCCATTTCTACGGCAACAATGGTGTCTGCTTATCGCTGCATATCCACGATCCCTATGCGAAGGTAATACACCATTTCCTTGGAAACGCCTGTTCTAATATTC 

CAAAATGTATATTTTTAGGGTTGGGAGTACTCGGCGCTGCTGTTCATCCTGGTCAATACGTTGTCACTGATCTTCATCCTCTTTTCCTACATTCGAATGTTGCAAGCGATAA 

GGGATTCGGGTGGCGGAATGCGAAGCACTCACAGCGGTCGCGAGAATGTGGTAGCAACTCGGTAAGTTTGGATAATTCAAGTAAAATGATTTAATTATTGAAATATCATGTT 

TTTACAGCTTTGCCATCATTGTGACCACCGATTGCGCCTGCTGGCTGCCCATAATTGTGGTCAAATTGGCTGCTCTTTCAGGCAAGTTCATCGGCAACGCCTTTAAATGATT 

TATTTTGATTATCAACTTATTCGTATATATATAACCCCTTTCAGGCTGCGAAATCTCGCCCGATCTTTACGCCTGGCTGGCGGTTCTTGTGCTGCCAGTGAACTCGGCCCTC 

AATCCAGTGCTCTACACATTGACCACTGCGGCCTTTAAGCAACAGCTGCGTCGCTACTGTCACACCCTGCCCAGCTGCTCGCTGGTGAACAACGAGACCCGATCCCAGACCC 

AGACTGCCTACGAGTCCGGACTGAGTGTCAGTCTGGCGCACTTGGGCGGTGGTGTGGGCGGCGGGTCGGGGCGAAAGCGGATGTCACACCGGCAGATGAGCTATCTGTAGGC 

GGGCGCAACTGGCTTAAAATGCAGCACGAGATCAACAGACTGGGGCTAGTGGGTGGGTGGCATTACAACAGGACACTTCCATGTAATTGGTCCCTTAGACTATTGGTATTAT 

ACGTACTTATTGATACTTATTCTAATATCAATTTTAAATTACTTATAACATTTGAGGATATGATAACATAAACCTTAGACAAGAGATTCCAGCTATTTAAGTTTCTTTTTTG 

TATATTATTTATTTTACCAATCGTTTTAAGGCAAACCTCTTACAACTGTCCAAAGAATATCCTAACTATTTGTTGCATCCCCATTTATTCGACCTGCCCAATGGGATATTGT 

TACATGTGATCTAATAGTTAAATCAAATTTTAATTTTATAAATACATATTCAGATGTTAATTGAAGCTTTTGAACATAGGACTTGCAATCGAAGGCCGCAAATGCCTGCCAT 

CCGGCTTTCTATGCGTTGCCCTGAGGTTCAGTGGCTACATCGAGATCCGCCTGGACGTGATAGGGTCCCTCGTTCACATAGGCCGTGAAACGGATGGTTTCGGTGGGCGAAA 

ACTCAGCGCTAGCGGACAGAATCTCCATAATTTGCCACTTCATATACTTAATGTCCCTGTGCAGATCCTTTTGACGCGAAAGAAGGAACTCCACAGCACCGCCGGTATTCTT 

TAGAGCTACAATGCCCCAGTTGTACTTACAAATGGTTTTCAAGAACGACAAGGCGGCCATTTGAAGATCGGGGAACGGAGTGTTTATCAGGTCCATAATGATATTAGCATGC 

GCTCCCCCAGCGAAGCACTCGTACCAGTTTTTACCAATGTTACTGAAAAAGAAGAGAATCATTAGAAGAAAAACTTCAATTATAGAGTTAAGCATTACATGATTTCTGTGGC 

AGGCGGCGTCTTGAAGTCATAAATCACATCCAGGGAGTTGAGCAGCCTCTTCTTGATGTGCGCCGATAGCTTCTTAGTATGCGAGCCATACTTCTTAAAGGA 

(SEQ ID NO: 19) 

Exon: 1001.. 1236 

Exon: 1313.. 1427 

Exon: 1720.. 1864 

Exon: 1923.. 2077 

Exon: 2136.. 2209 

Exon: 2285.-2574 

Start ATG: 1011 

Transcript No. : CT13764 

TGTCTGGGTGATGGCCTCGATTGCCGTGGTTGGCAACCTATTGGTCCTGCTCGGTCGCTACTTCTACAAATCACGGAGCAACGTGGAGCACTCGCTCTACCTTCGCCATTTG 

GCCGCCAGTGATTTCCTCATGGGCATTTACCTCACATTGATTGCCTGTGCGGATATCAGTTTTCGCGGCGAGTATATCAAATACGAGGAGACCTGGCGGCATAGCGGCGTTT 

GTGCCTTCGCAGGCTTCCTCAGCACCTTCAGCTGCCAGTCGTCGACGCTGCTGCTCACATTGGTCACCTGGGATCGTCTGATGTCGGTGACGAGGCCCCTCAAGCCACGGGA 

TACGGAAAAAGTTCGCATTGTCCTACGCCTTCTGCTGTTGTGGGGCATAAGTTTCGGATTAGCTGCAGCTCCACTTCTTCCCAATCCGTACTTTGGTAGCCATTTCTACGGC 

AACAATGGTGTCTGCTTATCGCTGCATATCCACGATCCCTATGCGAAGGGTTGGGAGTACTCGGCGCTGCTGTTCATCCTGGTCAATACGTTGTCACTGATCTTCATCCTCT 

TTTCCTACATTCGAATGTTGCAAGCGATAAGGGATTCGGGTGGCGGAATGCGAAGCACTCACAGCGGTCGCGAGAATGTGGTAGCAACTCGCTTTGCCATCATTGTGACCAC 

CGATTGCGCCTGCTGGCTGCCCATAATTGTGGTCAAATTGGCTGCTCTTTCAGGCTGCGAAATCTCGCCCGATCTTTACGCCTGGCTGGCGGTTCTTGTGCTGCCAGTGAAC 

TCGGCCCTCAATCCAGTGCTCTACACATTGACCACTGCGGCCTTTAAGCAACAGCTGCGTCGCTACTGTCACACCCTGCCCAGCTGCTCGCTGGTGAACAACGAGACCCGAT 

CCCAGACCCAGACTGCCTACGAGTCCGGACTGAGTGTCAGTCTGGCGCACTTGGGCGGTGGTGTGGGCGGCGGGTCGGGGCGAAAGCGGATGTCACACCGGCAGATGAGCTA 

TCTGTAG 

(SEQ ID NO: 20) 

Start ATG: 11 

MASIAWGNLLVLLGRYFYKSRSNVEHSLYLRHLAASDFLMGI YLTLIACADISFRGEYIKVEETWRHSGVCAFAGFLSTFSCQSSTLLLTLVTWDRLMSVTRPLKPRDTEK 
VRIVLRLLLLWGISFGLAAAPLLPNPYFGSHFYGNNGVCLSLHIHDPYAKGWEYSALLFILVNTLSLIFILFSYIRMLQAIRDSGGGMRSTHSGRENVVATRFAIIVTTDCA 
CWLPIIVVKLAALSGCEISPDLYAWLAVLVLPVNSALNPVLYTLTTAAFKQQLRRYCHTLPSCSLVNNETRSQTQTAYESGLSVSLAHLGGGVGGGSGRKRMSHRQMSYL* 
(SEQ ID NO: 21) 

Name: TSH/FSH receptor-like 
classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013383844 

TTTTCCGCCAAATTCGCTGGCTTTTAAGTGCTCACATAAATGAAATTTTCAAATAGCAAATAACATAGGAATAATCCTTCTAAATCGCAATATAAAAACATATATACAAGGA 
GTGGTAAATGACAGTATTCTATTAATGGCAAAACGATTTATTTAAACAGACGTCGCATCCGCATAAACGATCTTTCACACTTTTTATTAACGGTTAGTATAAATAAAAAAAA 
ACATTAATCCTTAGTTTATTTAACCAATTTTGTTTGCTGAAATCGATTCTTGAAAGAGAAGCTGACATTTTTTTTATATATGCCTCCCTACTTCAGGCAGTGCGAACATACC 
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GCATCAACGCGACTTGTAGCGGCCGTGTGAAAAGGCTATTTTGTGGTTAGCGCAGATCTGCGCTGCAGTGTGACCCCGGCTACAGGTATTTCGGCATCGCACAGGGTCACAC 

TCGCCCAGGCACCCGTATAAAATCCCCGCCCCCCTGATCGACGATAGTATTTCGAAAAAAACGCGAGGAAAAGTCTGATGCGCGTTAAAATCGGAGCGCAAATCGTGCATAT 

ATTTTTTTTTACACACCGCGCGCCGTGTTTACGACTGAAAACGGAGACTGACTGAAAAGCCTTGGCGCACAAGCACGCAATGAAAACGCAACGATAAGAATGTGATATTGTG 

ATAAATACATGATTCGCTGACAACTCTATTTTATATTGGAATGCATTCGAAATCGAACGCAGCGCAGCGTGTGTTGTTTACTTAACGATTTTCGGTGTTTGTTTGCCCAACT 

TGCGGCGGTGAAATTCAGCATTTTCTGATTTTCCATTTGCCAACGAAAGAGAGGAAAAGTGAAAAGTGCTATTCGACGAAGCGAAACGGAACGCAAAATCAAAAGAAAAACA 

GCATAGCTAAAGCTATGTGTAGTACATAGCCACATAAATTGCAAATTGAGGCCAAGAGACAAGAGAACCTGAAATCCACACAGATCTTGACGAGTGAGCTAACGATGGACAG 

GAGTCGGAGTAGCAGAGCTGCCAGCAGCAACCAGTTCATCAGGCCATGTGGCCTACTGACGACCGTCATCCTGCTGCAGACGCTTGTGTCGATGAGTTTGGCCATCGAGGAG 

ATGTCGCCACCACCGGCCGCCCCGCCACGTCCATCGCCGCCGCCCACCGTGAAGCTCAACAAGTGCTGCCATTCGGGCGAGTATCTGAACGATGGCACCTGCATCGCCGGCA 

GCGAAGCCCTGTGGCTGCCCATGGTGTACTTGGTGCAGCAGCAGCGCTTCTTCGAGCCCCATGGCGCCAGTCCGCGATTCTTGAAGTTCCTGCCCAATACGAGACCCACGTG 

CCGAAAGGACCAGACGACGGAGATCTTTCGCAGCCGCGGAGCCAATGTGATGCTCTTTCCCAATGGCACGCTCTATGTGCGGGAACGCGCTCTAATGGTCCAGCCGTCGGAT 

TACTGTGTCGACTGGGAGGTGGCCGTCGTTTGCTTGAACGACAGTCAGCCGATCAATGCTCTGGAGGATCCCGATTATGCAGCCAATCCTCTCGTGCAGCAGGAGCCGCCCA 

AGGCTAGCCTGCGACTGAGCAAGTGCTGCGGCAAGTGGGGCAGCTATAACACGCAGCTCCAGAACTGCGATCTGCAGCCCAACCATCAGGCAGCCGTCGATGGACTGCTGCG 

ATTGTCGCCGCAGCTACCGGAGGGCAGCTATCAGACTAGCTACGGCCTGCCCGACTGTGGCCAACCCGGTGGCTACTCCATTGCCGGTGATTGGCAGGATGCCAAGCTGGAT 

CGGAACACGGCCATGCTCCAGTTGCCGCACAAGAACCTCAGCGCCGGACAATACTGCCTGGAGCACACGCAGCGCGAGGGCGAGGTGAAGATCATAGCCTGCCAGCATTTGT 

TTAGCTCCGCGGCAGGAGCCGGCATCCACGACGGCTCCATTGGAGGCACCATCGAGCAGGCCAATGGCCAGAATCTGCAGAAGGCCGTCCTGACTGGTGGCATACTTGTGTC 

CATTGTCTTCCTGTCCGCCACTTTGGTGGCCGGCTTCCTGCTGCCCGCTGTGCATCATGCGCTTCATTGGCGCTGCCAAATCTGCTATGTCACCTGCTTGCTATTCGGCAAA 

ATACTGCTGGCCATCGAGGAGCTGAGCTCCAGCCTGCAGCCGGGCAGTGCCGCCTGCCATACGCTTGCTATCACCATGCAGTTCTTCTTTCTGGCCGCCTTCTTTTGGCTGA 

ACACCATGTGCTTCAACATCTGGTGGACGTTCCGGGACTTTCGGCCCAGTTCGTTGGAACGCAATCAGGAGGCACTGCGTCGCTATCTGTACTCCCTGTACGCTTGGGGCGG 

TCCTCTGCTTATCACCTTCGTGGCCGCCTGCGTGGACCAGCTGCCGGAGACGACGCTACTGCGTCCGGGATTCGGACAGCTCTACTGCTGGTTCGACAATCGCAATCTCTCG 

ATCTTCGCCTACTTCTACGGACCCATTGGCCTGCTGCTGTGCGCCAATATAGCGCTCTTCGTGTCCACCACCCATCAGTTGACGTGCGGCCTGTGGAAACGGGACGATGTCA 

AGTCCTCATCGGAGAAGTCCGCCCTGGGACGCGTCTGCCTGAAGCTAGTCGTCGTTATGGGTGTCACCTGGATAGCGGACATTCTGTCCTGGCTAGTGGGTGGACCGCATGG 

CGTATGGTTCTTCACCGACCTGATCAACGCCCTGCAAGGCGTCTTCATCTTCATCGTGGTGGGCTGCCAGCCGCAGGTGTGGACCGCCTGCCGCAGGATCTTCTGTCCGCGA 

TTGCGTCATGACATCACCAATACGACCAACGGTGTCCAGCATTCGAGCAGCTCGCAGGGTTTGCCCTCGATGGCCGGCGGCACGGAGATTACACAGAACACCACCACCACCA 

CCACGACAACCAACACGACCGCCACTCACATGCCCAGCAATCCGGCGGAGGATGAGGTACCGGAGAAGGCTCCCATTGCCCCCGTCGCACCCATCGTCAAAATGGAGACCAT 

TTGCTAAACGTAAGTTGAACACCCGATAGCCAATGAAGTGTGCTTGTAAAGAGATAGCTCTGTTTGCCAGCGCGCAAACAAAGCGCCTGCTGTTCCCACTCCCAGCGCAGTT 

TGCCAAGTGGCTTACAAAACGCGGCTTTAAGCACACACACACACACACACACACTTGCTGATAGACTTATCGGCCTTAATGGGCTTTAGGGCCCAGCCGGGAATGCAAAGTA 

TCTCAATGAGCTGGCTGGCTGGCTGGCTGGCTGTTTGCTCGCCGGTTACGCCATGCATCGCAGCAGTTTCGATCCTATAAAGCGCACGTATGTACAGGTGGGGCCACTTCAA 

TATCACTAAGCAGCTTTAGCTTCAATAGACTTCAATCGGTGGAAGGTTTGGTATTTGGACCCAAGAACCTAAGTTTCTTATTAAGTTTTAGTCTTTCAAAAATAGCGTGCTA 

AATAGAGGGTCTTAAATTAACTCGAAAATGATTTAGAATCCGTGTAAATTGAATTGACACATTAGCTGAATTTTCCGGTATATAACCAAAAATGTCCGGACGATATATGTGT 

CGTAGAAGATAGTGTGATAATTGCGCTGTCTGTGGATTTAACCCAAATACCAATGAGGTATTCACAAAATGTGAAAACAAATAGGAAACCCCGAATTTGTCATGCGCCGACA 

ACTTTCGAGAACCAACCACCACCATATGAAGATAAAAAACAAAAGAAACTAGTTTGGTTTTCTATGGGGGAAAACGATACGAACTTTTCAATTCAAAACCTTTGGAGTTTCA 

ACGGTCAATGGCCATGTTGGAAAATACCCTTCTCTATAGGACACTATTCATCTCACACCCCCCATGTGGACTATACGCTATCCAGATGAAATTTAAGGGTTCGAGATTTCCT 

GACAACTTGCGGCTACCTGGAGCGTTTTTCCAACCAGGCGCCGGTTGAAAAAGTCTAGCACGTTTTCGTTTCGTTTTATGGTTTTATAGGGGGAAATTTACGTATTTACGTA 

GGATAAGAAGATCGTCCTGCGATGGTCAAATAGTGACCATATTCTTCATTTGCATGACCGAAATTGTTTAACAACATGCGCCAAGTTTACGATTTCGCGTGCACCACTTGAG 

TTCGAGTATGGTCATAAAGATGAAAATGTTGCCGCTGTGTGCCGTTATGTTTTTCATCGGAAAACTCTGACGCCGGGCCGGGCGAATTCGATTTTTGAATTTCCTGTCGATT 

TCCACATTTCCACATGGCCATAAAGTAATCAGCGCAACCCTTGACACACACGCAAGCGAAGATAATGGAAATCGTTGCTGGCATCCGTATCGGTTGGATTCAAAAAATGTGT 

GTTGCCGAATCCGTAAGGGAAAATCGAGTTTAAATTGGACTCACCGGCGACGATGGCGATGGTGATGGTGATAAGCACACAACACACCATCGTCTTGTTTGGATTTCGGCCC 

ATTTACTTATTTACTTGTTACTTTGTTAGTTTGTTAGTTTGTTAGTTTGGTCGCCTCGAGTGGTTTTGACACATCTCCCAAATGATCCAATGGCAAACATGCGGACAGGTCT 

CGAACTCGGCACAATCTACAATTATCTAAAAAGAAACCTCTGTGCGGCGATCATAAAGTTTTGAAAAACAATAATTCCATTTTCGGGCACTAAAAAAAAAACAGGGCAACTT 

GAAATCTAGCATTAAACTGGTGGCTTAATGTTCCAATGAAAAGATAAGGATTTAACAATAAAAAAAAACTTTTTTTTTGTATGCACTGCATGTTGATAGCAACATTAATGAA 

AGAAACTAATCAATGAAGTTTCTGTTTCTCTATATTAACAACAAATGCTGCTTTGTTTAGGCCATGTTTTTGTTATTAGAGCCCAAGAAACATTGGTTTGCATTGCACAAAG 

GCGTATGGAAAATGTGTAATTACTTCAGTTTGTAACATTTTCGAATATGACAGCTGCTGGTGGCAGTTAATTCTTTGCCCTCGCGTTTTATTGTTTGTTACTTTTTACTTTT 

ATTTTTTATTTTTTTTTTTGGGCCCTGATCGCGTCATACTTTTTGGGATGCGTGTCTGTCCGGGTGAACAGATGTGCTGTCTGTCTGCCATTCGCTTGTGTGTAGGTCCGTG 

TTGGAATCAGAGGTTTTGCCCAAACGAAAGAGAAAACAAAACAAACAGTGGGTTTTTGGCCCATGGGCCCATTCGATGAGATTGCGCAACCGAGTTAAATGGCCTGGCCCCT 

GAAAAGCTGAAAACCGAAAACAGAAACATGAAATCTGAAATCTGAAATTGTGGAAATCTGAAATTGTGGCTGGCACACGCAACACTCGTAATCAGACTAAAAAAAAAAGGGG 

CAGTCAGTCGGCTTAAAACTGTGACCTGCATGCATTTGCCAACTGCAGACTGCAAACTGCAAACTGCAACTTCGAAGGCAACCCGAAATGCATCTGCAACTGCAACTGCAGC 

TGCCGTGCCATTTGTGTTCATCTACATAATCCCAGAGCAGCGTGTCCACGTCACGCTCATCACAACGCGACGGTGCAAAAATGCAGCGAAATTGGATGCGTTGCTCGTTTGT 

CAACTGAAATGGTCCAGATGCACAATGCAAGGGAGGGAAGATGCGGCAGAAATGCGCCGGAATCGATGCAGTAATGCGTTTCAAGCTATCATTATGGGATCACATGGGGTCA 

CATCAGTGCGAAAATGCTAATGCTATATGGGCTAAAACCAGTTGGGAATTAGAGCGCCGTACAAGCTGTAAGCTAAGCTGATATGCAATACCGCTTATTAAAGAGCATTTAA 

ACTATTTGACTTGAAAATATGACTAACAAATTGAGCAGGGAAAATGCTTTTCCTTCGGAAAAAATGATTGTGTGCCAAAAACTAAAAACGAAAGTTTCCCTACTATTACTTT 

TGGCATCGCCCAAAAAACACTTTCTTTCATTAGCCATTTCACTGATTATCTTTTTTGGTAACCAAGCGGAGTAATCATCATAAGCACACAGCGTTAAACTATTCAGCTATTT 

CCAGCTTGTTGAGATAACTAAAAGCTCGGACACGCTTGAGTTCCACTTTTTTTTTTTTTTTTTTATCTCCATCAACATTTAATAATTAAGTGATTTCGCTTATTGTTCTAGT 

ATTTATTAGCGCATTACCTAATTGTTTGACACGACGAAACATGAAATGATTACTAATTTCATCATAGAATGCCAGTGACCCATGCTCGTTGGCATTTCGATTATAATAATAT 

TCGCAGTTGCATAAATATAGATTGTTTCCGAACGTGACCTGTACTTCCTAGTGCAACTGAATCTTTTGCGTTTCTGGGTCAGCTTTTATCTGCCTAGTTCGCTGACATTTCA 

AAAGTTTGCAACACTTGGCAGCTTGGCAGCGTAATGAATTCCTTCCTGTTGACTCCGCCACGCAAATTGGCACAAATTTCGCAATAAGAATACTTAATGTCCAACTCCAGAC 

GCAATCGCAGCGGCACACGCACACATGCTCTGCGTTAATTTGAGTTTAGCGGCAATTTTTAATTTGAACGTTCTGCAGTTTTTCAACTTATAATTGCACTCTGGCCCATTTA 

CATTTTACGCAGGGCATTATTATTATTTCTTTTTTTTTTTTTAATTTTTGCCTGCTGAGTCGGTCGACATCGGATTCCTCGTTGGAGGCCGAAGGAAGCGAAGTAGGGGGCA 

TTGTTGTCATGCATCCACAGTTGCGGTTAATGCAATAGCTCTGAGATGGATAATAATTACCTCGGTCCTTGAATGGTGCGAAAATATCTTGGCAGTTCCTATAAAAGCATAG 

CATTGCGTTTGGTACATGAATTTCATTGGCAAATAATAAGTATTTCATCTGGCAGCAGTTCACACCTCAAATCATATCGCGCAAATGCCACCTTTTCGCAAGAATTTGAAAT 

CAAAATAAATCTCAGAATTCCTGATTACGCGATGCCTGGAAATCTAAACAGATCATGAGTAAGCCGCAAATGTTTCAGCTAAGAAAAATGAAAAGAACAAAGTTTCCGCTAA 

GGGTAGTTGGAAAAGTAGTCAAATTGGTTTACTGATCAATATTACTTTTAGATCTCTCTGTTGTTTTATTTATTTAGCAGTGTAAACTAAGACTGCAAGTCCATGAACTATA 

GCCATTTTCAACACTGCTTCTAAGTACTAATGTCTTGAGTGCACTGTACCCACATGCGCACGCATATCATATGAAAATTTGTACTTATTTGTGCCCCTTTTCTTTCTTATTT 

TTTTTTTTGCAGACAACTGAGACGACGAGCGACAATGCGAAAGTCAGGCCCGGATACAAAAGTCCAGACCTCGAGCGAGCGAAATAATGGCAATGGGCAGCTCAGAAATAAG 

AAAGAAAGAATGGAAAAAACACGAAAAAAAAACGCTGCCAAAGCTGGAGTTTCGAGACGAGCGCGGCACACACTTGCGCAATCGCAGCAACAAAACCAAGCTGACCAGCTAA 

TTGTGCGAGTGTACGAGCGAATGAGTGTGTGTGTGGTTGTGTGTGTGTGCAAGCGACTTGCGCGTGATTTTGAAATCGGCCAACAACCGCTGGGGAACAAAACGTACTCAGT 

ATGCCAGTATTCAACTTCAATTTAAGCCAGCGCACAAAAGAAACGTATTATTTTATTCATCATCATCATCATAATCGCATAGCATCGCATCGCATCGCATCGCATCGTATCG 

CAGAATAACGGTAAAGCAGTTCGGGTCACATTAACACGTTCTAATTGGACACTCCGGTCAGTGTTGCGAAATGGAATGTGCAGCATAACGAAACAGGAATTCAGCCTAGACA 

AAACAAAGCAGCGCTATTGTGGCTCTTTCTTTCTTTTAAGCACTTTTAGCCAACAGAAATAAAAACACGAGGGCTTTATGTGTGTGCATCTGGCTGTATGTATGTGTGTGGC 

AACCACAAGGAATTTTCTGTTGAGGCACTTTTGTTGTTTAGTTTAGATCGCGTTGCTCTGGTTTTTTTTTTTTTACTCAAATCTCAAGTGTGAGAAAAACAGCAGGCGCACC 

ACACACAAACACACACACACATGCATGCACATATCGGATGTTTAACATTTAAATTAGCTGCTTCTGATTGGCATTTTCGGTATGACACTGCTTGCATTCAGTTCTTTATGTG 

TGGGTGTGTGTGTGTGTGTGCCTTTGACGTTTTTATTGCCTAATCACTTTTTTTTATTTGCACTTTACTACGAGAATACTGGACGAAAACCGCTGACTGGAACTATTTTTAT 

GTTTGCCCCATAATGTTTTTCAACCGAAAAATTGTATATTTTTTTTTTACGTCTAGTGTAAACAATTGAAATTTATTTACTTCAGGGCATAAACAAAATCGTTATACACACA 

CATTGATAGGCG 

(SEQ ID NO: 22) 

Exon: 1001.. 3033 
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Exon: 7293.- 7412 
Start ATG : 1001 

Transcript No. : CT14539 

ATGGACAGGAGTCGGAGTAGCAGflGCTGCCAGCAGCfiACCAGTTCATCAGGCCATGTGGCCTflCTGACGACCGTCflTCCTGCTGCflGflCGCTTGTGTCGATGAGTTTGGCCA 
TCGAGGAGATGTCGCCACCACCGGCCGCCCCGCCACGTCCATCGCCGCCGCCCACCGTGAAGCTCAACAAGTGCTGCCATTCGGGCGAGTATCTGAACGATGGCACCTGCAT 
CGCCGGCAGCGAAGCCCTGTGGCTGCCCATGGTGTACTTGGTGCAGCAGCAGCGCTTCTTCGAGCCCCATGGCGCCAGTCCGCGATTCTTGAAGTTCCTGCCCAATACGAGA 
CCCACGTGCCGAAAGGACCAGACGACGGAGATCTTTCGCAGCCGCGGAGCCAATGTGATGCTCTTTCCCAATGGCACGCTCTATGTGCGGGAACGCGCTCTAATGGTCCAGC 
CGTCGGATTACTGTGTCGACTGGGAGGTGGCCGTCGTTTGCTTGAACGACAGTCAGCCGATCAATGCTCTGGAGGATCCCGATTATGCAGCCAATCCTCTCGTGCAGCAGGA 
GCCGCCCAAGGCTAGCCTGCGACTGAGCAAGTGCTGCGGCAAGTGGGGCAGCTATAACACGCAGCTCCAGAACTGCGATCTGCAGCCCAACCATCAGGCAGCCGTCGATGGA 
CTGCTGCGATTGTCGCCGCAGCTACCGGAGGGCAGCTATCAGACTAGCTACGGCCTGCCCGACTGTGGCCAACCCGGTGGCTACTCCATTGCCGGTGATTGGCAGGATGCCA 
AGCTGGATCGGAACACGGCCATGCTCCAGTTGCCGCACAAGAACCTCAGCGCCGGACAATACTGCCTGGAGCACACGCAGCGCGAGGGCGAGGTGAAGATCATAGCCTGCCA 
GCATTTGTTTAGCTCCGCGGCAGGAGCCGGCATCCACGACGGCTCCATTGGAGGCACCATCGAGCAGGCCAATGGCCAGAATCTGCAGAAGGCCGTCCTGACTGGTGGCATA 
CTTGTGTCCATTGTCTTCCTGTCCGCCACTTTGGTGGCCGGCTTCCTGCTGCCCGCTGTGCATCATGCGCTTCATTGGCGCTGCCAAATCTGCTATGTCACCTGCTTGCTAT 
TCGGCAAAATACTGCTGGCCATCGAGGAGCTGAGCTCCAGCCTGCAGCCGGGCAGTGCCGCCTGCCATACGCTTGCTATCACCATGCAGTTCTTCTTTCTGGCCGCCTTCTT 
TTGGCTGAACACCATGTGCTTCAACATCTGGTGGACGTTCCGGGACTTTCGGCCCAGTTCGTTGGAACGCAATCAGGAGGCACTGCGTCGCTATCTGTACTCCCTGTACGCT 
TGGGGCGGTCCTCTGCTTATCACCTTCGTGGCCGCCTGCGTGGACCAGCTGCCGGAGACGACGCTACTGCGTCCGGGATTCGGACAGCTCTACTGCTGGTTCGACAATCGCA 
ATCTCTCGATCTTCGCCTACTTCTACGGACCCATTGGCCTGCTGCTGTGCGCCAATATAGCGCTCTTCGTGTCCACCACCCATCAGTTGACGTGCGGCCTGTGGAAACGGGA 
CGATGTCAAGTCCTCATCGGAGAAGTCCGCCCTGGGACGCGTCTGCCTGAAGCTAGTCGTCGTTATGGGTGTCACCTGGATAGCGGACATTCTGTCCTGGCTAGTGGGTGGA 
CCGCATGGCGTATGGTTCTTCACCGACCTGATCAACGCCCTGCAAGGCGTCTTCATCTTCATCGTGGTGGGCTGCCAGCCGCAGGTGTGGACCGCCTGCCGCAGGATCTTCT 
GTCCGCGATTGCGTCATGACATCACCAATACGACCAACGGTGTCCAGCATTCGAGCAGCTCGCAGGGTTTGCCCTCGATGGCCGGCGGCACGGAGATTACACAGAACACCAC 
CACCACCACCACGACAACCAACACGACCGCCACTCACATGCCCAGCAATCCGGCGGAGGATGAGGTACCGGAGAAGGCTCCCATTGCCCCCGTCGCACCCATCGTCAAAATG 
GAGACCATTTGCTAAACACAACTGAGACGACGAGCGACAATGCGAAAGTCAGGCCCGGATACAAAAGTCCAGACCTCGAGCGAGCGAAATAATGGCAATGGGCAGCTCAGAA 
ATAAGAAAGAAAGAATGGAAAAAAC 
(SEQ ID NO: 23) 



MDRSRSSRAASSNQFIRPCGLLTTVILLQTLVSMSLAIEEMSPPPAAPPRPSPPPTVKLNKCCHSGEYLNDGTCIAGSEALWLPMVYLVQQQRFFEPHGASPRFLKFLPNTR 
PTCRKDQTTEIFRSRGANVMLFPNGTLYVRERALMVQPSDYCVDWEVAVVCLNDSQP1NALEDPDYAANPLVQQEPPKASLRLSKCCGKWGSYNTQLQNCDLQPNHQAAVDG 
LLRLSPQLPEGSYQTSYGLPDCGQPGGYSIAGDWQDAKLDRNTAMLQLPHKNLSAGQYCLEHTQREGEVKIIACQHLFSSAAGAGIHDGSIGGTIEOANGQNLQKAVLTGGI 
LVSIVFLSATLVAGFLLPAVHHALHWRCQICYVTCLLFGKILLAIEELSSSLQPGSAACHTLAITMQFFFLAAFFWLNTMCFNIWViTFRDFRPSSLERNQEALRRYLYSLYA 
WGGPLLITFVAACVDQLPETTLLRPGFGQLYCWFDNRNLSIFAYFYGPIGLLLCANIALFVSTTHQLTCGLWKRDDVKSSSEKSALGRVCLKLWVMGVTWIADILSWLVGG 
PHGVWFFTDLINALQGVFIFIVVGCQPQVWTACRRIFCPRLRHDITNTTNGVQHSSSSQGLPSMAGGTEITQNTTTTTTTTNTTATHMPSNPAEDEVPEKAPIAPVAPIVKM 
ETIC* 

(SEQ ID NO: 24) 
Name: mth-like 1 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384553 

GACCAGACAAGTGCAAGCAATGGAGCTCAGCGTCAGTATCCACTGTCTCACACACATCCACAAAAGCATCTTCATCAGATTCAGTTCCTCAGATCTCGCATCAGCTGCTGCG 
ATTCCCATGGACGTGACACGGTGGAGCGGTGGAGAAAACATCATCATAATAATAATAAAGCCGAATAACTTTCTCCACACGTAGTGTGTGTCAATAAATCGGTTGTTTCTCG 
CAGAGTTTGAAAGGAAACTTTTAGAGTTCTTAAACAAAGAATGTGGTTTTGGAACCACTGTGCCGCATCCTGTCTGCCATGAATGATGCGTCCTTTCCTGCCAGCACATAAA 
GCCCCGGCCAGTTGCATCTATTCCACACACTGATAGACAATGTTTGTGCACTGATTTATACTCAGCCAGCTATTAAATAAGAATTTTCGTTATTACGCTGCCGGGGATTTTA 
TTATGTGACTTTCAAAAGCCCCCGCCAACTTGTTGTGGCTTTGGCTTAAAGCCGCGCCAAGCTGAAAATTGTTATTTTAACCTTTTAAAGTCAGCTTTATCAGCACCAAATG 
CAGGGCCATTATGCGTGGCCCACGCGGCGTATGAGCAACGTTAAATCTGCGAATAATTCAAAGACGTGCGTTTCTTTTTACAGCCGCCTTGTCGGTAATCCCATTAAGGAGT 
TATCTGGCGAAACATTTTTGCACAACACTCGACTGGAGGCACTGTGAGTATACTTTGTTTGTTTTAATTTAAAAAATCACAAAATATATCATGAAGTTGTTAAAACGGGATT 
GAATTTGATTTGAATATATGATTTCATTTAATTTCTCACCAATATTTAAGTTCGTTGGCCCTAATGCCGATTCACATCAGCAGCAGTTTAATGGAGCCACTGAATATATCGT 
TTTTGAATCTCACGGGAATTCGGTATGATCACATCGACTTCGAGGCCATCAATTCAATGCGCAACCTGACGTACATCATCTACGATCGATTCTTCTACTGCTCAATGACGCC 
GCGGGTGAGAATGTGCAAGCCGTCCACCGATGGAGTCAGCTCCTTTCAAGATCTTCTGAGCAAACCAGTGTTACGTTACTCCGCCTGGGTGATGGCCACGTTGACAATAGCT 
GGAAATGTGCTAGTCCTATGGGGACGCTTTATCTACCGCGATGAGAATGTGGCTGTGACAATGGTGATTCGGAATCTGGCTCTGGCTGATATGCTGATGGGCTTCTATTTGG 
TCACCATTGGAGTGCAGGACTATCGCTACCGCAACGAGTATTACAAGTAAGTCAAATGTGATGGATTGGCATTTGTTACGCCCCAATCTTTTAGGGTTGTGCTGGACTGGAT 
AACCTCGTGGCAGTGCACACTGATTGGAACGCTGGCCGTGAGCTCCTCGGAGGTTTCGATGCTTATCCTGGCTTTTATGTCGCTGGAGCGGTTTCTCTTGATCGCCGATCCC 
TTTCGCGGCCATCGCAGCATTGGAAATCGGGTGATGTGGCTGGCTTTGATTTGCATTTGGATTACGGGCGTGGGCTTGGCTGTGGCACCGGTGCTCCTGTGGCGGACCAGTA 
CCCTACCCTACTACGGATCCTATTCCGGAACCTGTGAGCACAATGTTTATGATAAAATGTAGTTTACCAAGTGACGAAATTAAACTTAACAGGCTTTCCATTGCACATCCAC 
GAGGCCTTCCCGATGGGCTGGCTTTACTCGGCTTTCGTGTTCCTGGGCGTCAATCTGCTGCTGCTGGTGATGATTGCGATGCTCTACACGGCACTCCTTATTTCGATATGGC 
GAACGAGGAGCGCCACTCCGCTCACTCTGTTGGATTGCGAATTCGCTGTGCGATTCTTTTTCATCGTGCTGACCGACTTCCTGTGCTGGGTGCCCATCATCGTTATGAAGAT 
CTGGGTTTTTTTCAACTACAATATCTCCGATGACATCTACGCCTGGT'rGG'l'GGTCTTTGTGCTGCCACTTAACTCGGCGGTTAATCCTCTCTTGTACACCTTCACCACGCCC 
AAGTACCGAAATCAAATCTTCCTTCGCGGCTGGAAGAAGATAACTTCTAGAAAGCGAGCGGAGGCTGGCAATGGAAATGTGGCCACCACAACGACGGGCACGGCAACTGGAT 
CCTCTCAGCATCCGGATGACTTCACGATCTTTGCCAAAGCCGCGATGAGATGCCATTAGGTCCTAACGCTGTCGAACTGAAGGATTCTTGTGGCGCAGATCGAACTTGTATA 
CCACCAGATTTAGCAGCATTTATTTTGTCTATGATATCGAAAATCTATATATGTATGTGAATATCTATCCCTGTCCGTGTAAGATAACCAAATGAAAGTGTGAAATAAATTG 
AATGTGATATTTGTAAGGCAATTTTACGGAATTTAAAATGGGATGCGAGTTAGTGTAAGATTAAGCTATTATCTAAGTTTTGGTAATAGTTTTAAATACTTTTTATTTAGAA 
ATAGCAGCGCTATCATTTTTGGCATGGTGTGATTCTTTCTGGTCCGAGTAAATCTTTGCCATTACCAACTTGGAGCGCACGATGTGAAACTTTCGCATTTCCTCCAGGTCCT 
CCATCTGCACCACCATGTTATGCTTACGCTCCAGGTGCATGCGTAGTCTAAAAAATTATTAAGGAGTGTTACTTTCAAATAATGTACACTTTAAACAAAATAACTAACTTGT 
TACGAGCTGGATAGCTCTGATGGCAGTGGGCGCACTCAAATGGCATGGGCGTATGCGTGCACTCGTGATTCCTGAGAGATGAAGATGAGGTAAAGGACTTCTGGCAGCCGGG 
AAACTCGCAAATGAAGCGAGATCCAGAGTGCTTAACCTGCTGGTGGACAGTCAAATGGGATCGCTGTGCGAAACGACGATCACAGAGATCACAGGAATAGGGAAAGTCTCTG 
TTATGATAGCGTTTATGAATCTCCAGGGCTTTGGCAATGGCAAAAACCTTGCCGCATGTGTCGCAGTGGACTCCAGACTGCCTGTGCTTCGAATCGACGTGAACTTTCAGCG 
CCTGCTTTGTGAGGTAAACCCGGTCGCACTGGCTGCAACGAAAGTTGGGACGATTGGCCAAATGAATCATCTGGTGCTTGGCCATGCGGGAATGGTTCCTGAACTCCTTGTG 
GCAGAACTTGCAGTTGGATTTTCCCGTATGCCGATTGCGTGGCACAATGTC 
(SEQ ID NO: 25) 

Exon: 1001.. 1278 
Exon: 1327.. 1601 
Exon: 1661.. 2187 



Start ATG : 1 
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Start ATG : 1001 
Transcript No. : CT16185 

ATGACGCCGCGGGTGAGAATGTGCAAGCCGTCCACCGATGGAGTCAGCTCCTTTCAAGATCTTCTGAGCAAACCAGTGTTACGTTACTCCGCCTGGGTGATGGCCACGTTGA 

CAATAGCTGGAAATGTGCTAGTCCTATGGGGACGCTTTATCTACCGCGATGAGAATGTGGCTGTGACAATGGTGATTCGGAATCTGGCTCTGGCTGATATGCTGATGGGCTT 

CTATTTGGTCACCATTGGAGTGCAGGACTATCGCTACCGCAACGAGTATTACAAGGTTGTGCTGGACTGGATAACCTCGTGGCAGTGCACACTGATTGGAACGCTGGCCGTG 

AGCTCCTCGGAGGTTTCGATGCTTATCCTGGCTTTTATGTCGCTGGAGCGGTTTCTCTTGATCGCCGATCCCTTTCGCGGCCflTCGCAGCATTGGAAATCGGGTGATGTGGC 

TGGCTTTGATTTGCATTTGGATTACGGGCGTGGGCTTGGCTGTGGCACCGGTGCTCCTGTGGCGGACCAGT^CCCTACCCTACTACGGATCCTATTCCGGAACCTGCTTTCC 

ATTGCACATCCACGAGGCCTTCCCGATGGGCTGGCTTTACTCGGCTTTCGTGTTCCTGGGCGTCAATCTGCTGCTGCTGGTGATGATTGCGATGCTCTACACGGCACTCCTT 

ATTTCGATATGGCGAACGAGGAGCGCCACTCCGCTCACTCTGTTGGATTGCGAATTCGCTGTGCGATTCTTTTTCATCGTGCTGACCGACTTCCTGTGCTGGGTGCCCATCA 

TCGTTATGAAGATCTGGGTTTTTTTCAACTACAATATCTCCGATGACATCTACGCCTGGTTGGTGGTCTTTGTGCTGCCACTTAACTCGGCGGTTAATCCTCTCTTGTACAC 

CTTCACCACGCCCAAGTACCGAAATCAAATCTTCCTTCGCGGCTGGAAGAAGATAACTTCTAGAAAGCGAGCGGAGGCTGGCAATGGAAATGTGGCCACCACAACGACGGGC 

ACGGCAACTGGATCCTCTCAGCATCCGGATGACTTCACGATCTTTGCCAAAGCCGCGATGAGATGCCATTAG 

(SEQ ID NO: 26) 



MTPRVRMCKPSTDGVSSFQDLLSKPVLRYSAWVMATLTIAGNVLVLWGRFIYRDENVAVTMVIRNLALADMLMGFYLVTIGVQDYRyRNEYYKWLDWITSWQCTLIGTLAV 

SS3EVSMLILAFMSLERFLLIADPFRGHRSIGNRVMWLALICIWITGVGLAVAPVLLWRTSTLPYYGSYSGTCFPLHIHEAFPMGWLYSAFVFLGVNLLLLVMIAMLYTALL 

ISIWRTRSATPLTLLDCEFAVRFFFIVLTDFLCWVPIIVMKIWVFFNYNISDDIYAWLVVFVLPLNSAVNPLLYTFTTPKYRNQIFLRGWKKITSRKRAEAGNGNVATTTTG 

TATGSSQHPDDFTIFAKAAMRCH* 

(SEQ ID NO: 27) 

Name: TSH, FH, LH receptor-like 
Classification: G_protein_llnked_receptor 



Celera Sequence No. : 142000013383791 

CACTCAGATTTGCCGACAGCAGAAATATTTTAGGGTTGTATCTTAGAGGTGGAGAAACATCGATGCCAACATCGATGTCATCGATGTTTTCGAAAAATCAAAAACGTCGATG 

TTTTTCGAGGAATCGTCGATGTTTTCAGCGCATAGAAACAGCAAAGTGTGTAAATTGTAAGTCTAAATTAATTTCCATATTTTCTAATACAATTATTAATAGCATATTGCAT 

ATTTATTGTGACTCGCCGGTGGTGTAGGCTGGTGTCTTGGAAGACATGGATAAATATCTGAAATGTAAGTTTTTTTTTTAAAGATCAATTTATTTGTTTTTTGCCAAAGACA 

TCGAAAACATGGATGTTCGCCAACATCGATGTTTCTCCACTTCTAATGCCAACGAAAACGTGACTGGGAACGTATCGATGGGAAAAACGTTACAAAAATACCAACTATCGCA 

TAGCGGAATCACCTTGGCGGGCTATTCGAATGTATCTCACAAATAGTTTTTAAGGAACCTACAGTCAGTACCAAATACTTAAAATTCCATTTATATTTTATTTTATGCTACA 

GAAAACTGATAAGCTGCATTACTTTTGAAAAGTTAAAAATTGTGTTTTTTAAATGATTTATTTATATAAATAAGCTTATGTAATACTATTGTTAATATGTAAAAAAAAAAGA 

ATAAAAGAAAAAGAATAGTTTACCTAAGCAGTTGCTTGAATAATACATACATAATTTAAGTTTTATACATAATTTATCAACCCTCGCTCAGTAGGAAAAAAAGATGGCTCAG 

AAATGAGCAACGAATTGGAATGTCCGATAAGCCTTTTTTATCAAAATAATAATTTTTACATTCCCTTTTCATAACGCTTTCCCAGTTAGTTTGTGCCAATCGTAAAACAGCA 

TCCGCAAAGTGAAGTCGCTGCTATTTTGTAGTACCACTACATTCTTATTTATTTAAGTAATATTTAAAGAATGTCCAGCTAACTATATGATTGCTAGTTCAAAAATGTTACT 

TTCTGCATCGATCCTAATATATTTCTTGCTGAATCTGCAATCTTCGAGTGCTGAGATTGCCGACTGCAGTTTCTATGATACCGTTGATATCTCGGAAGGCCAGAGGCTCTCG 

AATGGATCATACCTCTACGAGGGATTACTTATTCCCGCCCATTTGACGGCCAAGTATGAATTCAAGCTTCTGGCGAATGGAGATAAGGAGCAGGTGCCAAGTCACGTGAGAG 

GATGTGTGTGCAAGCTAAGGACTTGCGTCAGGTTTTGCTGCCCACATGACCACATAATGGATATGGGCGAATGCTACGCCAACATGACGACGGAGGAGAACGAATTACTGGA 

CCCCATGCTAAATGTGACCCTGGACGACGGATCAGTGGTGCAAAGGCACTATAAGAAGGAGCTCATGGTGCAGTGGGATCTGCCAAAGCCCTGCGATGATATGTTTTACCTA 

GACAACCGGGATATAATGGATGAGTACACCTTGTTCGAGGTATGGCTGCTCGAAAAATAAGCATTTTTGGGCAGGATTTATTTATGCTTTGGCTCTCTGCAGAATGGAAGAT 

TGCTGCGCCACTACGATCAAGTGTATTTGGACAAGTCGGAGTACTGTCTGCAGCACCGAACATTCGGCGAAGGTAACAACAACTCCATTCGCATTATACCCCACAACTGCTT 

GATACTGCCATCCAGAACGGGACAGACCGTTGGTAAGATGATGCCCATTGCTTGCATGGTAGCTCCCAGTTCTATATCTTTCCTATTTCCATAGTGATGATCACATCGTTGA 

TATGCTTGGTTCTTACCATCGCCGTGTACCTGTGCGTTAAGAAGCTAATGAATTTGGAGGGCAAGTGTTTCATCTGTTACATGATGTGCCTATTTTTCGGATACCTGTTCCT 

ACTCCTTGATCTGTGGGAGTTATCGCTCGATTTCTGTAAAGCAGCAGGTTAGCTTAGAGGGGATGCATTTCAACACAACTTGACAAACCATTGACCATTTACGACAGGTTTC 

TTGGGATACTTTTTCGTCATGGCTGCGTTCTTTTGGCTATCTATCATCAGTCGGCACTATTGGAAATGTCTTACCAATCCCTGTGCCTCGATGAACATTCGGTCAGAACGTG 

CGTTTCTGCTATACAGCTGCTTCGCCTGGGCAATGCCTTTGGCCTTGACCGGAGTCACCTATCTGGCTGATAACGTTGTGAACAACGAGGAGTGGCAGCCTCGCGTGGGCGA 

CGAAGGGCATTGCTGGATCTATAGTGGGTTCTGTTTTGTATCCGGCTTGTGGTACCAGCTCACCAATGTTTTTCTTTCCACCCTCAGCTAAAAGCTGGTCTGCAATGGTGTA 

CTTCTATGGACCGATGGTGCTCCTGATTCTCTTCAACATAACTATGTTTGTTCTGACGGCTAAACACATAATAGATTCCAAGAGGACTCTTCGAAAGATTGCTCGCAATGAA 

GGTAGAATACAGAAGCTGAATTCGGACAAACAGAAGTATATCAATTATAATAGTTGAAAATGTCAGTTTAAGATAAATATTATCAATTCGTTACAGTTACACACAATTTCTG 

CTGCTCTTCACTGTGATGGGGATGTCGTGGAGCTTTGAGATATTCTCCTACTTGGTGCAACGCGAGAAACTGTGGGTCAATATCTTTCTGGTAGCTGACTACTTCAACTGGT 

CCCAAGGCGTCATTATATTTGTGCTATTCATTCTGCGGCGCAAAACTCTTGTACTATTCAAGAAACAGTAAGTCTTATAGTTCATAGTTCGGGTAACCAGTTCCCTAAAATG 

TCTCTTTTCAGAATATTTCCCAAACAGAGAGCCTTTAGTAGATCTGCAACTCAAAGTACGATTGAATCCATTTCTCAAACAAAAAGGCATTTTAACATGACATAGTATTTTC 

TTCTCTTCGAATTAATCAATTTAATAGTAAAGTATAACTTTTTTTCTCACAAAATGTATGAATATATTTCACTTCATTCTACATTCTATATAGAATTCATTTAATCACAAAA 

TGTATGAATATTTTTCTTTATATGACGTGCATCGACTATAGAAAATCGACTATAGTGTGCCACAATATATTTTACTCATCATTTCCTAATCAAACTATTGCAAACTTAACTT 

TGAGCCGAGGAAAAACCCTGTATGTCTATCAAAACTATGAGTCAAAAGTATGACGCGATGCTGAAAGTAGTTGCCTTTAAATATTGAAGTGATAAATGTAGAAAGAAGACAT 

TTGTATGGAAATAAACTAAGTTTGTAAATCTACATTCGCTTTATTTCATGTACAAATTTTAATTTAAAAACTTTGAAGGTAAATAGTGCTTTATCAGATGCGGATTACACGG 

TAGCAGTTATCATCGATGTCGAATCTTCTTGTTTATGTGACCGCTTTTCTTCGCTGTCGCACTGGAGTTCGTGGATAGTTTGCTCTTTGTATTCGAAGTGGGTGTGCCGAGG 

AGATCCTTGTTCCCTCCACTGTTCGTAGATCCAGACACCATTCCGCTCACTACGCTGCCCTCATTGCGATGCGAGGGTGAATCTGTAAATAAAAAGGTGTAATTAAACATAT 

ATACTTTTCTGGAGAAAAGCGCTAAAGTTGAATCTCAATGTGCTGTGCTGCCTTACCCTTTTCGTCTTTCGGTGTGCTCTTGATTGTCGTGGTTCCGTCGTGGGCCGTTGTG 

GTGTTCTGATTGATTGAGATCTGATTGTCGCTGGAGTCTTCATTTGTGATGGAATCCGATTCG 

(SEQ ID NO: 28) 

Exon: 1001.. 1495 

Exon: 1559.. 1712 

Exon: 1775.. 1951 

Exon: 2012.. 2263 

Exon: 2328.-2499 

Exon: 2561. .2759 

Start ATG: 1001 

Transcript No. : CT16507 

ATGTTACTTTCTGCATCGATCCTAATATATTTCTTGCTGAATCTGCAATCTTCGAGTGCTGAGATTGCCGACTGCAGTTTCTATGATACCGTTGATATCTCGGAAGGCCAGA 
GGCTCTCGAATGGATCATACCTCTACGAGGGATTACTTATTCCCGCCCATTTGACGGCCAAGTATGAATTCAAGCTTCTGGCGAATGGAGATAAGGAGCAGGTGCCAAGTCA 
CGTGAGAGGATGTGTGTGCAAGCTAAGGACTTGCGTCAGGTTTTGCTGCCCACATGACCACATAATGGATATGGGCGAATGCTACGCCAACATGACGACGGAGGAGAACGAA 
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TTACTGGACCCCATGCTAAATGTGRCCCTGGACGACGGATCAGTGGTGCAAAGGCACTATAAGAAGGAGCTCATGGTGCAGTGGGATCTGCCAAAGCCCTGCGATGATATGT 

TTTACCTAGACAACCGGGATATAATGGATGAGTACACCTTGTTCGAGAATGGAAGATTGCTGCGCCACTACGATCAAGTGTATTTGGACAAGTCGGAGTACTGTCTGCAGCA 

CCGAACATTCGGCGAAGGTAACAACAACTCCATTCGCATTATACCCCACAACTGCTTGATACTGCCATCCAGAACGGGACAGACCGTTGTGATGATCACATCGTTGATATGC 

TTGGTTCTTACCATCGCCGTGTACCTGTGCGTTAAGAAGCTAATGAATTTGGAGGGCAAGTGTTTCATCTGTTACATGATGTGCCTATTTTTCGGATACCTGTTCCTACTCC 

TTGATCTGTGGGAGTTATCGCTCGATTTCTGTAAAGCAGCAGGTTTCTTGGGATACTTTTTCGTCATGGCTGCGTrCTTTTGGCTATCTATCATCAGTCGGCACTATTGGAA 

ATGTCTTACCAATCCCTGTGCCTCGATGAACATTCGGTCAGAACGTGCGTTTCTGCTATACAGCTGCTTCGCCTGGGCAATGCCTTTGGCCTTGACCGGAGTCACCTATCTG 

GCTGATAACGTTGTGAACAACGAGGAGTGGCAGCCTCGCGTGGGCGACGAAGGGCATTGCTGGATCTATACTAAAAGCTGGTCTGCAATGGTGTACTTCTATGGACCGATGG 

TGCTCCTGATTCTCTTCAACATAACTATGTTTGTTCTGACGGCTAAACACATAATAGATTCCAAGAGGACTCTTCGAAAGATTGCTCGCAATGAAGGTAGAATACAGAAGCT 

GAATTCGGACAAACAGAATTACACACAATTTCTGCTGCTCTTCACTGTGATGGGGATGTCGTGGAGCTTTGAGATATTCTCCTACTTGGTGCAACGCGAGAAACTGTGGGTC 

AATATCTTTCTGGTAGCTGACTACTTCAACTGGTCCCAAGGCGTCATTATATTTGTGCTATTCATTCTGCGGCGCAAAACTCTTGTACTATTCAAGAAACAGTAA 

(SEQ ID NO: 29) 



MLLSASILIYFLLNLQSSSAEIADCSFYDTVDISEGQRLSNGSYLYEGLLIPAHLTAKYEFKLLANGDKEQVPSHVRGCVCKLRTCVRFCCPHDHIMDMGECYANMTTEENE 
LLDPMLNVTLDDGSWQRHYKKELMVO^DLPKPCDDMFYLDNRDIMDEYTLFENGRLLWlYDQVYLDKSEYCLQHRTFGEGNNNSIRIIPHNCLILPSRTGO/rVVMITSLIC 
LVLTIAVYLCVKKLMNLEGKCFICYMMCLFFGYLFLLLDLWELSLDrcKAAGFIXSYFFNTC^ 

ADNVVNNEEWQPRVGDEGHCWIYTKSWSAMVYFYGPMVLLILFNITMFVLTAKHIIDSKRTLRKIARNEGRIQKLNSDKQNYTQFLLLFTVMGMSWSFEI FSYLVQREKLWV 

NIFLVADYFNWSQGVIIFVLFILRRKTLVLFKKQ* 

(SEQ ID NO: 30) 

Name: mth-like 2 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384913 

ACGTTGTCAGTGTGTGTGACGGTTGTTAATTTCATTTGACTTGCGAGGGTTTTAATATTGCAATGGAAATTCGGTACAATTTGCCGTCCATTAAATAAGCACGCAATTAATT 
GTGGCAGCCATGACAGGCGTGGCTTGCCACCAATTAAGTGGAATACTATATGGCTATATCGCTATATCGCTATATGTCTATATGGCATATAGATGCCCACAGCCCCTTGGGG 
TCGATCGCCTACGCCGTCTATAATCCCCAGCTAAACTTCGAGACATCGCATCCGGCACCGCAATCCGAAATCGGAATGCTGATGAGGCAACAGAGTGGACGCGGCCGGAAAC 
AAGGCACTCCTCCCATCAATCTGTCAGACGGGCCAGTGAGTCCTGGATGTCCTGGTGAACGCAGCTCAGCTCCCAGCGGGTGAGGCCAGTTGCCCAGACAGTTAGTCCGCCA 
GGCAGTCAGTCGGTCTGTCAGTGAGTCAGTTAGTCAGTGAGCGAGTGAGTCAGTCAGTCAATCGAACAATCAATCATACGGAATGACAGCTACGTCGGCTTAAGTCAAAGGC 
AGTCGGCCAAGAAGGGATCTTGTCTAACTTACAGTCGCAAAATTCAATAGCCTGTGGCAATCGCACTTGCAACTTTAATAATTCCAAGAACGAAGCAGTCATAATGTTTACT 
GAAATT^TGTTATGCCACAATTGGCACATACAAGCTCATAATAGTGTTTTATTCTGTTTTTAATTGCACTTTAAAATTGATGATTGTATTAGCCATCATAGGCATGGCTTGA 
GCTGTTCATTTAGCCATrTGATrTTGTCTACGTTGGTTTTTTATC 

TTTCGCATTCCCTTAAACATTTGCTGGGGAAACACATTGATTTATATGCCGGCATCGCCAAAAATTGAATGTTATTGCTTGAATGGCCCCAATTTACTTAAAAGCTACGCTC 
CTCCGCCCCACTGCCCCTCTGCCGCTCCACTGACCCCTGACCCCATTATACATGTTGTGGGCTCCACTGGCTCCTGATTGGTATTTATGAGCGTGTTTATTTATTATATGAC 
CCAACACTTGATAAAATTAAACGGAAATGGTTGGCAATTTTGCATTTGATTCTGACACCCAAACACACAGGCAGGCAAATTCGCAAATGAACTCATCTGCCCAATATTCCGA 
CCATAAAGCATCCCGACCAACGGCACTCACATATAAGTTAGTTAAGTATGTATATTAGTGGCTTGGCAACTTACCAGAAAGTAGACAGCAAATGGCATCACGACCACGGCGA 
CCAGGAGATCGGCGATGGCCAACGAGACTATAAAATAATTCGTGACCGTCTGCAGGGAGCGCTCGCGGCACACGGACAGGATGACCAGGATGTTGCCGAAGAGGGTCAGGAT 
GGGGAACAGAATCAGGATGAGCGCCCAGTAGTTGTGGTCCACTGAAACGAAGCGAGTTGTAGTTACATTATATCCGGGATGCATCCATCCGCAGGCTCTCCTGCTGCTGCCA 
CCCAAftTTCCCTGGCCAAAGTTCACCTTGCAGGAAGATTTCAGGGAAAGCTCAACGGGCACGTAAGTCAATTACACAGGCAGTCCGTTTAATCAAGGACTCTGGCTGACAAC 
TAATTTTAATGGCMCACAAGTAAGGACTAGCACACAAAGACGTGAATCTTAAAGGCTGTTTTAATATAATCTATCTAATATAGAGTGATGCATGTTACCAACAGCTGCGTT 
GTTAAAAATTAAAGTTATTTCACATATTAAAAATGGCAAAATATCGCTATGT^ 

CAAATATGTAGGAACTTTTCTTGAGATTGCACGAATAAATGAGTTATGTAACGCGTTTAAATTAAATTCTTGCTATCAAAACTGATTTCTAAATGAACATCTTTATAAATAA 

ATTAATTATCGGCGGTTTCATTCAAATACTCCCATCTTAGAAATCTAAGGAATTCTGTTCTCAAAACCTAGTCACCTCACTATCCGATGAAAATATATTATACTAAAGAAGT 

TGCTGTTGCAAGGTTTAAATGATGCTCCAACTTGATGATAATGTTTAAAGGTATTTAATGAGATTCAAGGCTATCTAATCTAATTTAATAATTTTAAAGCCTTTGAATGATA 

TTCAATAATATCTAATGAGCTTTAATGATTTTTAAAGATTTATAATGCGGTTTTAAGATGTTAAGTTGTAGTAGTTTGATGATTTCGAATGGTAATCCTGCCTCGTACTCAC 

CCACTCTCAGCTCGCCGCAGGAGTCGGTTAGGTTGAGCGTCTCGTTGGTGCAGTTCAGCCGGTAGCCGTCGAGGTACAGTCCCGTCAGGTTCTGCTCGAGGAGCAGTCCGCT 

GCCCCTGCTCCCCGTGATGGCCACTCCCGGCGCAATGGTGCTTGTTCCCGAAACCGTGCTCCCATTGTAGTTGGGAAAATAATCGTTAAAGTTTTCGAGCAGCAGCATGAAG 

GTGCTGCCCCCTGATCCTCCAGCGCCGCCTACACCTCCGCCCGCCGTACCGGCCAGTCCGCCAGCCACGCCCACATCCCCAGAGCCGCCTCCATCGTAACTGGTCGTCGAGG 

GCGCCGTTCGATTGCCCAGCGTCTCGCGCGCCACCGTTGCTGTCCGCTGGAGCAGTTCCGCATCACCGAAGCCGCCCAGCATGCCATAGTCCGTTTCCGCATCAGCCACATT 

GTCCAGGGCTGCCTCCTCCCCGCTGTCGATCAGCGCGATGCCAGCATCGCCGCCGTCGCCCGCTCCTGCTGCCCCTCCGCCCGCCAGCTGTTGTTCCACAAAGCTGCGCAGC 

GAGTAACGCCCCCGCTCACCATCCTCCGCACTGGAGTTGCTCTGGGCTGGCTGCGTGCTGTTCAGTGCCTCCAGCCAGGCCAGGGTTCCGTTGGCGCCCGTCACCGCCAAGT 

TTGTGCTGTTCTCGGGCCCGCCCACTGCCGCCTCACTGCCCACCGAACCCAAGAGCCATTCGCTGATGTCGTCGATCAGCGCGTAGCCGGCGTCCTCCGCCGCCGTTTCCAC 

ATCCTCGTAGAGCGGCAAGTAGCCGGCTGGCATTGTGGGGCCCTCAACGCCCGCCTGCGCATCGCTGCCGTTCGCCGGTTGAGAGCTGTCCGCTTCCACCGCCAGGTGGTGA 

TTTATCCACGTGGCTCCCGCGGATGGGGCTGCCGTGGTCGAGGTGGAAGCAGCGGCTGTGCTCAAGGTCGCCGATGTGGTTGCTGCCGTTGCTGCAGTTGCCGCAGTTGCTC 

CCGTTGCCGCCGCAGTTGTTGCCGCCGTGCTGCTCAGTGGCTGGGCTGCCATGGTGCCGCCGGTTCCGCTGCTGCTGATGCCCCGCCGCCAATCGAATGGGCTCAACATCTA 

GTGGGGCGGGGGCGTGGCAGCGGAGGCGAGCTGCAACTCCTGCAACGCCAGCGGCATGTTTCCGTTTCCTGCAGCGAAAGAGACGAAGTTCGGAGGTGAGAAAACATAAATC 

ACAAAAATTTCCATTATAATCAAAGTTCTTGAAGATAGTTCTTAACGATACGCACAACTAAGATTACAATTTTAAGCCTAATTGGTCGCGTTAATGTTGCTTAATTTGATAA 

ATTCTTGTAATCCGCAAATGAGAGTCATGGTTCTTTTCGACTTAATATTTTTAGGATATCTAAAAAACAAAGTCGTTCTCAATTTTTCCTACTTACACATTACACATTGCAT 

GCCCCTTCGTTTGCATTTTCAACACAAGGCGTACTTCATGCTTGACCACGCCCCCCCAACCAACACAACAACATACCATCCAGCACCACGCCCCCATGCCATTTTGGACGTA 

CGGACCCTTTGCCCAACTGCCATGGCCGCATGTTTAATTAACTCTTTAATCAATTTATTGTCAATTATTCTACAAAAAGCGACAGCGAAAATTGTGTAATTTAATTAAATTA 

GCAAAAGGTTTTTGTGTCTCTCTCGCCCCACCAACCATTCCGCCCACTCCCACTTCCATTCTCGCAACCAAAATGCCACGCCCCCATGCTCGACGCTGTCTTGCTTTTACAC 

ATTTTGCAATTTTCGCTTTTGTATTTGCCAATTGGCGCTGATTAAAGCCGCCCGAGTCGATGGGCCATAATTAAAAGATGGCCTAATGAGAGGCGAGGCGAGTACGTCTCGA 

GTTGGCTGGGGGTCAAAGGTAATCGGATCGGAGTGGTCTTAAGTCCCCTGCCTTCTGCCTCTGTCTTCCGGAGATCGGGTTATTCGAACTCTGAGCTTGTGGATTCTGGACT 

CTGAATTCTGGATTCCGGATCGGAGGCAGAGATCCGAGAGCTGCACACCCAGACGGCTGGCACTAATCCAATTTGGGCACACTTCCAGGCGACGAGAGGCA 

(SEQ ID NO: 31) 

Exon: 3469.. 2353 
Exon: 1497.. 1307 
Exon: 1081.. 1001 

Start ATG: 3469 (Reverse strand: CAT) 
Transcript No. : CT17758 

ATGTTGAGCCCATTCGATTGGCGGCGGGGCATCAGCAGCAGCGGAACCGGCGGCACCATGGCAGCCCAGCCACTGAGCAGCACGGCGGCAACAACTGCGGCGGCAACGGGAG 
CAACTGCGGCAACTGCAGCAACGGCAGCAACCACATCGGCGACCTTGAGCACAGCCGCTGCTTCCACCTCGACCACGGCAGCCCCATCCGCGGGAGCCACGTGGATAAATCA 
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CCACCTGGCGGTGGAAGCGGACAGCTCTCAACCGGCGAACGGCRGCGATGCGCAGGCGGGCGTTGAGGGCCCCACAATGCCAGCCGGCTACTTGCCGCTCTACGAGGRTGTG 

GAAACGGCGGCGGAGGACGCCGGCTACGCGCTGATCGACGACATCAGCGAATGGCTCTTGGGTTCGGTGGGCAGTGAGGCGGCAGTGGGCGGGCCCGAGAACAGCACAAACT 

TGGCGGTGACGGGCGCCAACGGAACCCTGGCCTGGCTGGAGGCACTGAACAGCACGCAGCCAGCCCAGAGCAACTCCAGTGCGGAGGATGGTGAGCGGGGGCGTTACTCGCT 

GCGCAGCTTTGTGGAACAACAGCTGGCGGGCGGAGGGGCAGCAGGAGCGGGCGACGGCGGCGATGCTGGCATCGCGCTGATCGACAGCGGGGAGGAGGCAGCCCTGGACAAT 

GTGGCTGATGCGGAAACGGACTATGGCATGCTGGGCGGCTTCGGTGATGCGGAACTGCTCCAGCGGACAGCAACGGTGGCGCGCGAGACGCTGGGCAATCGAACGGCGCCCT 

CGACGACCAGTTACGATGGAGGCGGCTCTGGGGATGTGGGCGTGGCTGGCGGACTGGCCGGTACGGCGGGCGGAGGTGTAGGCGGCGCTGGAGGATCAGGGGGCAGCACCTT 

CATGCTGCTGCTCGAAAACTTTAACGATTATTTTCCCAACTACAATGGGAGCACGGTTTCGGGAACAAGCACCATTGCGCCGGGAGTGGCCATCACGGGGAGCAGGGGCAGC 

GGACTGCTCCTCGAGCAGAACCTGACGGGACTGTACCTCGACGGCTACCGGCTGAACTGCACCAACGAGACGCTCAACCTAACCGACTCCTGCGGCGAGCTGAGAGTGGTGG 

ACCACAACTACTGGGCGCTCATCCTGATTCTGTTCCCCATCCTGACCCTCTTCGGCAACATCCTGGTCATCCTGTCCGTGTGCCGCGAGCGCTCCCTGCAGACGGTCACGAA 

TTATTTTATAGTCTCGTTGGCCATCGCCGATCTCCTGGTCGCCGTGGTCGTGATGCCATTTGCTGTCTACTTTCTGGAGCCAGTGGAGCCCACAACATGTATAATGGGGTCA 

GGGGTCAGTGGAGCGGCAGAGGGGCAGTGGGGCGGAGGAGCGTAG 

(SEQ ID NO: 32) 

Start ATG : 1 (Reverse strand: CAT) 

^SPFDWRRGISSSGTGGTMAAQPLSSTAATTAAATGATAATAATAATTSATLSTAAASTSTTAAPSAGATWINHHLAVEADSSQPANGSDAQAGVEGPTMPAGXLPLVEDV 

ETAAEDAGYALIDDISEWLLGSVGSEAAVGGPENSTNLAVTGANGTLAWLEALNSTQPAQSNSSAEDGERGRYSLRSFVEQQLAGGGAAGAGDGGDAGIALIDSGEEAALDN 

VADAETDYGMLGGrGDAELLQRTATVARETLGNRTAPSTTSYDGGGSGDVGVAGGLAGTAGGGVGGAGGSGGSTFMLLLENFNDYFPNYNGSTVSGTSTIAPGVAITGSRGS 

GLLLEQNLTGLYLDGYRLNCTNETLNLTDSCGELRVVDHNYWALILILFPILTLFGNILVILSVCRERSLQTVTNYFIVSLAIADLLVAWVMPFAVYFLEPVEPTTCIMGS 

GVSGAAEGQWGGGA* 

(SEQ ID NO: 33) 

Name: G-protein coupled receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384638 

tttgtaccgcagtcaagcgcaaaaattcctcgaaaaaccatagaccaaggtgtttatgagtttattaatactgtgtaattattatttgtgatatatgtgtgggaaattggta 
atagatcaaacatgaaagtaatggtatctgtttttctgtattaaaataacatatctgataatttcctaaaggatgttaatgttaatttaataggtattccaaaactaagagt 
gaactggcaaaatgtaaaattagttggtatttccttttgaataagcttgacctttaacttacttcatagcctatgacaactcatatattataaatataaataataataaatg 
agtgctaataataatttttaatcaaaaattctcacaagctaaccttgtaatgatggtacgcctaactgatacccattatgttaccatggccaggcataaattcctgttactg 
gcaccgcatcatcatcatactggcattcggtcatattaaaataatatcacctatctatagggctgcggctcttttgacacgcacacgtgctgctcagcagaccaagtgaact 
tatcaatgaacacccgccacacccacacggtggcccaagcacctgacccacccatgcccagtggccagcaccacccgaactcgcagAagccaatgaatggccggcagaccaa 
gaaacacccactaacatctcgggagattcttcgtctgccgagttcaagaacaacacacaagtggccgtaatgaaatgtgaatagccttggcaaacatgagtccggaaagcca 
acttatgatggcttctgaatagccaaccgcatagtacacggctaagagcggggaccaaaaaaggacctgctctttgctcgctcgttggctttctctttcgcagcctcggctc 
ggaaaaaggatgccgacaagcaacaattttcattgataaaaacaccagctgctgggtcctgaaactcagtcgtacgcgagtcctcagtcagtcagtttcgagctctgggatg 
caacggatgcggatgccttaaatccccgagccgtttggcggtgttattgaaaattgcgtattcgccgcgtattaccaagtaatagcttgtccaaagccactacgctgtcaaa 
ggacgggcttaaaacgctacttgcatttttacgtactatacatagataaattttccgggtaatcggcggaaattgagttaaagttttgtgtccggttggcgccagttggcaa 
ttggtgctggccaactttcggacccgaacgatgacaaactgaaaagtaaactttgtaaagactcagcaccttacaaacaaattcaagtgtttcgcgaaaaaaaataaaaaat 
tttgaattgaagaggacacagttcggtgaaaaaaaaatatttgcaatgcacaatggaaaatcagcaaagtgacctccaagtttacgaaaacagaaactaactcaataaccat 
ccctgcaggtgagcagaggcaacattttttctttaaaacattttttaccatatttccgtaaacggtttgcgtcctttggcaaatttatggttgtcttggaataaaatgaact 
ttatttacttggcaacaaaacgagtgtttattcttgtcgcatagggaatcaagcagccaacataaggactatattttgtttaattcgatttaatgtgagttgcagaattaaa 
tttctttttaattcaacacgcatctgtacaccatttgtaaaattacaagctaaagctcagcaaacactaaattttaatcagtgaaacgaactgtattctgtcaattttaata 
tcaaaagc»tatacattgctcttaaaaatattaaactgcgttagtaacgtaaacagaaaatgctatgaaatttgttataatcaaaattcacttgaaagagatttcaacaaaa 
ttaccaatctcgtgttctaaggaattaatataataattataataatatcatagcttgttccctgtcacctaatatataccatttaaataaaagtaatgcctaggtaattgtt 
tgtgagcgataagcgcagatttgtctgcataattatgtgttaatttgattgaggtctactgaagagggccttatttttataagccaaaagaacaaattcaaacaaaacactt 
gagggcggccttagctggcaataagcggcttttgcctaatggaaaataagtggaatgcctaggaattattggccaagtggacgggctgctctctttctggcaatcctgagtc 
ctttcgactgattcgtttttggctcctttggccccatctccgcacgctttgttcggcttgacacacctgagtggcgatgttccacacaggcactttcccaaacaaatggaac 
ccggcatctcgcatttggttgcccgaacgtaagtcccatgaagtggtcttctccagtggacagccagtcagccgcatttcttgggggcgctcttctcgagcgcttggcctct 
gagatggcctctttggatgtcttggagatatgcagctcatcctggccacccacttcgccgagccattccatttcatttccctggccaacatacccgtatgaaacgatgagtg 
ctcttcacagtttttgcctacattccggccactgcaccgactgatggttgatggagacccattctccggttagtgttcgtcttggtcgtcttggtctgggtctcggtcctgg 
tcacaatgggaattcgcagcttaaaggcgccgtggctttattcaggccagattggatacttgacatgttctaatgcggcgcagatctcagtttctgcaacgtgctctcccta 
tctcagttacatgaattagcattcatcgccactgcactgagcaaaaaataggaccacaaatcgctctaactgctgtaaattgatatccattcaattattttatttaagtatt 
ataacacattgatagaacattaacatgtcatatatttttgaacacaattcagcattacactttaataaagtcgaatttgaatttatccaactgacgcgttgggtgatatcat 
aaattcaatttcgagaatgacaggtcattaaaatgttgtcacctacactgtcaaatatttaagtcaaaatattaagttataaaatatcaaaaagctttgatagctttagagt 
atgatttacgtttaaaaggattgtacacgaatacaattttgtttttttaagcaatcagcggcgtgtttctgttttgccatcataaatcgcagagtttcataatccctctaaa 
ttaaaatgtcctttgggatttttctgaaatttttttcaaacgcttccccctttttgtttcctgcatgtttaggccgttaagagcctgttgatgatggcggcatggtagaact 
ttctgttttgtcacagccttgggctgctggctttacatttggtggaaaaacgagaccagccgagcacgtgcggtccgcggggttggggggggttgtggaagctaaggacgtg 
gcaggacgaaagcgctgagtgcagatgaggactgcaataaactgccatacataagtgaaaattgtgaaatgaaaacttgcgcgagcataaatcaatacacccaaggggggtc 
atgggctcccatggcaggcaacaaaaaaatatatatgtatatatatcagctgggaaatgtgctccctactcaggtttagcgacttgtatttcatgttcccaaatcagctggt 
gttatcctttgattgccgggcacacgacaaaagtggtggaccaggacacatgtgatgattgcttgcgtgggtcgcaggaatgagccaactcagattggtaatattgtgtatg 
aacgtgatttcgtgactttgcaatgtttctatttaacaaaaactcttgttaaagaaatacacacaaaacatcacttataaccctttttggatatctatgaggatttaaggtt 
tgtaaactaaggcaatggtataccttaaaaagtctatagtaaatcgcatatattacatcaattttaacgttcatgctgatcttccagttccaaaatatatattttgaaatag 
cttacatttaattaagtgcacttgctataaaagtrttgagcttgcctgaagtttattacataaaaaattcgcagaaaggcggctaaagaatgaaactaatttaagcccaaat 
gataaactatttgccaagcatgtttcgtttgaattatgctcccgttgctctgcctaccaaaaatggcaaaaactcgcaatatgattattgtggcagcttcttccggtgaatt 
aaaaatacaaaaaaccagctcaaattgaaacttaattttaataaattgttttcatgctctttttcgctgccgttgtagcgttttcccgcccatttcacccgagtggctggat 
aattgtatccgcgcaatcatcccgcggcttccattgccgcaggataaaaaccaaactgcaacgaacagaacagaactgagcactttaagccatttccactcgtaattaaatt 
ttgagagcggatgaaattgtggcggctaaataaaaggaacaaaatgcatctgtgcatacagcaaaaaaaaaacagcaaaaataaataacaaaatgcagaagagcaaaaaatc 
gcgcatatccgctttgtaaaaatacgagtctaagtgatgggttccgtttttgtttagtagtgctcactagtgaacggccttgtcggatttggggctgatagtggatgtagtg 
tggggtgtattgtggtgtgcatgggggtgtttgaatcgcagaagtcggcccagcccgttggccgtcaagaggacctggcctaatcgaatgaatgtagaatgggcacaaacaa 
gtcacgcctgcctgcgcatttcatcaaagaacggcgaacaagaggctaacagctcttcgtttctctctctctttctctgctcagctgggtcctttgaagatcctgaatgaca 
ctcaattggcctaatcggtttctgaaaagcatagtatgaatttcatacggtacgctaatttcgaatataacaaaaagcaaaaacgcttttgatatctaaattgtatttgcca 
ttgaaaagtttatacctatatgtataatatggaacctttctaaatatagtaattgatacgaaattgcaaaaatgatggatgaacatctcaacattcaacaggttgcccagtt 
acttaatcaatccatcggtgttgaaagtgagcactttatacaaaaatactcatgagtggttctaaagcggataagaggatttatcttgagctcaaaaactgatgcagtgatg 
caatattcaagatgtgtgccttagtattcaacatttttgccagatttatcgaaacgtaaataaataaaaatatacatgaaatttaataacattgcgcagctttttggtgaaa 

GCAGTTTTAATTTAATAAAGCAATCGTTGCGTTGAAACTAATATTATTTTAAATTATACAGGATACAAAAAATAATATACATATACATTTAAATTTTGAATTTAAAAAGGGG 
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TGTGACAGATTTACTGAATAATAATAACTTTAAACATGCCACCTAACAAAAGCGAAAACAATTGTTTAATTACATTTGTGCAACTTGTGGCATTTAGTAAAAGCGAATATCT 

AAGCAAATAAGTCGTGTAATTTGATATGCAATATAGCAACCAAACAAAAATGCGTGTTATTAAAATCATGATTACTGCCATACGTATAGATGTTTCATGTCCTGTCCCTTGT 

TGATTTCATGCTGATAAAGGACATACACTTTTAAATCTTGCTAACTTGGTACTTTTAACGAGCATTCGAAGGTATCCTGTGCCCATTTCAAAAAAGCTGTAAGCTCTTTGGC 

AGCCAAAAAAGCTGACAAATTTGTGCCGGAAAGGAAACAACTTCCTATTTGCATGATTTATTGCCAATGGAATAACAGGTTGGCTTCAGTTGCGTATGTCCCTGCTATTCGA 

TTTTCAACGAAATTTCGGTTGGCATTCAGTCGGAATGAAAGTGAAAACAAAAAGTGCTACTACTACCCTTTTGTGAAAGCCCCCGGAGAAAAGACAACTTTCAATTGGCCAA 

TTAGTGCTA1TTAAATGGCAAGTTACGCCGTTTTCAAAGGGCCAACTATTTTGTTTTTGAAGTTCCGCGAGTTGTAAATGTAATAAGCAAATTTCGCAGTTCTCCATAGGAC 

ACCTTTACACTGAGTTCCTTTCCAGTTGCTCAAGTTTTGCTCCTTGGCAGCTGCACTTGTTGTTCCTGTTTGCCAGAAAGTTAGTTTCTGGCCAAGAATTTGTCTTTGGCCA 

AGTACTGCTCACGCTCGGTTTCTTTGGGCAAAAGTTGGTATTTGAGTTTCGCAGrGGAAAAGCACTTGCGGCAGATCTTCAAATTCTGCCAAACCCGGCGGGCTCACGTGGC 

CGCCTTCTAGTTATTTAATAAGTAATTAAGTTCCCTAATTAAATACATTCTCGCAGAGATTTACATAATAAGTAATCTACGCCGAACGTCTTCCTCCATTGCGGCGTTAATT 

AAATTTGCAGACCGATTGCATTTCGGAAACTTGATTTGGTTTGCGTCTGGCTGCCAGTCGTTAACAAATCATGATTGCTGAAATGGTCAACTTTGGGCATCGCACTTCGTAT 

ACACTTGAGATTAAAAATTCTTTAATAAGGGGAAATAACACTTATTTATAGAAGTATTTTTAAAATATATACAATTAATCTAGTCAGCGCTTTTGACGTAATTTATATTGAG 

TTCGAACAACTCATTTGACAACATCAAAGAATCCAATATTCAGCAATTTTAAACATACTCACATGCGAAGGGGGAATCGAATTTCCAATAATTTAAAAAGGAATCGGAATTT 

TACATTTTCTCGCGCTTTTGCGGAGAAGCCGTGATATGGTGACTGTATTTATGAAATTTGCTTTTACGCCACTTCCATGAACTGAAAATTGGTTTTCACTGCCTGTTTTTGG 

TCGATATAATAATGCTGAGATATATATATGATATGTAATGTCTTTCTATATATTGTAGCATTTACGAAGGGTTATAGCTTGATATAAAAAAAGAAAGAAACTTTGAACAAAC 

TCAAGTATATAAAAAGGTTTCAATAAAATATATAAAGTCCATTAGCTGTAGCTTACAGGAGCATAAGTTTTCTTGTTTAAGACGAAGCTTTATCGAACTTTATCTTATAGAT 

ACATATATCTATTACTAGGATAAGTAAGTAATTACAATTATTTATGCACGCTGCATTGACATATGCCAAACACTTAAAGTATTCCCACCCAGCCGCACTTGCTACATTTATC 

AGCAATTGAATTAATTTCAGGTTGCGGTTCAGCTGAGTTTGGTTTGTAAAATACATTATAATGTTTAATAATCAGCAGAGAGCAGGAAAAATATGTATTAAGTTCGGCATTG 

TGTTAAACTTTAAGCGAATAGTTAACACAGTGTTATTGGTTACCCGCTGCTTTTAATAACTTTCCCTTTCGATGCCCACACAACCTGAGTTATTTTCCTTAATGGTCTGAAA 

AAATGAAGAAAAAAAAGTTTGGCTCAAGTTTATTGGTTTCGCGCTGCTTTCATCAATTGCTGCACGCAAATGCTTCTCAAAAAAAAAAAAAAAAAAACTAACTCTTCATTTA 

TGGCATTTCTTAGTTGCGAATAAATTCCGATTTAAGCTGGTTGAAAAAAACCCATCCTTTCAACGGTGACTAAACATATTTTAAGCAGACCGCTCAAACTCAAACCGTGGGT 

GGTTAATTAAATAAGAGCATCGATTAAGTCAACTGTCTTTTTATCTGTCCCACTTTTAGGGTCACCCGGTGGACAGCTAATTGCAGCTAACCCCATTGGCATTTAGCAAACG 

GAAAATGCATCGTGACCGCTGCAACCACATTGCACCACTACCACCAGCAGCATCAGCGCCACCACTTCCACCTCCAGCAGCAGCAAGGGAATCCGCACCGCTGAAACTGAAT 

CTGAAGGCATCCCAGTGGCGGCGACACACGGAATCCTTGCCAAATGCGGCCAAATGCGAGGCGCTGGCAAGTAAATAAGCGTAATCCTTTGCGGGCACGTACCACCACTACT 

ACCTCCTGTTCGGAGTAGCTCGTAAATATTTGGCCAAGTCAGTCATGGAGGCCATCATCATGACGACCCTGCCCAACCTGACAACGGATGCAGGTGACAGCAGCTTCTGGCT 

AACCGGTGCCCTCTCGCTCTCCGAGATGTTGGCCAACTCGAGCCACAGCCATTCCACTGGGAGCACAACCTCGACGGCTGGGAGTTCGGCCACCGAATCATCCGCTGTGAAT 

GTAGGCAAGGACCACGACAAGCACGTGAATGACAGCGTTTCCACGGGTCTGAGGTAAGTGCATGTGCCTAGGCTTTTCCATTTCCATGGCAACACACACACAGATTCCATTA 

AGCTCATGACTGCGGGCTCAACAAGTCATCGCGATAAGCTCAGCTTCATTTTCCAGGAGATGCATTCAACTAATGTGCTTAGAATTCTACTATTGTACATAACTAACATATA 

GGGAATATAATCAGATTACTTAGAACCTCAGAACAGTCAATACAATCCACTTAATATAAATGAATTAAAAGTATCTATCGAAACTGAAAAGAAGAGGATCATTAGCTTGGGT 

ACAACATTTATAAGAATTTTGGCTTATGAAGTTGCAAAAAGAATTCTTAAAAAATTCAATACAATGTCAAATAGTTCTGAAACCATAAATTATTAACAATTAAATATTAAAT 

TATTTATATTGATTTAATTCATACATTTAAACATAAGTTACACTACTGCAAGGAGGAAAGTATCAGATGACTTGAGCTATACTTTCATAAACTTAGCTTAGAGCAATCTAAA 

CCAAATTGAAAACTCATTGAAACCCATTTTAATTGAATAAATTGAAACATTTACTTTTCCCATTTCCCCAGCAATTACAGTAATTACCCCAGCTACATTCACTACAGGGACA 

AGTACGACCTGAGCTACATTGCCAAGGTGAATCCCTTTTGGCTACAGTTCGAGCCACCTAAGTCGAGCACCTTCCTGATTATGGCCGCCCTGTATTGCCTGATTTCGGTGGT 

GGGATGTGTGGGCAATGCGTTCGTCATCTTTATGTTTGCCAACCGCAAATCTCTGCGGACTCCTGCCAATATCCTGGTGATGAATCTGGCCATCTGCGACTTTCTGATGCTC 

ATCAAATGCCCGATTGCCATTTACAATAACATCAAAGAGGGTCCAGCTTTAGGAGACATTGGTAAGCTTAAAACTTGGGGAAATAACTTACTAAATATCTATATAAATGAAA 

TTCATTCGGCTTGCATTTATGTCAAAAATCGATCAACAAAAGCAATCTACATAATAAATAAAATTAAATTGGCCCAAAATGCTTGGACTGATTTCAGCTGTTTCTTGCAGAA 

ATTCATTTGTGGGATTCTCTTTTCCCGAAAGCAATTAATAGAATTTAAATGAAACAATAAAAGTTCAATTTGTAGCCTACACATCTTGGTTCATGAAAAACATATTCCAAAA 

TGTAATGGCTATTAAAATTTCACCGAGCCTTTTGTTTGCCTTGAAACGGGTGGTTCATTAAAAATCCTACTGACTTTCCGTCCCACAGCCTGTCGCCTCTATGGATTTGTGG 

GTGGCCTAAGTGGCACCTGTGCCATCGGTACCCTCACCGCCATCGCTTTGGATCGGTACAATGTGGTGGTGCATCCACTGCAACCGCTGAGACGCTGCTCCCGCCTGCGATC 

CTACCTGATCATCCTGCTGATCTGGTGCTACAGCTTCCTGTTCGCTGTGATGCCGGCCCTGGATATCGGACTATCTGTCTATGTGCCAGAGGGCTTCCTCACCACCTGCAGC 

TTCGATTATCTGAACAAGGAGATGCCAGCTCGCATTTTCATGGCACTGTTCTTTGTGGCTGCCTACTGCATTCCACTGACCTCCATTGTGTACTCCTACTTCTATATACTGA 

AGGTGGTCTTCACAGCGAGTCGAATACAATCGAACAAGGATAAGGCCAAGACGGAGCAGAAGTTGGCCTTCATTGTGGCGGCTATCATTGGTTTGTGGTTCCTGGCCTGGTC 

ACCGTACGCCATCGTTGCCATGATGGGTGTCTTCGGCCTGGAGAGGCACATAACGCCACTGGGATCGATGATACCCGCGCTCTTCTGCAAGACAGCTGCCTGCGTGGATCCC 

TACTTGTATGCGGCCACCCATCCGCGGTTTCGTGTCGAGGTGCGAATGCTCTTCTACGGACGCGGAGTGCTGCGAAGGGTCTCCACGACCAGATCATCGTACATGACCCGAT 

CACGATCGTCGTTCACCCACCGATTGAGGACGAGCACCACCGGTGAGGGCGGAATGGGTGATCACCGCATGGAAAACTATCTGATGAACAACAATCTGATGATGGTGCCCGA 

GGAGACGGAGGAGAACGAGGAGATCGTTGTGGTGGCCGAGATCAACAATTCGATTAGCAGCGTCATGGAACAGAGTAAGTTCTGAAGGCTGACGATTATGCCAGTTAAGAGA 

ACGGAAATTCTGTCAATCTGTTTCGAACCAACTAACATAAACATTTTTAATCAGTTTTAAGTTTTACAGACGTGTTTTATATAATTTGAAAATTAGTCAAACCTATGATCAC 

GATGATACTGAAAACTTTGCTTGGAATATCTGAAACTCGCCTGTTTAATTTCCACTTTTTCCAATTGAAAATTTCAATTTGATTTTTGCCCAGGTAATTTATAAAAAAACCT 

TTCCTATCTCGAAAAGTAAACTGTTCGCTGTTAATTGAAAATGTTTGTGCACTGCACTGCAAGGCATATCAAAATGGAAGCTATCAATAATCGTAAATTGAATGGATCGGAC 

TCAAGTTTCATTGTTGAACTAATCTAATCTAATTTGTATGTAGAAATAATAGACCGAACTGAAATCAGCAATGGCAGCAAAGTAATTTCGCTCGGCAATGATATATGCCAAT 

GGCATTGTTCATCTGAGTGAATGCGTTGTCATTGATGGACCTACAATTGAAATGAAAATAATAATAATAATAAATAGAAATGAAGCAGGGAAATCCGCCGAGCAGTGGGAAA 

TTGGCAAAGGCACAACGACGACACAGTGGTGATAGTAAACATATTTAATGAGCTTATGTGATGTTTAGACAGGGGGAGTAGGCGATCAGAGCAGGGAATGGATTAGTTAGCC 

TATATACATATATATATACACAGGCATATGTACATATGTATAAGACATCGGAATAGCAGGCCAATTAGAGAGGCAGGGCCGAAATTAGTACAAAGAAAGAGGGTGGCGTGGG 

TAGGGGCCGAAGTGGAAATTCAATAGTCAGATTATACATCATTTCAGGGAAAGTCACTCGGACCCAAGGTCAAGCCGTGGTATTGGATGTTAACAAGCGGATGTGCAGATTG 

TTTGCATTGTAGTTTAATATTGGAATAACTCACAATCAAAGCAGCCTGCGGACATCTTGTATTTCTGAAATGGAGTC 

(SEQ ID NO: 34) 

Exon: 1001.. 1464 
Exon: 7788.. 8022 
Exon: 8083.. 8341 
Exon: 8920.. 9245 
Exon: 9609.. 10501 
Start ATG: 7845 

Transcript No. : CT17820 

CTGGGATGCAACGGATGCGGATGCCTTAAATCCCCGAGCCGTTTGGCGGTGTTATTGAAAATTGCGTATTCGCCGCGTATTACCAAGTAATAGCTTGTCCAAAGCCACTACG 
CTGTCAAAGGACGGGCTTAAAACGCTACTTGCATTTTTACGTACTATACATAGATAAATTTTCCGGGTAATCGGCGGAAATTGAGTTAAAGTTTTGTGTCCGGTTGGCGCCA 
GTTGGCAATTGGTGCTGGCCAACTTTCGGACCCGAACGATGACAAACTGAAAAGTAAACTTTGTAAAGACTCAGCACCTTACAAACAAATTCAAGTGTTTCGCGAAAAAAAA 
TAAAAAATTTTGAATTGAAGAGGACACAGTTCGGTGAAAAAAAAATATTTGCAATGCACAATGGAAAATCAGCAAAGTGACCTCCAAGTTTACGAAAACAGAAACTAACTCA 
ATAACCATCCCTGCAGGGTCACCCGGTGGACAGCTAATTGCAGCTAACCCCATTGGCATTTAGCAAACGGAAAATGCATCGTGACCGCTGCAACCACATTGCACCACTACCA 
CCAGCAGCATCAGCGCCACCACTTCCACCTCCAGCAGCAGCAAGGGAATCCGCACCGCTGAAACTGAATCTGAAGGCATCCCAGTGGCGGCGACACACGGAATCCTTGCCAA 
ATGCGGCCAAATGCGAGGCGCTGGCAACTCGTAAATATTTGGCCAAGTCAGTCATGGAGGCCATCATCATGACGACCCTGCCCAACCTGACAACGGATGCAGGTGACAGCAG 
CTTCTGGCTAACCGGTGCCCTCTCGCTCTCCGAGATGTTGGCCAACTCGAGCCACAGCCATTCCACTGGGAGCACAACCTCGACGGCTGGGAGTTCGGCCACCGAATCATCC 
GCTGTGAATGTAGGCAAGGACCACGACAAGCACGTGAATGACAGCGTTTCCACGGGTCTGAGCAATTACAGTAATTACCCCAGCTACATTCACTACAGGGACAAGTACGACC 
TGAGCTACATTGCCAAGGTGAATCCCTTTTGGCTACAGTTCGAGCCACCTAAGTCGAGCACCTTCCTGATTATGGCCGCCCTGTATTGCCTGATTTCGGTGGTGGGATGTGT 
GGGCAATGCGTTCGTCATCTTTATGTTTGCCAACCGCAAATCTCTGCGGACTCCTGCCAATATCCTGGTGATGAATCTGGCCATCTGCGACTTTCTGATGCTCATCAAATGC 
CCGATTGCCATTTACAATAACATCAAAGAGGGTCCAGCTTTAGGAGACATTGCCTGTCGCCTCTATGGATTTGTGGGTGGCCTAAGTGGCACCTGTGCCATCGGTACCCTCA 
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CCGCCATCGCTTTGGATCGGTACAATGTGGTGGTGCATCCACTGCAACCGCTGAGACGCTGCTCCCGCCTGCGATCCTACCTGATCATCCTGCTGATCTGGTGCTACAGCTT 
CCTGTTCGCTGTGATGCCGGCCCTGGATATCGGACTATCTGTCTATGTGCCAGAGGGCTTCCTCACCACCTGCAGCTTCGATTATCTGAACAAGGAGATGCCAGCTCGCATT 
TTCATGGCACTGTTCTTTGTGGCTGCCTACTGCATTCCACTGACCTCCATTGTGTACTCCTACTTCTATATACTGAAGGTGGTCTTCACAGCGAGTCGAATACAATCGAACA 
AGGATAAGGCCAAGACGGAGCAGAAGTTGGCCTTCATTGTGGCGGCTATCATTGGTTTGTGGTTCCTGGCCTGGTCACCGTACGCCATCGTTGCCATGATGGGTGTCTTCGG 
CCTGGAGAGGCACATAACGCCACTGGGATCGATGATACCCGCGCTCTTCTGCAAGACAGCTGCCTGCGTGGATCCCTACTTGTATGCGGCCACCCATCCGCGGTTTCGTGTC 
GAGGTGCGAATGCTCTTCTACGGACGCGGAGTGCTGCGAAGGGTCTCCACGACCAGATCATCGTACATGACCCGATCACGATCGTCGTTCACCCACCGATTGAGGACGAGCA 
CCACCGGTGAGGGCGGAATGGGTGATCACCGCATGGAAAACTATCTGATGAACAACAATCTGATGATGGTGCCCGAGGAGACGGAGGAGAACGAGGAGATCGTTGTGGTGGC 
CGAGATCAACAATTCGATTAGCAGCGTCATGGAACAGAGTAAGTTCTGA 
(SEQ ID NO: 35) 



MHRDRCNHIAPLPPAASAPPLPPPAAARESAPLKLNLKASQWRRHTESLPNAAKCEALATRKYLAKSVMEAIIMTTLPNLTTDAGDSSFWLTGALSLSEMLANSSHSHSTGS 
TTSTAGSSATESSAVNVGKDHDKHVNDSVSTGLSNYSNYPSYIHYRDKYDLSYIAKVNPFWLQFEPPKSSTFLIMAALYCLISVVGCVGNAFVIFMFANRKSLRTPANILVM 
NLAICDFLMLIKCPIAIYNNIKEGPALGDIACRLYGFVGGLSGTCAIGTLTAIALDRYNVVVHPLQPLRRCSRLRSYLIILLIWCYSFLFAVMPALDIGLSVYVPEGFLTTC 
SFDYLNKEMPARIFMALFFVAAYCIPLTSIVYSYFYILKWFTASRIQSNKDKAKTEQKLAFIVAAIIGLWFLAWSPYAI VAMMGVFGLERHITPLGSMIPALFCKTAACVD 
PYLYAATHPRFRVEVRMLFYGRGVLRRVSTTRSSYMTRSRSSFTHRLRTSTTGEGGMGDHRMENYLMNNNLMMVPEETEENEEI VWAEINNSIS3VMEQSKF* 
(SEQ ID NO: 36) 

Name: Rhodopsin 3 and 4-like (Rhodopsin 7) 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384543 

TATGCGCAACTCGACAAACATCTTCCTAACCAACCTGAGCATTGCGGACCTGCTGGTGCTGCTCGTCTGCACGCCCACCGTCCTCGtGGAGGTCAACACTCGCCCGGAGACC 
TGGGTACTTGGCCACGAGATGTGTGAGTGGCTTCAGCTCCAGTGCTCC1GGTCTATTTGTTCTCCCCAACCTAATTAACTGCTGTTCTATCCCAGGCAAGGCTGTGCCGTTC 
GTGGAGCTCACGGTGGCCCACGCCAGTGTGCTGACCATCTTGGCCATCTCGTTCGAGCGCTATTACGCAATCTGCGAGCCATTAAAGGCCGGCTACGTCTGCACCAAGGGGC 
GGGCCATCCTCATCTGCGTCCTGGCC1GGGGCATTGCTGCGCTCTTTACGAGGTAAGCGATTAGGCTCGTACTTGATTTTCTTAAATTTATTTGTAAACTACCATTATACTT 
TAAATATCTTAATTAGTTTGGTCGCTTTTCATCGCATGCTTTTTGGTGTTTGTAGTCCTACATTTCGTATGTACATATGGTATTATCCTGTTTAGATCTACTTCGTTTGTGG 
CCTATATCCGTTTATTCATTTCGCCCAGTCAAGCATTAAGTTCCTCGCTCATTAATTATAAGTGTCAATCGCCGTCATTTTAATGATTTGTGTTTTGTTATTGGGTTTTGCA 
CATTTTTCCCCGTTTTGTTTATTTTCCATTATGACGTGGAATTTCTGGTCTTGGTGTTTGGCCTTTTGTGCGCCATCTACGCTGGCGGCGTGTCCATTTGTCCATTTACATG 
CTAAACTGCGACTGTCGCCTATCTATTCCATTTCCACTCCTGTCCTACCACTTCTCCTGCTTCACTCGCTTCATCGTTGCCATTGCCATTTCCATTCCCAATCCCATTCCCA 
TTGACATCCGTCGCCTCAAGTTGCGGACGGCTTTTGTCCAAGTTCCGGAAGCTTGCCGTTATCAGCGCCCTGTCATCTTTAATAAGCACCGATGTCCGCTATCATGGATTTC 
ATTTCATTGACACTTTATTATGGTGCGCTCCGTCCAAGGACGAGCGGCGACCCTTTTTAATTTTGCAAATCTTGCCGCGTCCATAACAATGAAATAAACAGCACATTGTCGA 
GCTGGCCACCGACCCCTCAGCACTCCGGGAGTCATGGGGTCATCTATGGTTGGCAGCTGGGTCCGCAGGACACCCGCCTACGGGCCGAGACCCATTAATTACGCCACACAAG 
AAGCCCGAGGCCAAAAAAAAAAAAAAAAAAACGGGCAAACATTTTTAAACACATTCAAACAGACGCGCCACACGACGCCCATGGGCGCAAAAGGAGGCGTGTCAAATAGCTG 
ACGCACGTACCAAAACGCCCGGATCATCTTTAGTGGGAAAAAAAACACGGCCATCGCCTGTTGACTGTCATGGCATTTCGTTTTCGAGGTCTAGCCAGGGAGGATTTGGGCT 
GGGGCTGTGTTTAATCAGCGGCTCGGATAACTCAAAATAATCAAATATTTAAACTTGTCTGTTTGCTGGATATATTTACAAAGTCAACAGAGTATATCCCACCATAAGTTCG 
CCATCTACTCACTTATCCCCTTGTAAGTGAAATGTAATCACATCAATGAAATTTGGAATATTTTGAGCTTTAGTTAAATGTATTTTAAGATGTATCGATATAACTCATACAG 
ATACTTTTTTAAAATGAGTGGTTTCATTTTTATTAAATTTCTGTTATTATAATAATTTTAAATGCAATGTTGATATTATATTTCAAATTGGCTTAACGCCTACAGATTAGAA 
TACACTAAAGATACACGGTATACATTTTTTTTCTAGACAGAAATATACTTTAATTCAACTAAATTAATATATATATGTACCTCTCCCATCTTTTAGCTATTAAATAGTCGAG 
GAACTCGACTATAGTATTATATCCGGTTTTGGTTTAAAAACCAATCATTGTTTTTTTTTAACGACACAGTACTATTTAATATGGGCTTCAACATATATTTTAAATGATGAAC 
CCAAGAAATGAAAAAACCATAAAAACAAGCGGTCATGAAAATCGCGTCCTGGCCAGACTGGTACATTGGAAACTTGAACTTCATCCCAGACTGGGTTAACCTTTTGCCCATC 
GCCATTCTCTCATCATCACAAACAGCTTCTCTTCCTGCCTGCTGAAACCTCTTCTTTTTCCCGCCACTCTTGGCCTTTTCCTCCAAAATCCAACCGTATCCATTATCCATTA 
TCCACGCCCCCGGTAACCCCGCCACCTGATTACGTTTCTTGTTTGCTTTCTCTCCCCAATCTGCAATATCCCCCGAGCAGCCCCATTCTGTGGGTGGCCGAGTACAAGCTGG 
CCGAGTACATTGATGGATCGTCGGTGGCCGTGTGCCTGACCCAGGCCATCAGCGACTGGACGCTGGCCTTCTTCCTGATGACCATCTCGGTGTTCTTCGTGGTGCCGTTCGT 
GACCCTGGTGGTGCTGTACGGCATCATCGCCCGGAATCTGGTCTCCAACAGGGCGGCCATGCTGCGCGCCCGTCCCACGAAGCCGGAGCTTAGTCTGAAGGCCCGCAAGCAG 
GTGGTCCTAATGCTCGGCGCCGTTGTGCTGTCCTTCTTCGTCTGCCTGCTGCCCTTCCGCGTCCTCACCCTGTGGATCATCCTCAGCACGGACCAAACACTGCACGATCTGG 
GATTGGTGCGCTACTACAGCTTGCTGTACTTCTGTAGGATCATGTTGTACCTCAACTCGGCCATGAATCCGATTCTTTACAACCTGATGTCGACCAAGTTTCGAAGGGGTTT 
CAAGCGGCTTTGTCAGGACGCAGGGCGATTACTGCTGGAATTGGTGACATTGGGAAGAAGGAAGGAAGACTCTTCTCGCGGGCGCAGAGGTACCTTGTCACTGGGCATGGGC 
ACCAATACGAACACGAATACCAACTCCTCAAATGCCACAGGAGCCACGAGCTCCAGCATTCTCTCGAGAAGCTCCAATCGCAGGTGCAGCGAGGATATCAGTCGCACCCGCC 
TTAAGATCGAGATGCAGATGCCATGTGGCAGCGACCTGGAGGCCATGGCCATGCTGCAGCATTCCACCTTGGGCAAGGGAATTGCCAGGCGAGTGAGTGATAGCCGTCTCAT 
GCCTCTGAGAAATCATCAACCACGACGCCACAAGCCGCAAATAAGTTTCGACGAGGAGTCACTGGAGGAGAATAAGCGTAGTGAAGCGAAAATCCCAACAAAATGTCGAGAA 
AAGCTTCCTGGAATAGCAAGAGAAATAGTAAATCTGACCGAAAATACCCTCTAAGCGAGATGGGCGTGAAAAATCGCCTTTAACCCGTTTACACACTCGCTTCACTTGGCTG 
GAATGTGCCGATGACGCTGATCCCGATCCCGATTCCTATTCCCGCTCCTGGTAAACACAAATCGCTTCCTGAACGCTTTTGGCAGCCATAATCGATTTCAGTCACCCGCTGA 
CAGCTGCCGAGAATCCCGAATCGTAGTTGGGCTGCCGCCTTGGACATTTCCGTACCAAAAACAATCAATTTTTCGCGCTATTAAACTCCAATGMTGTGCCAAAGATAAATA 
ACAGCAACGACAGCCGAACAGCACTAAAACTACCACTACTGGTCAGGGTTAGGAGTCCAGATATCGACTGGCAGTAAGGGGTAAGCCATTAGGTGGCCTTCGAGGAGCGGAG 
CTGGCAAATGGAAATGGCTTAATATGCGTTTGGGCGAACCATCAGGTCGGCGGATAAACATGATTTAGGTGTATTGAAATAGTTTCCGTACGCCGGACCGTACAAAATGGCC 
GCCATTACACGGGCGGCCATCACCGGAAGTCGAAAAAACTGTTGCCTACTTGCCGGCAGCTCAAATAAAAAGCTTAGCTCAATATGCACACAATGTCCACATGTAAACACAG 
CTGTACATACACATACATACCCCCACACGCACACATAGCCAAATATTAATATTTATATTTGTACGAACATATTTATATACACATATTTACATATATATAGTCTGCTTATAAA 
TATGTACAAGGAACCACACGCACACACTCACACACACACGAGAGTTTGTAAGTAAATAAATTTTTTATGACAACTCATGGCGGGACTTTTCCACATTACCCCATCCAATCGG 
AATCTGTAATTTATATAATTCTAAGGAAATCGGACAAATGTTCGTCCTCTGTAAATATACGTATTCGTACCATGCGCGTAGCTTTTTAATTTGTTTTCGTAGCCCTATGACG 
TAGACTCCCACTTTTGTAGAGTAAAAAGTAATAAGTAGCTTGAACCCACACCATCCTGCGAGTATGATGAGTATGAGCAGACGCTGCCGTTTCCGTTTCCGCTGTGTGATTA 
AGTTGAGTGCTTCGCCCCCGAAACTTTTCCAGTAGTTAGCCGTTGAGTAGCTTTTAAATATTTGTTTGTTTCCTGCCCGCAGCGCGATAAAACTTTGCAGACAGCGCTAGTC 
ACTCTAGTCCCCGCTAAAACTGATAATATGATTCTTGCAGTCCCATCATCGCCATATCCACCTACAGCGTGGAACCCTACGGGGACGGAACCGATGCTCCCGTTTGCACCAC 
CGCCGCCGATGGTTTCTGGTCGATCTTCTACTTCGTGGGCTGCATCACGGTGTTTTTCTTCCTGCCCTTCGGCATCCTGGTTCTTCTATACGCGGCCATCGCTTACAAGCTG 
CTCCGTCCCAACAACGCCTTCCACCGACCAACCTCCCCGCAGCCACAGCAGCCGTCCGGCGGAGCCACCAGTGGCTCCTCACAGGTGCCCAGCACCAAGGGTAACAGCCATC 
AACAAAGCAACGGGATGAGGAAGCATCGCAAGCAGGTAATCTTCATGCTGGTGGCCGTGGTGTCTAGCTTTTTTGTTTGCCTTCTACCCTTCCGGGCCTTCACCCTCTGGGT 
GATCCTGGCCAGCGCCGAGGATGTCGAGGGTCTGGGCATTGCCGGCTACTACAACCTGCTGTACTTCTCGCGCTTCATGCTCTACCTAAACTCAGCCATGAACCCGATCCTC 
TACAACCTGATGTCCTCCAAATTTCGCAGCGGCTTCTGGCGGCTGCTGCTCACTTGTCTGGGCCAACGGCCACATCACCACCATCGCCACCACTACCATCAGAGGCAGCATC 
CAACGGCAGGCGGAAGTGGGCGCAATGCGTCCACGCGACAGGAACAGGATGCCGAGGAAGGAGCTGCGCTGGCGGGAACGACCAGCGCCCGACATCCACGTCGCACACTCCG 
CCGCGAGGCCACCTTCTTGATCAACTCCATATCCACCTCCTCGGGTACGGATCGCACCACATCATCATCGGCGTGGCGCAGCAACAGTCTGTCCATTTCCGGTCTGAGCGAA 
CGGGAGCGCGGCATACTGGGAGCCGCTATCATCGGCACAACGGCTGCCACCGTTACAACCGCCTGTCTGCAGGAGCGACGCGCCAGCAAGATCTGAATCCCATCCTAAATAT 
GCATCCTTCGAGTCCCTGTCCACTGCCAGCAACCGTTAAAGTATTTAGTTGAATTCCAACAGCAAATAGTCATAGTTTTAGAAAAATTTTGTTGTAAAGACTGAAGTACCTC 
AGAAGAAGTCAATAAACGCAAGAGAAGATGTTTTTGAAAATCGAAACTAATTTTCTTCGGGCTTCGAAAAATTGAAGTTTGCAAGATTCGAAATAGCTTTAACTTGATTTTT 
AGAACAGAAACGAGTATGAACTTTTCATACGATCATTTTTATCGCACCTATTGTGATTTAATGGGAGATAGGAGGAGACCAATTGGGATATGCAAAAATAACGCAACTATTA 
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TTATCTTTGTATGTACGATTAATTTTAATCTCTAAAGAGTCAAAGTACTTCAATAATTTTACAAATCTGTCGGAGGGAATGTAAGCTTGCCCAAATCAAATGGAAGCTACAA 
TCGTTGTCAGATGGAGATATTTTATGTTAGAAGGAATATCTCGGATACGAAATATTTAATTTAAAATTCCTTTTCATATTATTCTTTTTAAAAAATAATAAGTCCTGTATGT 
ATACAATGCAATACATAAAGGTAGAGTATGTCCTAAAAAGAAAAATATCTTTCCGAATATATGTATATAATTTATAAAATAAGTGGATACATACATTTTTCGTAAGTATCCT 
AGAAGATTTGCCAACGTATCAGGTTTACAAAAATTGATTACCTATGTGCAGTTAACTTAACGGCAGTGCACTAACTAATCTGATTTGTGAAATAATGCAGATATAAAACATT 
TTTAAATAGAAATATATTTATACATCTGATTCTTCAACATCTAAGCACTCGTATAGAGTGCTTTGAATATATTCTAAATACATTGGCATGATCCAAGTTTTAAATCAGCCAA 
AAACTTGCTCCAAAAACAGGAAGATTCTACACAAATCGCCTCGCCCTTAAAGTTTCTGCCAAGTATTTGGGTGCGTGCGGATCAGTTC 
(SEQ ID NO: 37) 

Exon: 2.. 134 
Exon: 208.. 388 
Exon: 2321.. 3216 
Exon: 3631.. 3664 
Exon: 4521.. 5472 
Start ATG: 2 

Transcript No. : CT18539 

ATGCGCAACTCGACAAACATCTTCCTAACCAACCTGAGCATTGCGGACCTGCTGGTGCTGCTCGTCTGCACGCCCACCGTCCTCGTGGAGGTCAACACTCGCCCGGAGACCT 

GGGTACTTGGCCACGAGATGTGCAAGGCTGTGCCGTTCGTGGAGCTCACGGTGGCCCACGCCAGTGTGCTGACCATCTTGGCCATCTCGTTCGAGCGCTATTACGCAATCTG 

CGAGCCATTAAAGGCCGGCTACGTCTGCACCAAGGGGCGGGCCATCCTCATCTGCGTCCTGGCCTGGGGCATTGCTGCGCTCTTTACGAGCCCCATTCTGTGGGTGGCCGAG 

TACAAGCTGGCCGAGTACATTGATGGATCGTCGGTGGCCGTGTGCCTGACCCAGGCCATCAGCGACTGGACGCTGGCCTTCTTCCTGATGACCATCTCGGTGTTCTTCGTGG 

TGCCGTTCGTGACCCTGGTGGTGCTGTACGGCATCATCGCCCGGAATCTGGTCTCCAACAGGGCGGCCATGCTGCGCGCCCGTCCCACGAAGCCGGAGCTTAGTCTGAAGGC 

CCGCAAGCAGGTGGTCCTAATGCTCGGCGCCGTTGTGCTGrCCTTCTTCGTCTGCCTGCTGCCCTTCCGCGTCCTCACCCTGTGGATCATCCTCAGCACGGACCAAACACTG 

CACGATCTGGGATTGGTGCGCTACTACAGCTTGCTGTACTTCTGTAGGATCATGTTGTACCTCAACTCGGCCATGAATCCGATTCTTTACAACCTGATGTCGACCAAGTTTC 

GAAGGGGTTTCAAGCGGCTTTGTCAGGACGCAGGGCGATTACTGCTGGAATTGGTGACATTGGGAAGAAGGAAGGAAGACTCTTCTCGCGGGCGCAGAGGTACCTTGTCACT 

GGGCATGGGCACCAATACGAACACGAATACCAACTCCTCAAATGCCACAGGAGCCACGAGCTCCAGCATTCTCTCGAGAAGCTCCAATCGCAGGTGCAGCGAGGATATCAGT 

CGCACCCGCCTTAAGATCGAGATGCAGATGCCATGTGGCAGCGACCTGGAGGCCRTGGCCATGCTGCAGCATTCCACCTTGGGCAAGGGAATTGCCAGGCGAGTGAGTGATA 

GCCGTCTCATGCCTCTGAGAAATCATCAACCACGACGCCACAAGCCGCAAATAAGTTTCGACGAGGAGTCACTGGAGGAGAATAAGCGTAGGTTAGGAGTCCAGATATCGAC 

TGGCAGTAAGGGTCCCATCATCGCCATATCCACCTACAGCGTGGAACCCTACGGGGACGGAACCGATGCTCCCGTTTGCACCACCGCCGCCGATGGTTTCTGGTCGATCTTC 

TACTTCGTGGGCTGCATCACGGTGTTTTTCTTCCTGCCCTTCGGCATCCTGGTTCTTCTATACGCGGCCATCGCTTACAAGCTGCTCCGTCCCAACAACGCCTTCCACCGAC 

CAACCTCCCCGCAGCCACAGCAGCCGTCCGGCGGAGCCACCAGTGGCTCCTCACAGGTGCCCAGCACCAAGGGTAACAGCCATCAACAAAGCAACGGGATGAGGAAGCATCG 

CAAGCAGGTAATCTTCATGCTGGTGGCCGTGGTGTCTAGCrTTTTTGTTTGCCTTCTACCCTTCCGGGCCTTCACCCTCTGGGTGATCCTGGCCAGCGCCGAGGATGTCGAG 

GGTCTGGGCATTGCCGGCTACTACAACCTGCTGTACTTCTCGCGCTTCATGCTCTACCTAAACTCAGCCATGAACCCGATCCTCTACAACCTGATGTCCTCCAAATTTCGCA 

GCGGCTTCTGGCGGCTGCTGCTCACTTGTCTGGGCCAACGGCCACATCACCACCATCGCCACCACTACCATCAGAGGCAGCATCCAACGGCAGGCGGAAGTGGGCGCAATGC 

GTCCACGCGACAGGAACAGGATGCCGAGGAAGGAGCTGCGCTGGCGGGAACGACCAGCGCCCGACATCCACGTCGCACACTCCGCCGCGAGGCCACCTTCTTGATCAACTCC 

ATATCCACCTCCTCGGGTACGGATCGCACCACATCATCATCGGCGTGGCGCAGCAACAGTCTGTCCATTTCCGGTCTGAGCGAACGGGAGCGCGGCATACTGGGAGCCGCTA 

TCATCGGCACAACGGCTGCCACCGTTACAACCGCCTGTCTGCAGGAGCGACGCGCCAGCAAGATCTGA 

(SEQ ID NO: 38) 



WNSTNIFLTNLSIADLLVLLVCTPTVLVEVNTRPETWVLGHEMCKAVPFVELTVAHASVLTIIAISFERVVAICEPLKAGYVCTKGRAILICVIAWGIAALFTSPILWVAE 
YKLAEYIDGSSVAVCLTQAISDWTLAFFLMTISVFFVVPFVTLWLYGIIARNLVSNRAAMLPARPTKPELSLKA^^ 

HDLGLVRYYSLLYFCRIMLYLNSAMNPILYNLMSTKFRRGFKRLCQDAGRLLLELVTLGRRKEDSSRGRRGTLSLGMGTNTNTNTNSSNATGATSSSILSRSSNRRCSEDIS 

RTRLKIEMQMPCGSDLEAMAMLQHSTLGKGIARRVSDSRLMPLRNHQPRRHKPQISFDEESLEENKRRLGVQISTGSKGPIIAISTYSVEPYGDGTDAPVCTTAADGFWSIF 

YFVGCITVFFFLPFGILVLLYAAIAYKLLRPNNAFHRPTSPQPQQPSGGATSGSSQVPSTKGNSHQOSNGMRKHRKQVIFMLVAVVSSFFVCLLPFRAFTLWVILASAEDVE 

GLGIAGYYNLLYFSRFMLYLNSAKNPILYNLMSSKFRSGFWRLLLTCLGQRPHHHHRHHYHQRQHPTAGGSGRNASTRQEQDAEEGAALAGTTSARHPRRTLRREATFLINS 

ISTSSGTDRTTSSSAWRSNSLSISGLSERERGILGAAIIGTTAATVTTACLQERRASKI* 

(SEQ ID NO: 39) 

Name: Thyrotropin-releasing hormone receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000012807287 

CAGAAAGTGTAGGGAATGTAGGGAAATGCAAAGCGAGAGCCTTTGGCTGCTCA'CTGACTGAAATATCACCAGACTCGGCTCGGTTCTGCCGCTCAAGTGGCAGTAAATAAAA 
GTTGTGAGGAGCCTTGGCGGCATGAGTAATAATGACCGAAATCAGAGCACCATCACCATCACCAACCATCGCTACCACCGCCCAACGCGTTGCTCATTTTATTAAAATAACA 
AAAATATTTTGCCGCTCGCAGCCTTTTTACGATGCCGAACCGACTGGCGGCCCATTTGTTATTTGGCGGTCATTGCCACCAAATGGTGTCCTGTCGCTCTTTCTCGTCCTCG 
TCCTCGTCCTTGTCCGTTCCACTGGGCCAACTCGTGTGTCGTGTGTGTTTTGACCAGAGTTATGCAGCTGCGGATGGGCAAAAATGTTTTATTGCCAGCTGGGGAATGCCAA 
AGTGGCAAGAGGCAGATTGCCCGATCTTAGCTGTTTAAATAAATTTAAGAGAGAGAGAGAGAGAGAGAGATCTGGCACGAAGCACTCCATCACTGGTCCAGCGATCCGTTAA 
GCCATCGAGGGTCCTTCGAATGAAATGCCAGCGCTGACAAGGCAAACACTGGAGATTTGTATAATAAGGGGTGGGTGGTTGTGCCGAAATTTGGCACTGATAGGCAGGACAC 
TTGTCAGGCTTGGCGCAAACAAAACTGACATTTATATTGATGTGTCCCTGTACCCAATTAGATGCCCATCATTATGAGAGCAGGCTATCTCTGCTCTCGTGTCCTTCCCCCT 
TCTATTTTGCGCCCGGGAAGTCATATATCGAACATTAGGCTCTGTTTGTGTGCAGAGCAATTTGGCGGGAAGTGGCAGCAGGAGGACGCACATGATATTAGTAATAATGTAA 
TTATATGGTCATAAACAAAATAATAAATCAATAAACAACACACACGCACACGCAGGCGGCACACGAGTAAAAGTTATCACGGGGGGGAAAAAAATGAAGGAAAAATGGGAGG 
CAGAGCAGAGGAAGTGAAGAAAAGGGAACCGGAAAGGAAAGCCATTCATTTGGCCCCGCGACTCAAGTGCCGTTGTCTACGTTTGCGAATGTGTGTGGTTGTGTGCGTGGCA 
GGACTTCCTGTTGCCACATGGACGGAAACGGCGCCATAATTATGCCATAAACCCAACCATAAAGGACATTTTTATAGATGTGCCCGAGTTAATCCCTTTGTTGTTCCACTCG 
CTACCGCACCCATCTCTCATGTCCTTTTTATCCGCACATCCTTTCAGCTTACTCCACCGCCGCCCTGCTGGCCATCGTGTTCGCCATTCCCTTCGGTATCCGCATGCTGGTC 
CACAAGGATCGAGGACAGTGGGAGGAGTTCGGCCCAGCCTTCTACACCGCCCACCTGGAGCTTTATCTGGGCAACGGCTGTTTGGGTGAGTCTAGGATTTTGGATTTGCAAC 
AGGAGGAGCTTTCAAGGGTCGAAGCTCTTCTAAAAGGTTCAAAAGTTGGACTCAGCTTTAAGTTCACTCGGATATACTAAAAATATATATAAAACAACACGTTATGAAACTG 
TTTATGTACCATAAATGGAGCGTAATGTGCTTGATTTAAAGGAACAGATGTCAACAGTGCTTAAATTGTAATAAAAAGCTATTTGTTTTAGGAGCTAGCCAGACATTATTAG 
TATGGACTGATAAGCATATGTTAGAAGATAATATTTCATACATTTAGATACGCCTATAATTTCATAATAATAAATCATAAAATTTATTCCGTTGGGATCATTTAATCCTTGA 
ACAGATATCTGTTATTTGATAATATTTACAAGGAATATTATCCCATTTGCACTCCTAGGCGTTGGCGTAATGATGTTACTAGTGCTGACCATTGAACGTTACGTGTCCGTTT 
GCCATCCAGGATTTGCTCGACCAGTGATGGGACCACCTGGGTAAGTCGCATTGAAAGTCTGTAGATGGTCTATTTAATTGATCCAACCTAGCGTTGTGGTATTCCTCACCTG 
CTTGGCCACGGTAATCGTCTATCTACCGAGCATCTTTCGTGGCGAACTGATCAAATGCATCCTTGGATCGAGTGACGTATATGTATATTTGCGACGCGACAACACTATCTAC 
CAGCAGACCATCTTCTATCGCGTCTACAAGATCATGCTGGAGGTGATCTTTAAACTGGTGCCCACTTTGGTGATCGGTGGCCTCAACATGCGCATCATGATGGTCTACAGGC 
GGACCTGCGAGCGCCGCCGCAAGATGGTGCTCAGTCGTCCGCATGCCCAGGGACATGGTCACGGACATGGGCACGGACATGGTCATGGACATGGACATGCACATGGACACGG 
CTATTTGAAGGACGACGATCCGCGAAAGTTCGCCGAGGAGCGGCGCTTGTTCCTGCTGCTCGGCAGCACATCGATTTTGTTCCTGGTCTGCGTCTCGCCGATGGCCATTCTC 



Start ATG: 1 



FIGURE SHEET 20 



WO 01/070980 



PCT/US01/09341 



21/89 

CACATGACGATCGCCTCGGAGGTCTATCCGAGCTTCCCCTTCCAGGTGTTCCGTGCCAGCGCCAATCTGCTGGAGTTGATTAACTAGTCGCTGACCTTCTACATCTACTGCC 

TGTTCAGCGAGGATTTCCGCAACACCCTGGTGAGGACGATCAAGTGGCCCTGGTTGAAGGGCAAGTTCTGCCACCAGGCCGAGCACGAGGTGAGTGCCAGTCCACCGGCCAC 

AGCCGGAACCGTAGCGGTAGCCGGAACCGGAAATGGCCACGTTTCTATTTTTCATCCGGCAATACCGGCGCTGACTCTCACACCCGCAGAACCCGACGAACGGCCCCGGTGT 

GCCAATGGCGTGCTTCACTAAGGTGGACAGGGGCCATCAAAAGCACCACATAACTACCACCTCGGGCCTGGGACGATCGAGTAGCATTTAGACAATGCCTACCACCCAACAA 

CAACACCACCACAACCCATCCGCTACTCGAGATCCGAACTCCGAACACCGAGAACCCTCCCAGCCCAGAGTAACCCCACCACCACCACCCAACTCCAAAACCGTCATCACCA 

CITCCACTTACCCCCCAGGTCGTCGAGGTTAAAGGGTGCCCTTTCTCTCGCTTGAGATCCTGCGACATTTGAACGTGTTCAAGTATCGATGAACGAAAAATGATATACGATA 

AACAACTAACGATTAACGATAATTGATATTCAGCGCCACGCATTGCGTATATAAGTGTCTGTGTGTGTTCCTTAAACATATATATTTAAATATATGGATAAACATTTTTTTT 

TTTTTAGCACTAGAATTAGGTACTTAAAAACCAAACTTAATCGAAAACACATCCGTAATTGGTGAAATAATTGATATCGATAGGTAAAACCAGTTCGACTTTCGATCAGCAT 

TTGCGCCGAATGATTTCACCACAATGTAGCTTGCGGAATGTGTATGTCGCAGGATCAAGATCCTCAAGCCACACGTATAGATGCAGATATGATAGTTTTATCAATGGATAGA 

TTGCTTGCCGTATTGATAACCAAAGAGCTCGTTTACTCTAGCATTAGATGCAGACAGAAACAGGAACAGCGACAGGACAACGACAGACAGACATATTGACATATAGACAGGT 

ATAGACAGGTGAGATGGAACAATAGCGAGAGAGAGAGAGAAAGAGTTTTACGATGATAATATGCAGACATACAATACTTATATATACACTTGCATATGCCTATATGTGAATG 

ATATTTATGTAAACGACGTATTTATGTATACACGCGTAACTTTAAGCGAATATATTTATATATGCATATATGTATCTATTGATTATAGCATTTAAGTGCGCGCAAGAGGAAT 

CCGGATTCCAACC 

(SEQ ID NO: 40) 

Exon: 1001.. 1088 
Exon: 1280.. 1429 
Exon: 1851.. 1944 
Exon: 1996.. 2821 
Start ATG : 1001 

Transcript No. : CT18637 

ATGGGAGGCAGAGCAGAGGAAGTGAAGAAAAGGGAACCGGAAAGGAAAGCCATTCATTTGGCCCCGCGACTCAAGTGCCGTTGTCTACCTTACTCCACCGCCGCCCTGCTGG 
CCATCGTGTTCGCCATTCCCTTCGGTATCCGCATGCTGGTCCACAAGGATCGAGGACAGTGGGAGGAGTTCGGCCCAGCCTTCTACACCGCCCACCTGGAGCTTTATCTGGG 
CAACGGCTGTTTGGGCGTTGGCGTAATGATGTTACTAGTGCTGACCATTGAACGTTACGTGTCCGTTTGCCATCCAGGATTTGCTCGACCAGTGATGGGACCACCTGGCGTT 
GTGGTATTCCTCACCTGCTTGGCCACGGTAATCGTCTATCTACCGAGCATCTTTCGTGGCGAACTGATCAAATGCATCCTTGGATCGAGTGACGTATATGTATATTTGCGAC 
GCGACAACACTATCTACCAGCAGACCATCTTCTATCGCGTCTACAAGATCATGCTGGAGGTGATCTTTAAACTGGTGCCCACTTTGGTGATCGGTGGCCTCAACATGCGCAT 
CATGATGGTCTACAGGCGGACCTGCGAGCGCCGCCGCAAGATGGTGCTCAGTCGTCCGCATGCCCAGGGACATGGTCACGGACATGGGCACGGACATGGTCATGGACATGGA 
CATGCACATGGACACGGCTATTTGAAGGACGACGATCCGCGAAAGTTCGCCGAGGAGCGGCGCTTGTTCCTGCTGCTCGGCAGCACATCGATTTTGTTCCTGGTCTGCGTCT 
CGCCGATGGCCATTCTCCACATGACGATCGCCTCGGAGGTCTATCCGAGCTTCCCCTTCCAGGTGTTCCGTGCCAGCGCCAATCTGCTGGAGTTGATTAACTACTCGCTGAC 
CTTCTACATCTACTGCCTGTTCAGCGAGGATTTCCGCAACACCCTGGTGAGGACGATCAAGTGGCCCTGGTTGAAGGGCAAGTTCTGCCACCAGGCCGAGCACGAGGTGAGT 
GCCAGTCCACCGGCCACAGCCGGAACCGTAGCGGTAGCCGGAACCGGAAATGGCCACGTTTCTATTTTTCATCCGGCAATACCGGCGCTGACTCTCACACCCGCAGAACCCG 
ACGAACGGCCCCGGTGTGCCAATGGCGTGCTTCACTAA 
(SEQ ID NO: 41) 

Start ATG: 1 

MGGRAEEVKKREPERKAIHLAPRLKCRCLPYSTAALLAIVFAIPFGIRMLVHKDRGQWEEFGPAFYTAHLELYLGNGCLGVGVMMLLVLTIERYVSVCHPGFARPVMGPPGV 

VVFLTCLATVIVYLPSIFRGELIKCILGSSDVYVYLRRDNTIYQQTIFYRVYKIMLEVIFKLVPTLVIGGLNMRIMMVYRRTCERRRKMVLSRPHAQGHGHGHGHGHGHGHG 

HAHGHGYLKDDDPRKFAEERRLFLLLGSTSILFLVCVSPMAILHMTIASEVYPSFPFQVFRASANLLELINYSLTFYIYCLFSEDFRNTLVRTIKWPWLKGKFCHQAEHEVS 

ASPPATAGTVAVAGTGNGHVSIFHPAIPALTLTPAEPDERPRCANGVLH* 

(SEQ ID NO: 42) 

Name: G-protein coupled receptor-like 
Classification: Gjprotein_linked_receptor 



Celera Sequence No. : 142000013384543 

aataatccaccttcgaaggtaaaccatcttaattatgctttaaaattgctggattatataaacccctaaggtagttcatcccacttgttcaactgaaatcgacccacacaaa 
tggtggttggtatctatctaccagAgttaggagctcgtttaacccaatattgcctattcgtagagatccgcattcctgcgaggagaagtcgaactgccgtagctgagctgcg 
agtcggtcttcttcagaatggaaacgtggtgggtgggtgcatcacccgtccggttggccatcagcgagtcctgtttgtggagcaggccgcgtcgctggtcggcgaaggagat 
gaccgagtcctgcttcaccagttgatggcgcctttgctctggtcgctggtccacaaacgagatcaccgagtcctgctttacgaacttctgccgcttatcgagcagagagctc 
ggtgtggcgccctgtcgccggcccacatccatggaaacgctatccctgtccaccagtgtgtcgttgagaccgtgtcctgccgacctgcgtgcctccatgtaggacacaattg 
agtcctgcttggacagggagaccttggagccattgctgtcctgctggcagaggggtcgccggatcccagaagcatcactgcaggagctctcctgcttgatgaggaactgacg 
cgatccgtgagagtcctgcttaagcagcatctgctgcgacaggttgctgtcctgcttgagcatcatgttgtggcgccgcagcgagttgtgctcgtagatgtaggacaccgcg 
ttgcccttgctgctcggactcttcctcgagctgttctggtggtagtgcgatgacttccgggtgctggtggtgctggaggcgatggccacgttgtcgtagtacctaggtggtt 
aaggtggaaaaagggtttttatcgaagggaacacgatttcttacaaatcttcatatcaattttgaaggccttcacatattttatcaatatatttgaggttagaaattaacta 
acctgaggcgtccgcagcagctgcgcgacttgatgatggccgcgaatccgcgtcggtacttcttgttcactgagtagagtatggggttgatgcagctgttcgaggagcccag 
ccactgggccaccggcatcaccttcttgaggatctcgaactcctcctgcgagatgtccgagccgaatttgatccgcgcaaagatgacatagagcggcagccaggacaaaaca 
aacaggatgaccacggccaccaacatcttgatgaccttcaccttgctcttctgctgcatgcggtccatttgcgcgtccttggactcgccggggatggacctcgtggagacct 
tgatccagatgagcacgtagcagagcgtgatcagggacatgggcagcaggtagcaggccaccaggttggctagcaggaagtagaggttcccatccgtgcccgggggccacac 
ctcctggcagaggaactgcggctgcgagtaggccgagaccagggcgtcggagaagacctcctcggcgggcaccaggtcgaagaagagcagccaaggaatggtggtcaccagc 
gcgattacccatatgccgattatcatgatgcgggcacgtcgctttgtcatctgcttcaggggccaccagatggcaatgaacctgcaagaggggagtcccaacaaaggggatt 
tttaccaagtagtcaaagaaggagtcttcgtaggagcagcagcagcagcaggagctccggcttcagatgccactgacggctttgagccgtcaaggataatggcaatcatcaa 
gctgtcgagtcgagtcgaggcgacgatgacgatgacgacaccgtttcaactaggaggagcagctcggcatgcaaatatctccgctgattaattcgcacacgcaggatgccga 
tgccagcagccatccttaagcctttttctgctgagtgttcgctttccaatctcgactattgaacccgcctgctactcctgctcctgctcctccgccccaaaaaagcggcgct 
gcggatcctgtcattttttattctgcttgccataatgaattgtaatttgaataagagctgcgaggtagtgagacgagtcggtatggggccaacaacgagttcagcatttttc 
atttagcttagtgcagccggctcattgtggcgcccgctcagtggtcatagacaagacttgtcccggacgaacgaaactggttgtctgctcctggcccgaaaccatgtaatta 
cgtgcattgtgttcgtattttcgacttggaccttgggttgtcgcctgcgtctgtccgccaaaacatttggcagataacacagcggtaattccattagagggcggccatgcca 
ggccactaccccagaatcctattcctcaccgactaaaaatagccagcttaacaatgtggccgagaccctcatctctgcgaaaaaaaaaaggtgggccgcctgggcgaccaaa 
caaagctgtccgcaaagccaataatcaaattgtcataacgtttatttataggagcaaaccaattgcatttgcataatttgcccatgcgacttctatggtcataaatcaggcg 
agagcttaattgagttctttttgggtaaaaacagcggagtcggaaaaagtaatcagcagttttcgctggtgcagactaattcaaactaaacctctaatttgatgttgaaatt 
ttgttaaatcgtattaactacacttttgatgatgtaaaccattttatgagctttaaaaactaaaatttctcttatgagttctcttatacgatttatgtatgatttgtttcac 
aaataaactctccttatagattcaattgctgtcggaaaattaaattattaatattattttggtaatcacgggtaattgttgcccatcaattccgttgagtaccatatgcgta 
aatctctaaggacgtcagttcagtattgttagcatatttattaagaaaaatcttgccacaatcaacttaaacaaagtagttaatactcgcgcttttaaagcatttaataagc 
cgcaagtgtattaatttaatatttgccacacaatcctgtgtttctgtctagatccacgtggttttatggctcctatttcgcttcctactgcggttacccatacgcactgttg 
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ACATTTGCGRCGCAAAGAAGACACAAAAGCCGCAATGCCAGCACCGTAAAGACACAACTATGTCTATAflGCGAGTGTGCTTTCTTATAGACTTTATGATTATMGCAflTCTA 
GTTATATACATATACGTATTCAGACAAGAAAAGACCCTAATTAAATGACAGTGGCATAAGTATAAATATAAGTAGAAGTAGAAGTATAAGCAAGTGATAAGCTAAGTCGAAG 
GGGCATTACGGGCTCTAAGACCGTGGAGAGATTTAAGTTTCATGTTACAGATAATAAATCTTAAATTTCGATTCAGTTTACGATGGCTCGTAAaAGGAGAGAATAACAGCAA 

aacaactggataaacggacatcaatgacgctggactcggacttcgactgggacttggaagtggaagtggaattggaagtggatgaggatgtggatgtggatttggactcggt 
tgggttggcgttaagcgtgggcggaggctgacatgcgggcctgtgggtgaccaaacgatcaagtgaggcgctatgcctttgtcccgctggctgaaaaatgacagtggcaaaa 
acagaagcttctggccaaaactcctcctctgattgtggccagaaattgatataaatgaaagtgcacataaagcgacgggcgtcttattgtccttgcggctatggctagcaca 
atttgagggacttcagcatcacttgataaatctgttgaccgcgacatttgccgaatttattgcttttttatactttatgggccaggaaacgcagggctgacacatcgattcc 
tggcggccaatcatcaatggtatcaatattcgcggtatcgcggccaacaacattctaaggcaattgaaatgcacataaaatgagaaacatgaaattgattgtaagcagcgcc 
taatrcggacaaggtgagccatagaagcatatgatgatcttgatcttgagacgactgctccactgactctcctacataactgcctaactggctaactgccttactgaccagt 
tttccccgcagctgattgcattgagtggcgctcactttcggttcggtcgcctggtgcaaatccactataaaagccgcactcggagctcaagttgtcaccagttgacaatttg 
aaatcattagcctagctacctttcctacttcagttatcccgccaaaatccaacgcctcaaaatgttcaaagtagtgagtatcattgtgcagaagaagttgtaaccagtttga 
atccaagcatcgtatcattatgatattatatcagtttgctctatagcatttaaaaataaatgcafltaaattaagtttatattacaactcatatgagtcacctagtttgttgt 
taggattggtcttatcttactcagaatgagtaatatatgctttaaagtctgcgacaactaaccaaattcactgtggagtggcagccacctgctgctccgtctaatgccacta 
atcaggctaattcgtgaactctgatccgccgcagctgttcgtgctcgccgccttcgtcgcctctcaggctatcgcccatcccggcgtggtggccgtggcacccgtggtggcg 
cacccggcggtggtccacacgcccatcatccatcatggagcccactcggtgcactcgtmggccattcatgtccaatctgcttatcaccaccgaacaactcacccctttcat 
tttcagccatgttgttcatcatccggcagccgtcaaggtcatcacacccgtcgtccacaagcccgtggtggcggtgcatgctgtcaggccggtggttccattggttccagtg 
catcatgccgcccccgccgtcgtcgtgcatcattaagtgcagcgaatacggttcactttcaatacctttcccgctcccgatcccgttcccgatccccaccagacccagacca 
atcccttcctgagcaccataaccacgatcccacaataaaagttcatagccacgtaattggagcgagtgttttcaacttacctgtccagggaaacggcaatcaggctgtagac 
cgaggctgccacggagacaccttgtatgtagggcacaaacttgcacatcagccatccgagcatccaggctgcgagttccagcagttcgcgaaacatgaaagggggaaaagtg 
cgggaggtgagtaaatattggtcataaacaacgcacaaattatgctgagtaactttcggtcacattgtccttgctggtgtcctagccatacactgaaggaaatgtgagggcc 
tatggcaacaaaaaaagaagaagatgcagatttttctaaaataccccaaggacactttgctcgaaaactttataaatatcctaaccacgaaaatcaaagcaaattaatgtgg 
ttaaccgaccgaaggatgcttattgttctcatgggaaatattcgaaatagtagatggtgtatcactttaaatacaaaccgtaagtcatcttgactgcttagctaagaaaaga 
gtgaaaatattataaaactaataattctccgctaagtattataatatatttcatatcttacttttattcttgaataagtacagttttacagctggggatcattgagttcttt 
atagctttactgcttttccccgagtgcactggtagtggttatgctcatggttgtggtgatggtggtggtggtgatagtggtagtcatggacagttgggttcgagatctggtt 
agcgttcaatgcttcttgtcattgccgccgcccacgttcccgtcgttgtttgctgactgagatttatgtgctgcatatcattcataatttggtgacaagtctcagattgcgc 
ccagccatcgtagtgggtggggcagaatgtggagtcctggggggcagagcattgtttgcccagacccagactcaaagccgcggtggaccgcaggactgcctaacttaatttg 

TATTCCTTTTACTCCATTTTGTGGCTGGATGATGATGGTAAATGGAATACACGGGCCCTACTTGCGCATAGTTGAGCGTTATGAATTTTATGCTAAGAGGCATAACTCACCC 

agacacaaaaggttataaagcatacatacatgggcgtgcaatgggaggggcaacttcccagggttgatgacagtcgggcggagaaaggcgcccggttcatgacatatcgaat 
ggcaattaaacaagtgcaagtgcggcccacacatcattagcagttagctgtttggcagtttggcaatttgcagaaggagtttaccgagttcaatgctcgaccaggagacgcc 
tcctgcgtcatggggctcctaatttcctgtctcgttcgcctgtcttatctctatttgttgtcgctttttattatgcagacgcaccgtaagccgccaaggcagcccaataaaa 
gctarctttaatatgatttggcacataaaaatataaatattattggcgctgtggcgcttcgtgcaagtgaactctgtggctgtggacaggttttggttctggacaggatcat 
gtgtgtctgctgggaggacatacacatcctcccgcctggcaaacaatcaacagtgtgtacacaggacagcagaccgaattggtttttcggttttattcctccgatttgaaat 
ggctttcgggcagacctgcaagtgcaacagactggatccgaggctattaatgcgagtgtatttggccactcacttgatattatgctgataagggtgaactttgatatatagc 
tgtctgcgggagctcttgactcggcaaactttatgctttcaagttgtttgaaaacaatttaaattcaaattcggttccgaaatgcttcatcacttgcgggacaccggattag 
cagtgccactgggtgaacttttttttttttttaaatacatgttcttccaacttcccttcggcaaataaactatctaattgtgtatgcaattctattaggacaggcaaacgtg 
actcggttggtttgtctgggattaaagtttgcatcagtgcactttggcagtcaagtgtcttaaccaagtttacattgtggttagcttagcgcagtcaagtgaccacttgata 
atcccgaacaaacaatggttttcagttcatatggaaagtattttgacaggctgaggcattcagcacggttcaacttattaaattgtccaactcgctgatttggatatttaat 
ataaaaatggaaaaaccacattttacagaaccaacgagaaaagctgacatttaaaattggtttttgt'raacataaaacgattgagaaggaataatacataattaatgaaccg 
caaaaaccgcattatatataaaaaaaatcaaataggtcaagggttatgccagtagttatttatagataccacaagcttaggtaagacatgattatttcaaggaacaccgagt 
cattcgaaaccaatctttaattaaccccaaccttaaggccacattaatgtgttgtacaaacataattgtagctaccgaaataacctaattgccttgcgtgacggggctaacc 
cgaacctccttgtggcattacccatttgaggactcaagtgaaaccaaacactccacagtttgctcggccaacatccagcctccccgtcaaggtgagaatgggatgggtgttt 
gatccgacttggagtgcgtttgacatggcaacatatggcaaacagacgcctccatttatacgcgaaagctggtgctcaatattgatcaactagaggagataataccaatcga 
tactcaggcttatcgccagccttcgattgtgggtggtgggtggtgggcttgtgcgccccatcagcccttaagaaaatgtcaagtgcgccctaagccttttattttttagcct 
cttaccctccttacaacgagagcctttgtttttcggtttcggtatatatgacttttagaaatgtgattcaactgaaaatatttaccttatttgataatttgcctgcagaatt 
aaatttaacgcggttatttatgaataatgtccccttatttggaaacaaatttttctcaaggacaactttaaaacgcgttatcttttacagcaacaataaacattttctctac 
tgtgctttatgaggctttaatagcgccaacaaagttttgttacatttttatccccaataccgtgggtgattgttcctatgtaggcaaagtggattggattattccgggcatt 
gcggattcgtttcggtggcggcattgtcatgcacggagtttgatcgtaaagttaatgaaataatcaggcgtgacaaatccgcaggtttggacttgggagtgggaacgggcgt 
ggattgcagtggagtggagtggagtggagtggccatggaaaaggcataacataacacttctggcaaatgcgttattgttatgattatcgccgcggcgtcacgtcctgcggcg 
acttctgtggcagctcttcgggaaaattgaataccacagctagttcgcagagaaatgcgcctgggacgcggagtgggaataggaattggaatctgggaatcggtggtcctcg 
atgtgagatacgggattgcgggagtaaagaagctgcagatcttaggtggagggctcagaggttggatgccatttgccgacggcacgtagagactaaattaaaatacaattgg 
cgagctgaaaaatgacttcatcagggatagggaaacagaaatgcgggagagtctgcctccaagttacgcatacgccgcgttcgcccggcacaaatcacgtggccccctaatg 
aacaaactggcaacgtgactgaaagatttagccaacgggagatgctgatctaaaaagggccgtaatgataacgattccccattgataaaatagcacaacaatggccataaag 
cgccatctgagacaattgccgttgctgttgcaaggatatccttcgtctcaaggcggcatcgttgctgggaaatcaattcatttcgcattttattggatttcaaagcacacac 

ATAAATAAGGGCCAGCGCCACATTCCGACCAACAACCACCACCCACCACCCACCACCCACAGGCCACCAGCCAGGAACCACCCACAAAGAGTACACGGCCCAACCCAATACA 

ggtgctcccacaccccacgcatacacacacacacacaccctcaaccacacacactggcacactcgcaagtgtgtgtgtgcataaccctgactattaggtttgccaagcgaaa 
tatttaggtcaattcatccagccgcaaagagccttgaatgcaaacaaccttccattgcaattctgaggcttgaaggtgtgcttagataatggcgctgccagcggcgatggcc 
gtgggcgtcgctgccagctgcgactgcggcggcgacgtcgtctttgacggctggaatttgatttcatataattcattttggtggcaaaacaacccgcccactttgccactcg 
ctcggggcgagtagtgggaactatcagacagcgatgtgaaatgctcgatgtacaaatatgctaaacttccgaactcccacgcactgtgctacacagggagaaaaggttttgt 
tagaggagagcaaaggattaagtggtatgactttacactgctaatttgattatgaagaatattcgtaagagttatagcatttcatcacctgacgaagttcaaaaacatttaa 
gtttaaagccaaaaagtgtatatttctgaaaacgtgtggattgtttaatgaaaatgacgtgttaaacttttgtttgtaaataatacgctacacatgtgtttagttggcaaca 
catgttttttcccctgcattagtgctatttcaccgttgccgctgtgaaaccgttgccttcgttttaacttcagcaggttgtcgaaagtgaataggaatgtaagtggggttga 
gagttgtggggatgaagaggtgaagctgaaatgagccaccccaaaagccccaatgccagcgggagggctgcaatttgttgtgactggcgtttttgctaggcatcggcggctc 
agctgaagctaagacatgccaagcgaaaattgcggagtgggcggccaaaagggtgccccatccaaacaaccatacaaccatctagcccacccaatctaccatccaccgccag 
ccacccaccacccccagtgacccagtaagagctgtcggccctgcgcccagcttgaatttgtgaaattaggtcagttttgtgtcgatgatcctttgggccaccccgaaaagca 
gccaccaatgcacctacacatgcacagtagcgagcaaacacatgggcgtgagagcgggcgagagagagagagcggagagagtgcgcagtgacacggacacaaaaacggagac 
acggacaccgatacagatacacaatgcaggcggaggtaaactcattcacttttcgccactcgcttcattcgtttccagcgattcgcactcggcagccttcgtttactacatt 
tcccccacagtgggcttctctgggggagtgtattggtatggggtgggatcgagtgcagtgccgagtctaagagtagagctgcattggaaaataaatctgataagtcaactgc 
atctgcatatttgttaccagtttgtaatattaatattcaggttaataaataatctgaggggaagaggaactaaaactatttttattattattattattaatttaatttaatt 
taatattattattattaattattttaaataattttaattattttcatatacttgtggcaaagaaatgtagtaacacaatttgttaaactatttcctattaatcgaacatatt 
gttttacgagtatgttgacaactcttattttgccagcgtactcgctgtatagcaacaagaaaaggagagttctcacagggctgagccacttaattggcaacacttggcacaA 
cctgcgagcctacgacctggcattgggatttgggactagggacctggcactgggattggggtagccatcgaaggcaaattgcagtgaaggatggagcttgcatggcgtgctc 
cacaatcggaatttgatttaccaaggatttcgatcatttttccatgtcagcggcgatttgcatagacatttgccaacctgctcaccttgtcgtccaggtctgcgactatcaa 
tttggatgtcagtccacaaagttcgtttcgaatgcaagcggggaagttgcggaggggaaaccaaccaattgggggtgtaagacggcagacagggacagatagatagatagat 
agatggacggatggataggcatgtcagggaaatttatttaaattaaagcatacaattatgcgtagcctcgtttatcgggtcgtttgtgggtcgtctctgggatttggtggaa 
ttggcctctaattaaatggacacttgttctttggtccatgcggattgtcgtttccaagaaggtaagggctcccatggagagagcacaaattaatttaattgagtgcgaacat 
ccgttgctacggaaatctgacacatttctggtaaagtacgctgaatgaattcttctttttttctttgctgcgtactgtgtgagatgtagttccttgatttaatgaataactg 
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GCTTTTGTCTTGGCAGTCTATTTTAAAAGTTflTCTTGACCATTGCAGflAAGCTftTTflAATTTTGATflCTAAAGCTTGGCATTTTCCAGAflCAACTTTAATGATGCTGCATCG 

TTATGTTCTTCTTGTGTATGTAGTAAAAATTCGGAAGAAGCCAGGGAGTCTGGTATATAAAATGCAGCTTACAAATAAACGGAAACACAGTATGTCTGTCAAAAGTAAATAT 

AACCGAGTATGAATAAAATCCCATAATTTATTATTTTCCACCAGAAATATGAGCACTTTCGTTTGGACATTTATGCCGACATTATGTCTAATGAGGGTCAATTACACTTGCC 

CAAACATATCGGGCACATTACGCATACGCAGCGTAAGCACGCAGGAGCAACAACTTTCGTCAGTGCCATTTGGCAGGGCTTTATCTAAAGCGGCGATTAATTGCAGTACATA 

AATCGGGCTAATATGCAGAACAAACAAAAAAAAACTTCGGCTTCCCCCTAGCAGCTAACGTGGCCCCCAGTGGCGAATAAATCTCGTCGGCATCTTGCGGAATTAAAGAGAT 

ACTTACGCACAAAGATGTTGCCAATCAGTGTGGCTGGCAGGCAGAAGACAATGACCAGGATGTCCGCAATGGCCTTTTCGTTGGAGAGGGGGGAATGTAAATTTTATTTCAA 

ATTAATGGGGTTCATCCACGAGATTTACGGGGTCCACCCACCAGATTCACAATGAAGTAGTTCGTCACCGTCCGCATGCGAGGAGCCCGCAGGACGACGGCGATCACGAAGC 

TATTCCCGATGAGTCCCACCAGGAAGACGACGATGTAGGCCACGCAGTAGACCATGGACATGGCCAGCGAGTGACGGTAGAGTGGGTCGAATCCGAAGTCCACGCTCCCGTC 

CAGCTGCTCCCACCGGAGGGCACTGTCGTTGTACGACTTGGTGGCCGCAGCCGCCGAGATCAAGCTGGTGGTCGCCGGCGATAGGTTCAGCAAGTTGGAATAGTAGGTGGAT 

GAAGTGGCGTCTGCGATGGCCAGCGACTTGCCCGCCAGGGATCCTCTCGCAGATGCTGTTGTGGTGGCCGCCAAGGTGGCTAGCAGCGAACTGGCCGCCGGAGCCACCGTGG 

CCGTGGCCGTAGTGAGAACCGTGCCCAGCAGAGTGTCCGTAATGGCGCTGTTATTCAGTTTAATGGCACTCGATGAGCCCAAGGAGCTGACAGCAGTCCCACCGCTACCACT 

AGTGCCATTCAATTCGTTGAAACTCAAGAGGTTCTGCAAATCGACGAAATTATGCGTGAAATTATTGTAATCCCCGGCGAGGGCGGCGGCTTCCTCCATTGTCTCCGCGACC 

ATTTTCCCCGGCGCCATGGTCGGAATGGGCATGGGTACACTAGTCGTCGTACTGGGAACGTCGCCACTTGTATGGCTTTTATGCTTTCGCTTGCGAATCCGCCCGCTGGCTG 

CCATCATCAAATCATCCGGCGTCTTGATTGGGCGGATGGCACTTTCACTGGCGCCTCGCTTTCGCTATCACTATCTGTAGACGTATCTCGAATCTTGAATCTGGTATCTCGC 

TTGTTTTAGAAAAGGTCTATAAAAATGGCATCGTCACTTGGCATTTCGTTGGCTTGTGTAGGATATGCAGGAAGTTTCGTGGCTTTGGGTTAGAGAACAGTTTTTAATTAAC 

ACCCATGTCCATACAATATCTCGTTTATCCGGGGATTGTTTTGCGTTTGCCCCGAAGTTGCACAATAAATGCAGTTTTCTAGAAACTCGCTTCTAGCGCCCAAGGACCCTTC 

CGTTTTCGTTTTTTTTTTTTTTGATTGAAATCACCTACAAAAGTTTGTTGCCGTTTCGGCTGCGAATCCGAGTGCGGAATGCAATGCATTGCCTGGGTCGTTCGTCAGCTGG 

TTGACGTTGCCTTGGCTGGCATTACGAAAAAGTCGGCGGCCCAAGGCCTCGTGTGCATATTTCAGATAAGGCCAGATGCTGGGTGCCTGGATAAATGCAGCTAGAGGGGATT 

GGGGATCGGGCATTGGGCATTGGGCATTGGGCATTGGGCATCGGACATCGGGCAAAGGAGCCGAGAGCATCGGCTCAGGGACATGGTCACCGCCACAGTTGCAACACATGCG 

CGTCGCTAATTAATTTTAATGCCCGTTCTGCCATTGTTTGGCTGTTGTTTTTGGCACAATTGAAAAGCCAGCGAGCGTGCACTTTGACCTCAAAAAATCGACACCAAACATC 

GAACAAACGAGCAATTGAGCAACTCAATAGCAGCGCATAAAAATTCCATTAAGCACAACTGCCAAGCGCACGCAGCACTCTTCGATTGAGTGCCAAAAAATTGCCAGTAAGC 

ATTGAAACAATTGGTAACAAACGAAGCCAGTGCGACATTCTTTTGTGCGGTTCCATGGGAGTGTCCACATCAGACAAACTGGCGACTAAGGGCCCCTCGGCACACAATTCA 

(SEQ ID NO: 43) 

Exon: 13111.. 12362 
Exon: 12281.. 12156 
Exon: 164 9.. 1001 

Start ATG : 13111 (Reverse strand: CAT) 
Transcript No. : CT18916 

ATGATGGCAGCCAGCGGGCGGATTCGCAAGCGAAAGCATAAAAGCCATACAAGTGGCGACGTTCCCAGTACGACGACTAGTGTACCCATGCCCATTCCGACCATGGCGCCGG 
GGAAAATGGTCGCGGAGACAATGGAGGAAGCCGCCGCCCTCGCCGGGGATTACAATAATTTCACGCATAATTTCGTCGATTTGCAGAACCTCTTGAGTTTCAACGAATTGAA 
TGGCACTAGTGGTAGCGGTGGGACTGCTGTCAGCTCCTTGGGCTCATCGAGTGCCATTAAACTGAATAACAGCGCCATTACGGACACTCTGCTGGGCACGGTTCTCACTACG 
GCCACGGCCACGGTGGCTCCGGCGGCCAGTTCGCTGCTAGCCACCTTGGCGGCCACCACAACAGCATCTGCGAGAGGATCCCTGGCGGGCAAGTCGCTGGCCATCGCAGACG 
CCACTTCATCCACCTACTATTCCAACTTGCTGAACCTATCGCCGGCGACCACCAGCTTGATCTCGGCGGCTGCGGCCACCAAGTCGTACAACGACAGTGCCCTCCGGTGGGA 
GCAGCTGGACGGGAGCGTGGACTTCGGATTCGACCCACTCTACCGTCACTCGCTGGCCATGTCCATGGTCTACTGCGTGGCCTACATCGTCGTCTTCCTGGTGGGACTCATC 
GGGAATAGCTTCGTGATCGCCGTCGTCCTGCGGGCTCCTCGCATGCGGACGGTGACGAACTACTTCATTGTGAATCTGGCCATTGCGGACATCCTGGTCATTGTCTTCTGCC 
TGCCAGCCACACTGATTGGCAACATCTTTGTGCGTAAGTATCTCTTTAATTCCGCAAGATGCCGACGAGATTTATTCGCCACTGGGGGCCACGTTCATTGCCATCTGGTGGC 
CCCTGAAGCAGATGACAAAGCGACGTGCCCGCATCATGATAATCGGCATATGGGTAATCGCGCTGGTGACCACCATTCCTTGGCTGCTCTTCTTCGACCTGGTGCCCGCCGA 
GGAGGTCTTCTCCGACGCCCTGGTCTCGGCCTACTCGCAGCCGCAGTTCCTCTGCCAGGAGGTGTGGCCCCCGGGCACGGATGGGAACCTCTACTTCCTGCTAGCCAACCTG 
GTGGCCTGCTACCTGCTGCCCATGTCCCTGATCACGCTCTGCTACGTGCTCATCTGGATCAAGGTCTCCACGAGGTCCATCCCCGGCGAGTCCAAGGACGCGCAAATGGACC 
GCATGCAGCAGAAGAGCAAGGTGAAGGTCATCAAGATGTTGGTGGCCGTGGTCATCCTGTTTGTTTTGTCCTGGCTGCCGCTCTATGTCATCTTTGCGCGGATCAAATTCGG 
CTCGGACATCTCGCAGGAGGAGTTCGAGATCCTCAAGAAGGTGATGCCGGTGGCCCAGTGGCTGGGCTCCTCGAACAGCTGCATCAACCCCATACTCTACTCAGTGAACAAG 
AAGTACCGACGCGGATTCGCGGCCATCATCAAGTCGCGCAGCTGCTGCGGACGCCTCAGGTTAGTTAAT 
(SEQ ID NO: 44) 

Start ATG: 1 (Reverse strand: CAT) 

MMAASGRIRKRKHKSHTSGDVPSTTTSVPMPIPTMAPGKMVAETMEEAAALAGDYNNFTHNFVDLQNLLSFNELNGTSGSGGTAVSSLGSSSAIKLNNSAITDTLLGTVLTT 

ATATVAPAASSLLATLAATTTASARGSLAGKSLAIADATSSTYYSNLLNLSPATTSLISAAAATKSYNDSALRWEQLDGSVDFGFDPLYRHSLAMSMVYCVAYIVVFLVGLI 

GNSFVIAVVLRAPRMRTVTNYFIVNLAIADILVIVFCLPATLIGNIFVRKYLFNSARCRRDLFATGGHVHCHLVAPEADDKATCPHHDNRHMGNRAGDHHSLAALLRPGARR 

GGLLRRPGLGLLAAAVPLPGGVAPGHGWEPLLPASQPGGLLPAAHVPDHALLRAHLDQGLHEVHPRRVQGRANGPHAAEEQGEGHQDVGGRGHPVCFVLAAALCHLCADQIR 

LGHLAGGVRDPQEGDAGGPVAGLLEQLHQPHTLLSEQEVPTRIRGHHQVAQLLRTPQVS* 

(SEQ ID NO: 45) 

Name: Tachykinin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384554 

TGTTTGCAGAGCCCACAGCAACCAATATGTTTTCATTGTCTGCCCGGTTTGTCGCTCAATTTATTTGTGATGTTGACTTTCCAACAGTGACGTCAGTGGAGGCCACAGACAA 
TACCACATCATCGCAACAATTATAAATGGTGTGGTGGCGAAAAAACGTCCTTGTGCCCATTGTTCAGTGGGGCGAACTGAAAAGTGGATTTTCCAGCGCGTGCCAGGAAAAT 
GAATGAAAATGCGGGACATTCTTGAAGTCGTTGCCTTTATTCTGCTTTCACACATAATTATGAAATAAATATTAGCTTAAACCAAATTGACTACAACGTTATGCTCTATTAA 
CTCTTATGGTCTACTTATGCTTAATGGGTTTTGGTTTAAGATGCTTGGGCATGTTTATCCACCATTACGAGTGGCTATATACACGGATGAGTTGGAATTTAAATGGCCTTAA 
. TAGCTAGTTAGGACTGGAAAGAAATGCATTTTAAATTTCTGATAAATGGCCGCGATTCACATACAAGTAAAATCAGTCTGTTTAAGAGAATATAACTAAACCGAGCTACTAA 
TGGATTGAAACCTAGGTTCTGAGTGGTGGACAAAAAATCTTAATAGTTTTCGAAACACTTATTTCATTAGGATACCATATACTTAAATTAATTTCTCCAGTTTAAAGTTGAT 
TGCAAAAATGCCTAATTAAATGTTTCAGGTAAGTTTTTTGGAAATATAAAACCTTTGTACCAATATCAAATTATAATGGGTTCTTTAGTTAAAATTTAATAAAAATCAAAAT 
CTTGAATAGATTCAATTACCCATTTTGACACTTTAGATTTTTAATTTTAGAGTTAATTAAATGCCGTATAAATGTTATCTTAACTATAGATTGCGAAGTAGAAATAGGTTTG 
AAATTCGTAGCGTAGACTTGGCGTACTTACGACTAGAAACATACATTTGGCTTTACCCATGCCTGGTATAACTATCGAACTTATTGTCTAAGCCTCTGGTGCCTTCACAGGA 
TGTCCGTGTTCCCCTTTAGAGAAACTCCGCGTCGCTAAAGGACGTGGGACCATGAATCGAGAACGCGGATCTTTTCCGGCTGGTGTGGGCCAGCTATCATACCTCCGACATG 
GCTGGCACGGTAACCACCTTGGTGGGACGTTCGCAGATTACCACACGGGCGTTCGTCTTGTTGGTGGACCTTCGGGAAACCGTCAACGAAGTGGTCAGTGTCCTCATCGAAT 
CGCAACTGTTGTGAAGCCGACGACCAACCGTGTGGCAGCGGTTTTGCTGCGAGTTGTTGCGGTATGACTTGCAGCAGCATGTGAACCACTTAAAAGGCGGAAAGCGACTGGA 
AATATAATAAAAATAAAAATATCAAAGTAAGTAAATTAAAGCATAAGAAAATGTGTAAGATTTTTTAACATGAAATCTTAACATACCTTAATGTGCGAAAGACCTGCGATGA 
GAAGAGGCAATAGATTAGTGGATTCGCCGCCGAGTTCAGCGGTGCCAGACTTTGGATGAAGGTGGCGATGGCAATGTTGGTCTGTGAGTGTGGAATCTGGCCAAAGACCTGC 
AGCAGATCGAAGATGATATACGGCGACCAGCAGATGATGAACACAAACACGATGGTCAATGTCATCTTGACCGTTTTGACCTTTGCCCGTGGAATAATGCCCCTCGAGCTGG 
CCCTCCTGGCAGGTGCAGCTCCAAAACCAGCACGTTCTGTTAAAATATAATTAACAAATAAAATAAAATTATTTTTGGGAACTCCGATTTGGCAATATGCATAGCCAGAAGA 
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TATCTTACCCGTGGGTACAAAAATGGAACCCTTTGCCCAAATCGTCTTTACGATGATCGCATflGCAGGCAGflTATGATCAGCGCAGGAATGGCAAATAGAGTGGCCGftCRCC 
AGGCTCATGTACACCTGCCAGGCGATCGGTGAACCCAACTCAATCCAGCATTGCGGATGTCCTTGGATGAGCTTCTCCTCGTACAAAACCAGGATGGGAAGCGAAAACAACG 
CCGAGATGAGCCATGCGCCAGCCACCAGGTGACGGGCTCTTTTCCCTAAAAGCATAAAATTTAAATATATATTTAAGTGAAATATATTTCTAATTTAACTTAAATTTGTATA 
TTTTTTTTGGTTAATTCCCGAACTAAATTATCCAAGCAGTGTAAGCAGAAGCCACAAACAGCGCTCATCTTTGGCTAACAACATTTGGCAATTAGAAAACTTTTCATACGAC 
GACTTAACTTACACGACTTTGAGAAGTTCATGGGGTGTGTGATGGCATCGTATCTGTCGATGCTCATGGCCACCAGCACGTAGGTGGACGAGTATGTGACGCAGACCTGCGA 
GAAGCGGATGGCCTTGCAGGCCAGGTTGCCTGCCCGCCACGAAATCGTGATGCGCCATATGATGTCGGTGAGGACGTTGAGCAGTCCCACGCACAGATCTGGAAATAATGGA 
CTGGGCATTAGAAGCAAGGGAAGAGGTCAGTCCCAATCCCAAAACTCACCTGCCAATGCCAGCTGTTTAATGAAGTAGTTCATCCGCGACTTGCGATTCTTGTTGATGAACA 
TCACGAACAGAACAGCTGAATTGCCCAGAACGATGACGGTGAACAGGATCCAGAGCACAGCAAACTGTTCGGTCTGGAAGAGTGCAAAGAACAAAGTGTGGTCACATTTCAT 
TAAACGCCTATCAAGTTCATTGTCAACAAAGTGTTAAGCACGGAAATCACGCATAAAACTCAATTTAATGCCATCCAAGCACACACACACACACACGCCTTGTTTATGGCTC 
TTTAAATAGCCATAAAAATGATAGAGCTATGAATTCAAGCTGGTTGGTGCGGCGGGCAGGATGTGCCGGCGGAAGTTTGCTTCTTCCAGTTCAGTACTAACAAGACTAGGAC 
AACACGACATCCTGCACTCAAACACCCGTGGACGCTCACCCACACACCCAGACGTGATGGAAGACAGACAGCCACCGTTATTCCAGCAGGAAATTGCTGCCAAGATGAAGGA 
CCTGCACTTCGAAAAATATTTCTTTCGACTTTTAAGGCAGTTTCACAGCTCAAAATGCGTGGCTTGGTATTTAATTTATAGAACTTATACTCCAGGCATTCGAAATTCAATT 
CATAGCACTCAAATCTAAAATTGTAACAATTCTTATATAGAAAACTAGTAGATAAAGAATAATGAACCCTTATTACTTTTACTTTTTTAACTGAACTGTTATTATTTTAAAA 
ATATTTTAAACAGTGTGAAGTGCAGATATGACAGTGAGTAGTTCTGCTAGTTCTAGCTTATTGCCGAAGAAAATGAAATATTCTTTATACGCCCGCATGCAGTTTATATAGC 
AGAATATCGCATATATGCTAATATGACTAATGATCTCTTCATGGTATCTACAAAGTTTTTTGCTGCAAATTAAATTTGCTCTCAAGGCATCTCCCCCTATGCCCACGTCTAT 
TGAAGTTAAGTTTGAATTATGGCCTCGGGGTCTTGCAGTCTGACGCCAACACCGATGTTGATGCGGACTTACTTAAGGGCAGGCCAAGTAAACTTGTCCCGAAAAAAAAGCT 
GAAATCCTGACAGGCTTCGTCCTTGGAACTGCGCTCAGACTGAAGACGCCCCACCTTATTCTACAACTTTTTGTCCATACCAAACTTGCAGTATTCACTTTTTG 
(SEQ ID NO: 46) 

Exon: 2688 .. 2514 
Exon: 2450.. 2253 
Exon: 2061.. 1801 
Exon: 1716.. 1431 
Exon: 1339.. 1111 
Exon: 1042.. 1001 

Start ATG: 2688 (Reverse strand: CAT) 
Transcript No. : CT19191 

ATGAAATGTGACCACACTTTGTTCTTTGCACTCTTCCAGACCGAACAGTTTGCTGTGCTCTGGATCCTGTTCACCGTCATCGTTCTGGGCAATTCAGCTGTTCTGTTCGTGA 

TGTTCATCAACAAGAATCGCAAGTCGCGGATGAACTACTTCATTAAACAGCTGGCATTGGCAGATCTGTGCGTGGGACTGCTCAACGTCCTCACCGACATCATATGGCGCAT 

CACGATTTCGTGGCGGGCAGGCAACCTGGCCTGCAAGGCCATCCGCTTCTCGCAGGTCTGCGTCACATACTCGTCCACCTACGTGCTGGTGGCCATGAGCATCGACAGATAC 

GATGCCATCACACACCCCATGAACTTCTCAAAGTCGTGGAAAAGAGCCCGTCACCTGGTGGCTGGCGCATGGCTCATCTCGGCGTTGTTTTCGCTTCCCATCCTGGTTTTGT 

ACGAGGAGAAGCTCATCCAAGGACATCCGCAATGCTGGATTGAGTTGGGTTCACCGATCGCCTGGCAGGTGTACATGAGCCTGGTGTCGGCCACTCTATTTGCCATTCCTGC 

GCTGATCATATCTGCCTGCTATGCGATCATCGTAAAGACGATTTGGGCAAAGGGTTCCATTTTTGTACCCACGGAACGTGCTGGTTTTGGAGCTGCACCTGCCAGGAGGGCC 

AGCTCGAGGGGCATTATTCCACGGGCAAAGGTCAAAACGGTCAAGATGACATTGACCATCGTGTTTGTGTTCATCATCTGCTGGTCGCCGTATATCATCTTCGATCTGCTGC 

AGGTCTTTGGCCAGATTCCACACTCACAGACCAACATTGCCATCGCCACCTTCATCCAAAGTCTGGCACCGCTGAACTCGGCGGCGAATCCACTAATCTATTGCCTCTTCTC 

ATCGCAGGTCTTTCGCACATTAAGTCGCTTTCCGCCTTTTAAGTGGTTCACATGCTGCTGCAAGTCATACCGCAACAACTCGCAGCAAAACCGCTGCCACACGGTTGGTCGT 

CGGCTTCACAACAGTTGCGATTCGATGAGGACACTGACCACTTCGTTGACGGTTTCCCGAAGGTCCACCAACAAGACGAACGCCCGTGTGGTAATCTGCGAACGTCCCACCA 

AGGTGGTTACCGTGCCAGCCATGTCGGAGCGACGCGGAGTTTCTCTAAAGGGGAACACGGACATCCTGTGA 

(SEQ ID NO: 47) 

Start ATG: 1 (Reverse strand: CAT) 

MKCDHTLFFALFQTEQFAVLWILFTVIVLGNSAVLFVMFINKNRKSRMNYFIKQLALADLCVGLLNVLTDIIWRITISWRAGNLACKAIRFSQVCVTYSSTYVLVAMSIDRY 
DAITHPMNFSKSWKRARHLVAGAWLISALFSLPILVLYEEKLIQGHPQCWIELGSPIAWQVYMSLVSATLFAIPALIISACYAIIVKTIWAKGSIFVPTERAGFGAAPARRA 
SSRGIIPRAKVKTVKMTLTIVFVFIICWSPYIIFDLLQVFGQI PHSQTN1AI ATFIQSLAPLNSAANPLI YCLFSSQVFRTLSRFPPFKWFTCCCKSYRNNSQQNRCHTVGR 
RLHNSCDSMRTLTTSLTVSRRSTNKTNARVVTCERPTKVVTVPAMSERRGVSLKGNTDIL* 
(SEQ ID NO: 48) 

Name: Vasopressin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384294 

TTCCAATGCGATTTAAGGTAATTGACTATAAGTACCCATGACGAACTTCTGGACAGTTCCCCAGTGGAAAAAAAAAACTGGATTTATTTCTTATCGCAAAACTTGCGATAAC 
CAATTTGTATATGTTCATTCAGGGAGCGTGCCGATTTTTTAGTTTGCGAAATTGAGTTGTTATGTTTTCTGAGTCATTGTCGAGCGGGTTTTGGTAGCCGGGGATTAAATCT 
GCTGTATCATCTTGGTCCAGCGTTTATTTCTATTTGCTTTTAATTTCACAATCAAAACAGCAATAATAAAACCGTGCCTTATCTTACGGACATATCTCCCCAAGCGTCGTCT 
CGCTCTAATTTGGATTTTTCCTCTCTCTCTCGCTCTTTTCTTGCAGCTGGCGCCCGACGGTGGGAATTTCATCAGCTGTTGTTGTTGTTCATTTCACAGCAGCAGTCGGCCT 
AATAAAAATGCAATTTTTATGGTTAAGAAAATAGTGCGCGAACGCCGTTGTGTTGCACATTGGAAATTGTGCAGAAGCCAGAAGAAAATGTGTCTTGGAAAATGGAAAAGCA 
CGAGCAGCAGTAGCACCAGCAGCAGCTATAACATTGAGCCAGTGATGTAATACAGAGCAAAAGTGAAGCGGCGGCAATTGGCATTGCAAAAAAGAAACAGATTATACAATTT 
GTGTTGCTGCCGCACTGAACTAGGACAACAGCAGCAACAACAGCAGCAGCAACAACAACCACTAGTGATAAAGACCGGCTGGCAACAGTGCAGCGGCCACTTGCAGCAACAT 
AATTTCAAAGTCGGAGCGTGGGAATCGCAAATCACGGCAGTGGCAGCAACAACAGCAACTGCAGCAACACTCGAGCACTCGTGGCTCTCCGGATCGCTCATTTCGCTCGCCG 
TTCTCGCGCTGGCGTTTCTCCGTTTCGCTCGCACCCGCCTGTGGAGCAGTCGCGTATATTTGGAGGCGTTGGGGCGAACGCTCACGACAGTTGTGCTTGGAACGCTTCGGAG 
TGAAGATCGTGCTCGTCCGATCGGTGCGCGTGATAAATACACGGCGAACAAAATTGAGTCAAGAGTCAGACGGTCGCAAGACAATTGACAAGACACGAAAAATCATAATAAA 
CACAAGTGGAAAAGCAACTGCTTGAGAGCAAAAAAGAAATTGTTTTTTACAAATCTCGCACGTAAATTAGCAAGCGTTCGCAAGAAACAAACCGCATACAATACTGAACACA 
AAAAAAAGGGTATTCAACAAAAAACCAAAAATATTTGGCTGCAAATGTGAAGAGACGAGAGACAGCGGATATACTACCGAGTGAAATTTGTGTCATAAGCAGAGCCAAAACA 
AAACCGAAAAGCCAAATTCAATAAAAGGCCATAAGTGACTGTGAAACTGTAAAGTGATGTGTGGTAATCGAAAAACAGCATAAAGAATGGCCATAGACCTCCTGATCCTGGC 
CCTGCTGCTCGTCTCGTTCCTGATCAACCTGCTGGCGCTGTGTGCCTTTTGGATAACGCCTGGCTTGAGGACCACCGCCAATCGCTTTACGATCAACCTGCTGGCCATCAAT 
CTCATTGGTTGCTGCATCTTGGCCCCCACTTTGTTCCTGGGACTGCCGGGAAAGTCTGCGGAGGCATCCACTTCCAACGCCGAGACCCTGGAGTTCTTCTCGAAGCCGGGAA 
ACCATCAGGTGATACTTCGTCGCAATGGCCAGCTGGTGGAACAGGACGGCGTGGTGGTGCGAAGGAACATCAGCGAAAACGGCGATACGGTGGAGACCTTCTTCAAGTGCAA 
TGCCACCTACTGTCGCGAGTTGACCATCGATGAGCGCGGAGATGGTGGCTTTGTCATCACGGAAACCGAGACACACGAAGAGAATCTCTCGGCCTTTGAGAGCTTACCCACG 
GAGGCTCCCATTTTGCCGCCCGTCCAACTGAGATGTTGGTCCATAGACATGACTGCTGCGCTGGGAGCTCTGGCAGTACTGCTCGTGGTGGGGGACACCTGGTGTGCCGTTA 
CGGATCCCCTGCGCTACCACAGTCGCATTTCGGGCGTGAAAACCTGGATCTTCATAGCACTCACCTGGGTGGTGGGCATACTCTTTGGAGCCCTTTCCGCCTTCCGAGTTTT 
GGACTTTGAGGCGGACGCCCTGTTCAGCAGACAACGCCGCCTGGCAGTCACGTACTTCAACATCAGCAGCACCAATAGTATTTTCGGAGTGGTGTACGCCAGTGTCTACTTC 
ATCGTGATCATCCTCCTGCCCTTCGGCTTCGTGTGCGGAATGTACTGGAGGATATTCAGTGAGGCGCGCGGAAATGGGCTGCGCATGCGCCAGAATGGCTCTTCGCCGCTGC 
TGCAGAGCGCCTTGAATCTGACGGCTGGCCAGCAGGCGGCTCAAGCCAACCAGTTCTCCAATAGCTTGTGTGTTCATCGGCATTCCATATCCTCGGCCAGTTCGCATGGCGG 
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CAACAGCAGCTTGGGCTTGGGCGGTCTGCAAATGCAAflTTGATCftGCGCCAACAGCCGAGGAGTTCGCCCRGTTGTCTCCGTCGGGftCTCGGCCGCGAAAGTACTTTTGCCC 
ACCATCTCGGATGATGGTGGCTCAGATGCGGAAAGTGGAGCCGGAGTTCAACTGATGCCCGTACAGGAGCACTCACTGTCAGATAGAAATCAGAACATTATGCTTACTCTGC 
AAACAGCCAGTGGTGAAATAAAGAGGAACTACTCCGCCAGACAGCTGCCCCTGTTGGGCACTTCTTCGCAGGATCTGAGGGAGACTAATCGTCTGCAGGGCATACGCCAGGT 
GCACAGTTCGCCCAATCTCCACAAGTACACGGAACTGCGCCAGGACTCGCTCAGTGAGGAGTGTGGTTCTCCGCATCTTTTGGGTCACGCCCAGCGGCAGCAGCAGCAGCAA 
TTGCACCTGCAACATCAGCAGCAACAGCATCATCACCACCAGCAACATCATCCGCACTTCAGTTCGCCGCGCCACCAGCAACACGGCCACGCCCTCCAGATACCCGCCATTC 
ATGCCTCCCCCAAGGCTCTGAGTTACATGAGCTCCTTGCGACACCGTTTGAGCAACGCCAGTTCGCTATTCAAATACCGTGAGGAGTCGAGAGCAGCCCGCATCAGCATCCT 
GGTGGTGGTGATGTTTGTGGTCTCGTACCTGCCCTTCGGACTGCTGGTACTCCTGCAATCGCGACTGTCGGCTGCTAACTTTGGCGGATCCTCCCAGCTGGCCATCTTCATG 
ATCCTGCTGGCCAATCTCAGTTCGCCATTCATCTTCGCCTACAGGAACAAGAGGGTGAGGCGAGGAGTCAAGCGGCTGTTTGGTCTGGATTCGTCGTCGGGCCTGCAGCGCA 
ACTGCAGCAGCAGTGTAAAGACCAATGGCACCGCAGGTCCTGCTGCAAGTGGTGCACAATTGCAGCGCAACAGCAGCAAGTTGTCGCAGTACAGCAGCAACAGCTGCAAGTA 
TCTCACCCCCCAGAGTTCCCTGGTCAGTCAAGTGCCGGTGCATACGACCTTGACGTTAAGGCCCAACAGCAGCTGCAGCACCATCATCAATTTCGGTGGCGCCAGGGGCTCA 
GCGGACTCCGATGAGCAGCCACCGGCAACACCACCACCTACTGTGGCAGTTGCTCCACCGCCCACACGACCGCAGCGCCCCAAGCTGCAGAGGGGCATCACCATCGTTGAGC 
ACATAGCTATAACACCAACCATGCCCCAAAAGTTCCAGAATCGGCGCGCCAGGCTCTTCGACATGTTCTTCCGCAGCTCCAAGAAACTGCAGGCGGGCTGCCAGAGCCAGAG 
TTTGCCCACTGAGGTCTAAACGGGGGGTTATGATAACTATACATATAGGAAGGTAAACAAACACTAAACTTGCTTGCTGAAGAATTCTAAGAACAGTATTCGTAATTTGTAA 
GTCTAGCAATTAACAAAGACAACTGTTTGAATTTGAGTTTTAAGACCAGATCATGCTGTCCTAGAAGCTTCTCATGCCAAAAAACAAAACCACAAAGAAAGTCAAACTAGAA 
AGTTTGAGATGCTTTAAATTAACAAAAAAAGAAAAGAAAAAAACAAAAAACAAAACAAAGTACCTTTTAAAAAATAGAAAACTGTGTACAAGTCAAATTAACAAAGATTGAG 
AGTGATGGGAGTTATAATCTTAGGGTGGATATCAAACTGAAAAGTGGAACCAAGCCAAGCCAAGTTGAGATCGTAATCAATGTACCTAAGCAACTTGCATATGAACATTTAA 
TGTTTTTACTCAGATGAAATCACAGGAAAATAGCAAAGACAGCACAGCTTATTCGAACTTCCCCAACTAGTTTATTATTTATTGAAACAATTGTAAATATTTATTAGACATC 
CACGCCGCAAAGCTTTAACTCCAAAAACATTCTGGCTAAACAAGAAGAGACGCATACATTCGATTCATAAGTTAATTAAGTCAATGTGCAAAGCATATGTGTATTCTAGAGT 
TTAAGTTGTAGAGCCGAGTAGTCTTACACCGAGCACCTAGTGCCGAGCAGTGCTTTTTCACCTGCCCAACTGGAGCGTCTGTTGCAGGCGGCATTTTCATATTGAAAATGCA 
TTTTGGCCAGTGCACAGTTGGTTTTCCAGTTGGCCAGAATGTCAGTCAGCAGTTAGTTAGCCTTTGGTTAGCCAACGGCTATTGGGCCATTAAGTCAATTTGCTTTTTAGAA 
CACCGCGGGCATTGCGTGCAGCTAAATACCTCATTAGTTATAATTTGAAGTGCGAGATGGATGGCGGCAATAGGTCAAAGGTCGGAGCCATCGGCACGGCCCATGCATAATG 
CATGGAAAATGTTCGATTGCCCCTGGCATTTGTATCATTTGCTTTACTATGCTGCAGGCCAGAAAGCTTTATACCTTAAGTTCTCCTGCTTTGATTTCCCTTGGCTGCACAA 
AAAGGCAACAAAAGCAACACACACTCACACAACACAAAAAAACCTCGAATTTATATTTCTAAAGTTTATATTTTATAAAGAGTGTGTGCATAGGAAATATCTATAAATATTC 
ATGCATGTGTG 
(SEQ ID NO: 49) 

Exon: 1001.. 4051 
Start ATG: 1431 

Transcript No. : CT19320 

CTTCGGAGTGAAGATCGTGCTCGTCCGATCGGTGCGCGTGATAAATACACGGCGAACAAAATTGAGTCAAGAGTCAGACGGTCGCAAGACAATTGACAAGACACGAAAAATC 
ATAATAAACACAAGTGGAAAAGCAACTGCTTGAGAGCAAAAAAGAAATTGTTTTTTACAAATCTCGCACGTAAATTAGCAAGCGTTCGCAAGAAACAAACCGCATACAATAC 
TGAACACAAAAAAAAGGGTATTCAACAAAAAACCAAAAATATTTGGCTGCAAATGTGAAGAGACGAGAGACAGCGGATATACTACCGAGTGAAATTTGTGTCATAAGCAGAG 
CCAAAACAAAACCGAAAAGCCAAATTCAATAAAAGGCCATAAGTGACTGTGAAACTGTAAAGTGATGTGTGGTAATCGAAAAACAGCATAAAGAA1GGCCATAGACCTCCTG 
ATCCTGGCCCTGCTGCTCGTCTCGTTCCTGATCAACCTGCTGGCGCTGTGTGCCTTTTGGATAACGCCTGGCTTGAGGACCACCGCeAATCGCTTTACGATCAACCTGCTGG 
CCATCAATCTCATTGGTTGCTGCATCTTGGCCCCCACTTTGTTCCTGGGACTGCCGGGAAAGTCTGCGGAGGCATCCACTTCCAACGCCGAGACCCTGGAGTTCTTCTCGAA 
GCCGGGAAACCATCAGGTGATACTTCGTCGCAATGGCCAGCTGGTGGAACAGGACGGCGTGGTGGTGCGAAGGAACATCAGCGAAAACGGCGATACGGTGGAGACCTTCTTC 
AAGTGCAATGCCACCTACTGTCGCGAGTTGACCATCGATGAGCGCGGAGATGGTGGCTTTGTCATCACGGAAACCGAGACACACGAAGAGAATCTCTCGGCCTTTGAGAGCT 
TACCCACGGAGGCTCCCATTTTGCCGCCCGTCCAACTGAGATGTTGGTCCATAGACATGACTGCTGCGCTGGGAGCTCTGGCAGTACTGCTCGTGGTGGGGGACACCTGGTG 
TGCCGTTACGGATCCCCTGCGCTACCACAGTCGCATTTCGGGCGTGAAAACCTGGATCTTCATAGCACTCACCTGGGTGGTGGGCATACTCTTTGGAGCCCTTTCCGCCTTC 
CGAGTTTTGGACTTTGAGGCGGACGCCCTGTTCAGCAGACAACGCCGCCTGGCAGTCACGTACTTCAACATCAGCAGCACCAATAGTATTTTCGGAGTGGTGTACGCCAGTG 
TCTACTTCATCGTGATCATCCTCCTGCCCTTCGGCTTCGTGTGCGGAATGTACTGGAGGATATTCAGTGAGGCGCGCGGAAATGGGCTGCGCATGCGCCAGAATGGCTCTTC 
GCCGCTGCTGCAGAGCGCCTTGAATCTGACGGCTGGCCAGCAGGCGGCTCAAGCCAACCAGTTCTCCAATAGCTTGTGTGTTCATCGGCATTCCATATCCTCGGCCAGTTCG 
CATGGCGGCAACAGCAGCTTGGGCTTGGGCGGTCTGCAAATGCAAATTGATCAGCGCCAACAGCCGAGGAGTTCGCCCAGTTGTCTCCGTCGGGACTCGGCCGCGAAAGTAC 
TTTTGCCCACCATCTCGGATGATGGTGGCTCAGATGCGGAAAGTGGAGCCGGAGTTCAACTGATGCCCGTACAGGAGCACTCACTGTCAGATAGAAATCAGAACATTATGCT 
TACTCTGCAAACAGCCAGTGGTGAftATAAAGAGGAACTACTCCGCCAGACAGCTGCCCCTGTTGGGCACTTCTTCGCAGGATCTGAGGGAGACTAATCGTCTGCAGGGCATA 
CGCCAGGTGCACAGTTCGCCCAATCTCCACAAGTACACGGAACTGCGCCAGGACTCGCTCAGTGAGGAGTGTGGTTCTCCGCATCTTTTGGGTCACGCCCAGCGGCAGCAGC 
AGCAGCAATTGCACCTGCAACATCAGCAGCAACAGCATCATCACCACCAGCAACATCATCCGCACTTCAGTTCGCCGCGCCACCAGCAACACGGCCACGCCCTCCAGATACC 
CGCCATTCATGCCTCCCCCAAGGCTCTGAGTTACATGAGCTCCTTGCGACACCGTTTGAGCAACGCCAGTTCGCTATTCAAATACCGTGAGGAGTCGAGAGCAGCCCGCATC 
AGCATCCTGGTGGTGGTGATGTTTGTGGTCTCGTACCTGCCCTTCGGACTGCTGGTACTCCTGCAATCGCGACTGTCGGCTGCTAACTTTGGCGGATCCTCCCAGCTGGCCA 
TCTTCATGATCCTGCTGGCCAATCTCAGTTCGCCATTCATCTTCGCCTACAGGAACAAGAGGGTGAGGCGAGGAGTCAAGCGGCTGTTTGGTCTGGATTCGTCGTCGGGCCT 
GCAGCGCAACTGCAGCAGCAGTGTAAAGACCAATGGCACCGCAGGTCCTGCTGCAAGTGGTGCACAATTGCAGCGCAACAGCAGCAAGTTGTCGCAGTACAGCAGCAACAGC 
TGCAAGTATCTCACCCCCCAGAGTTCCCTGGTCAGTCAAGTGCCGGTGCATACGACCTTGACGTTAAGGCCCAACAGCAGCTGCAGCACCATCATCAATTTCGGTGGCGCCA 
GGGGCTCAGCGGACTCCGATGAGCAGCCACCGGCAACACCACCACCTACTGTGGCAGTTGCTCCACCGCCCACACGACCGCAGCGCCCCAAGCTGCAGAGGGGCATCACCAT 
CGTTGAGCACATAGCTATAACACCAACCATGCCCCAAAAGTTCCAGAATCGGCGCGCCAGGCTCTTCGACATGTTCTTCCGCAGCTCCAAGAAACTGCAGGCGGGCTGCCAG 
AGCCAGAGTTTGCCCACTGAGGTCTAAACGGGGGGTTATGATAACTATACATATAGGAAGGTAAACAAACACTAAACTTGCTTGCTGAAGAATTCTAAGAACAGTATTCGTA 
ATTTGTAAGTCTAGCAATTAACAAAGACAACTGTTTGAATTTGAGTTTTAAGACCAGATCATGCTGTCCTAGAAGCTTCTCATGCCAAAAAACAAAACCACAAAGAAAGTCA 
AACTAGAAAGTTTGAGATGCTTTAAAT 
(SEQ ID NO: 50) 



MA1DLLILALLLVSFLINLLALCAFWITPGLRTTANRFTINLLAINLIGCCILAPTLFLGLPGKSAEASTSNAETLEFFSKPGNHQVILRRNGQLVEQDGVVVRRNISENGD 

TVETFFKCNATYCRELTIDERGDGGFVITETETHEENLSAFESLPTEAPILPPVQLRCWSIDMTAALGALAVLLWGDTWCAVTDPLRYHSRISGVKTWIFIALTWVVGILF 

GALSAFRVLDFEADALFSRQRRLAVTYFNISSTNSIFGVVYASVYFIVIILLPFGFVCGMYWRIFSEARGNGliRMRQNGSSPLLQSALNLTAGQQAAQANQFSNSLCVHRHS 

ISSASSHGGNSSLGLGGLQMQIDQRQQPRSSPSCLRRDSAAKVLLPTISDDGGSDAESGAGVQLMPVQEHSLSDRNQNIMLTLQTASGEIKRNYSARQLPLLGTSSQDLRET 

NRLQGIRQVHSSPNLHKYTELRQDSLSEECGSPHLLGHAQRQQQQQLHLQHQQQQHHHHQQHHPHFSSPRHQQHGHALQIPAIHASPKALSYMSSLRHRLSNASSLFKYREE 

SRAARISILVVVMFVVSYLPFGLLVLLQSRLSAANFGGSSQLAIFMILLANLSSPFIFAYRNKRVRRGVKRLFGLDSSSGLQRNCSSSVKTNGTAGPAASGAQLQRNSSKLS 

QYSSNSCKYLTPQSSLVSQVPVHTTLTLRPNSSCSTIINFGGARGSADSDEQPPATPPPTVAVAPPPTRPQRPKLQRG1TIVEHIAITPTMPQKFQNRRARLFDMFFRSSKK 

LQAGCQSQSLPTEV* 

(SEQ ID NO: 51) 

Name: G-protein coupled receptor-like 
Classification: G_protein_linked_receptor 
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Celera Sequence No. : 142000013384666 

ATTCTATTGTCGTAGGCGTGGCAAAAGGTTTTTTGGTAAATCGATAGAAATGTACAAGAGAATGTAATTAATATGAAAAGGTAACAAAAAATTTTGCAAAAGTGTGGGCATG 

GCAGTTTTGGTCGGTTTGTTGGCGTAAGGGCATTATCGGTATTTTCCTATTTTCBGCTATATGATGCAGTCGCCTAATTTTGATCAAACTATAACTAAAATTGTATATTGTC 

AGAAACAACGAGCATGAATTTATAGTCAATTTATCTTTTTATTATTTCTTATCTAAAGTTATATGCATATAGAATAAAAACAGATACGAGCTTCCTTAGAAGCGGACGGAAA 

TAATGTCTCGAGATCTGGACCTTCATTAAGACAGACGAACATTAGGACAAGGCCAGGTCTACTCAGCTAGGGATCCTGATCAATGTATATGTATTTACTTTAAAGGGGCGTA 

AACTTGTATTTTACATGTTACATACTTTCAAGCGAATCTAGTAAACCCTTTTAATTTACAAGTAACGAATATAAAAAGTTGAATGGTAGCGGGAGAAAATTAATTGTATAAT 

ATAATATTATCTTTGTTAATCCAGAGTCTTGGCAACTTGTGCACACAATTTTTTTTTTTTTTTTGGAAACTGAATCAGTTAATTGTGCCAAAATTTAGAGTGTAGATTTCTT 

GGAAATCTCAATGAATTTTATGGTCAAGTTCCAAGAAGTGAAATGCTTAACAAACAACAAAAACAAAACACACAAAAGCAAACAAATTATGCAGAATTATCGGATGTATTTT 

ATATGTATGTATGTACATACATATACAAAGAATAATAAATGGGTGGGTCTGTTGGCTTCTATGGAATTTCCAATTTAAGCTGCAATACATGTACATTTTAACGTCTTTCTGT 

ACGCTCTCAAATAAAAAAACACAATTTTAGGATTAATAAAAAAACAGATATGTACGTCTTCTATTCCCATATGAAGAAAAATATTCATATGATTTTTTGTTTGGCAGGAAAT 

GCGAATAGTTATTGGATCGTTCACCGCATTTCTTTTGCTGTTATTGCAAAACTCAAATGCCGAAATTCCCGGTTGCGACTTCTTCGACACCGTAGATATTTCAAAAGCGCCA 

AGATTCTCGAACGGATCGTACCTCTACGAAGGCTTGCTGATCCCCGCCCATTTGACAGCTGAATATGACTACAAGCTCCTGGCCGACGATTCGAAGGAGAAGGTGGCGAGCC 

ACGTACGAGGATGTGCCTGCCACCTCAGGCCATGCATTCGGTTTTGTTGCCCCCAGTACCAAAAGATGCAAAAGAGCAAGTGCTACGGCGACATGTCGGAGGACGAGCTGAA 

CAAGCACGATCCCTTCGTGAACGTGACGCTCAGCGACGGGTCGGTGGTCAGGAGACACTTCAAGGAGGATCTGATCGTGCAGTCGGATCTGGCCAAGCCCGGATGTCCCCGG 

ATGTACTTTCTGAATCACGAACTGCCGGGCAATGAATTCACTCTGTTTGAGGTTACTTTCCTGTGTTTCTGTCACTTCTGAAGATCGTAACCTTTTTTGAATCTCTGTAGAA 

CGGCTCTCTTCTGCGTCACTGGGACAAAGTGGAACTGAGCAAGCCGGGAGTACTGCGTCCAGCATCTTTCATTTAAGGATGATAGTATCCGAATTGCACCCCACTTCTGTCC 

ACTTTCGTCTGAGCACTCCAGAACGTGGAAGACCGTTGGTAAGCTGAATTCCCTGCTTGACTTTAAGGTTTTTAGTTAATGTCCATTTTCACAGCGATAGTGATATCCTTGA 

TATGCATAATCCTAACGATCAGCGTGTACCTCTATGTTGAGAAGCTTCGCAACCTGCACGGAAAGTGTTTCATCTGCTACTTGGCCTCCTTGTTCCTGGGATATTTCTTCCT 

GGTCCTCAACGTGTGGAAATACTCCTCCGGCTTCTGCGTTACAGCAGGTTTGCTTGTGGCGATTACAAGATAAAGGACGACCAATTTCACATTTCGATATCATGATCAGGGT 

TCCTGGGCTACTTCTCCGTCATCGCCGCGTTCTTTTGGCTTTCTGTCATCAGTCTTACCCTCTGGAATTCATTTAGCGGCAACTCCAGCTGGTTGAACCGTTTCTTGCCACA 

GAATCGGTTCCTTTCTTACAATTTATACGCCTGGGGCATGGCGCTGCTCCTGACTGCAATCACCTATATAGCTGACCAGGTGGTTAAGAACGAGAAGTTGAGACCCCGCGTG 

GGCGTCGGCAAAAATTGTTGGATCTACAGTAAGTTACTATTCCTATAATAACAATTGCGGTGTAACTCGTTTTTTTTTTAACCAACAGCTGGGGATATGACTGTCATGATCT 

ACTTCTATGGCCCGATGCTACTGATAATTGTTTTCAACATAACAATGTTTGTCCTGACGGCTTTTCGTATAATGAAAGTGAAGAAGGAAGCACAAAACTTTACTCAACAGCA 

AAAAACAACCAATAGGCTCAATTCGGACAAGCAGACGTAATAATTTTACCATATTTTATACAGTTCTAAATCCAAAAGTTTAAATTTTAAATTATTTTCTATGGCTTTTAAA 

GTTACGCCCTATTCCTGCGACTCTTTATCATCATGGGCTTGTCCTGGAGCTTGGAAATAATCTCGTTTTTGTTGAGCAAAAATCAGGCTTGGGCCAAGGCTTTTATGGTGGC 

TGATTACTTCAATTGGTCACAGGGCACCGTCATATTTCTGCTATTTGTTCTGAGGCCCAGCACACTGAAGCTACTGAAGGAACGGTAAGTTATCGGTTCTTCCATAGGAGGT 

CCAATTAACCTGGTTCTATCATCTGGCTGAGCAGGATTAAGGGTGGGAGGGATGAAGCCGGCGCCAGCGATGAACATATCTCACTTCAGAATACGAAAATTGACCCCAGTGT 

TTTTTAATATATGGGTAATGTGAAAATACAGTTTATTAGTTCATTTGGTGTTGGAAACCTTCTATTTTGAATCACTAACAGGCAGCCTAATGATCGTGACCAAAGGCTGCTT 

TGCTTGCTCATTATTGCTCACATATCGTATGCTCATATGGGCTACGAATATTTGCTACTTATCACTTTTCAAGTATCTTGATTTCTGCACGGTGTCTAAGCATATCTAAGTT 

TTAATTAAATGTTGAATGTAATTGTCCTTTAAGTTATAACTTTTTGCAATAATAATCTATTCCAATCCATCCTAGACGTGCCATCAAAGCGAGATCAAAGCAACTACTACAT 

GGGGATATGTACGAATCTTATAGGCATGAATACTCGGCATTTATATACCCTGGTCTATTATTATGGATGGAGGATAAAAACTCAGGAAAATACTTTTCGCATCAGCACTTAT 

GTATGCTCACTCGTGATAATAATTTCGTAATAATTCGTACTGTTTTTACTGCCTCAACTTAATTCAAAGAACTACAAATGATTATATTATGACTTTTTATGACAATGACTAA 

CTAAAAATAAAAATTTCTGAATATTTATAAACAACTTTAAAATGTTATTAACTTAGGATAATAATAATAACACAGTTTTAATTTATCAAGCACATCGTTAAAAAACAATAAA 

TTACTTGAATATAAATTTAAAAAAAAAATGGAATATTGGTTTACATTAACGGTATGCAAAAAATCATAAAAAGTCATAGAACTGAACGATAGAGAAAATTACAAAAAAACGA 

TCTAGTCGAGTTCCTCGACTATAAGATACTCATTTCTCAGCTAGTTCTTTCATTTAAAAATTTCTCTTTCGTTTCGATAGTAATTGGCAAGAAAAAATAATAAAATACAACA 

AATTAATGCATTTTTGAAAATTATGGGCGTGACTGATTTGGTCGGTTAGTGGACGTGTCAATATTATGAAGCAAACGTGGACACTACAAACTGAAATATATATATATATAT 

(SEQ ID NO: 52) 

Exon: 1001.. 1507 

Exon: 1567.. 1703 

Exon: 1756.. 1951 

Exon: 2014.. 2268 

Exon: 2329.. 2500 

Exon: 257S..2772 

Exon: 2835.. 2919 

Start ATG: 1007 

Transcript No. : CT20339 

CAGGAAATGCGAATAGTTATTGGATCGTTCACCGCATTTCTTTTGCTGTTATTGCAAAACTCAAATGCCGAAATTCCCGGTTGCGACTTCTTCGACACCGTAGATATTTCAA 

AAGCGCCAAGATTCTCGAACGGATCGTACCTCTACGAAGGCTTGCTGATCCCCGCCCATTTGACAGCTGAATATGACTACAAGCTCCTGGCCGACGATTCGAAGGAGAAGGT 

GGCGAGCCACGTACGAGGATGTGCCTGCCACCTCAGGCCATGCATTCGGTTTTGTTGCCCCCAGTACCAAAAGATGCAAAAGAGCAAGTGCTACGGCGACATGTCGGAGGAC 

GAGCTGAACAAGCACGATCCCTTCGTGAACGTGACGCTCAGCGACGGGTCGGTGGTCAGGAGACACTTCAAGGAGGATCTGATCGTGCAGTCGGATCTGGCCAAGCCCGGAT 

GTCCCCGGATGTACTTTCTGAATCACGAACTGCCGGGCAATGAATTCACTCTGTTTGAGAACGGCTCTCTTCTGCGTCACTGGGACAAAGTGGAACTGAGCAAGCCGGGAGT 

ACTGCGTCCAGCATCTTTCATTTAAGGATGATAGTATCCGAATTGCACCCCACTTCTGTCCACTTTCGTCTGAGCACTCCAGAATTAATGTCCATTTTCACAGCGATAGTGA 

TATCCTTGATATGCATAATCCTAACGATCAGCGTGTACCTCTATGTTGAGAAGCTTCGCAACCTGCACGGAAAGTGTTTCATCTGCTACTTGGCCTCCTTGTTCCTGGGATA 

TTTCTTCCTGGTCCTCAACGTGTGGAAATACTCCTCCGGCTTCTGCGTTACAGCAGGGTTCCTGGGCTACTTCTCCGTCATCGCCGCGTTCTTTTGGCTTTCTGTCATCAGT 

CTTACCCTCTGGAATTCATTTAGCGGCAACTCCAGCTGGTTGAACCGTTTCTTGCCACAGAATCGGTTCCTTTCTTACAATTTATACGCCTGGGGCATGGCGCTGCTCCTGA 

CTGCAATCACCTATATAGCTGACCAGGTGGTTAAGAACGAGAAGTTGAGACCCCGCGTGGGCGTCGGCAAAAATTGTTGGATCTACACTGGGGATATGACTGTCATGATCTA 

CTTCTATGGCCCGATGCTACTGATAATTGTTTTCAACATAACAATGTTTGTCCTGACGGCTTTTCGTATAATGAAAGTGAAGAAGGAAGCACAAAACTTTACTCAACAGCAA 

AAAACAACCAATAGGCTCAATTCGGACAAGCAGACTTACGCCCTATTCCTGCGACTCTTTATCATCATGGGCTTGTCCTGGAGCTTGGAAATAATCTCGTTTTTGTTGAGCA 

AAAATCAGGCTTGGGCCAAGGCTTTTATGGTGGCTGATTACTTCAATTGGTCACAGGGCACCGTCATATTTCTGCTATTTGTTCTGAGGCCCAGCACACTGAAGCTACTGAA 

GGAACGGATTAAGGGTGGGAGGGATGAAGCCGGCGCCAGCGATGAACATATCTCACTTCAGAATACGAAAATTGACCCCAGTGTTTTTTAA 

(SEQ ID NO: 53) . . . . . . 



MRIVIGSFTAFLLLLLQNSNAEIPGCDFFDTVDISKAPRFSNGSYLYEGLLIPAHLTAEYDYKLLADDSKEKVASHVRGCACHLRPCIRFCCPQYQKMQKSKCYGDMSEDEL 
NKHDPFVNVTLSDGSVVRRHFKEDLI VQSDLAKPGCPRMYFLNHELPGNEFTLFENGSLLRHWDKVELSKPGVLRPASFI* 
(SEQ ID NO: 54) 

Name: rath-like 3 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384666 



Start ATG: 7 
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AATTAATGCATTTTTGAAAATTATGGGCGTGACTGATTTGGTCGGTTAGTGGACGTGTCAATATTATGAAGCAAACGTGGACACTACAAACTGAAATATATATATATATATC 
ATTTCATTGCCCTTAAGTCTGAAAAACGGCTTTAAAATTGAGAAGTTACTTTAAGAGATATTCGAGGTATTCGGTGACAGCAGCGCACATTATCGACTGGCCAATGGGAATA 
TTAGATTTGTAGAATAGGGTAATATGTAAACGTAAACCTTTCGATTGTATAAAGCATTCAATTCCTAATATGGATATGTCCGTTCACCAGCCTGTATTAACTTTGCGTGTTA 
TTACTTAAATTTATCTTTCTATGTACATGCCAGAAACTGATAGAATGGCACGGTTACCGCCGATAGTCGCTGAGTATGGGGTATCTTATAGTCGAGCACATTCTCGTTCTCC 
CTTGACATCGCATATATCGGTTGAGTTGTTCGCATTTTGGGATCAGTCATCTTAAAACTCTGGTCGCATAAACATGTTTTTTGTGTTAGTCGTTCCATTTGACCCTTGCGTA 
AGTATTTATATTCAAAGTTTTTACGATTTTGGATCGTTAAGCATAAGAAGGACCACTCGAGTACGATTTGACCGTTTTTATACAATTTCGTCCTCTTAGACGAATAAATATT 
GTGCCTGCCTTGTTTCTAAAAATGAACTTGCTTCAGCTTCGTAGAGAATGCTTAGTCATAACAGATGGATTTGTTGAATTTATTTTTAACAGTTGATAAGTTATACTTTTAT 
AAAAAAAAAAAAACAACTTACTCAGAATGCTATAGCACATGACTCAATTTTTTGTTTTTCATAAGTATTTAGATCAGTTTTCAGTTTTGCGGAGGTTATTTTGCTACTTTAT 
TTCTAATTGTGAACACGTAACTTGGTTTAAAACCGTATATACTTACAGAATAGCATACATCATTTGAATTCAGTTGCCTATCATATTAATTTCCTTTGCAGAAAATGCGGAT 
ATTGTTAATTGCAGTTCTTTTTCTGTTGATGCCAAAATCAAATGCCGAAATTCCCGGTTGCGACTTCTTCGACACCGTAGATATTTCAAAAGCGCCAAGATTCTCGAACGGA 
TCGTACCTCTACGAAGGCTTGCTGATCCCCGCCCATTTGACAGCTGAATATGACTACAAGCTCCTGGCCGACGATTCGAAGGAGAAGGTGGCGAGCCACGTACGAGGATGTG 
CCTGCCACCTCAGGCCATGCATCCGGTTTTGTTGCCCCCAGTACCAAAAGATGCAAAAGAGCAAGTGCTATGGCGACATGTCGGAGGACGAGCTGAACAAGCACGATCCCTT 
CGTGAACGTGACGCTCAGCGACGGGTCGGTGGTCAGGAGACACTTCAAGGAGGATCTGATCGTGCAGTCGGATCTGGCCAAGCCCGGATGTCCCCGGATGTACTTTTTGAAT 
CACGAACTGCCGGGCAATGAATTCACTCTGTTTGAGGTTACTTTCCTGTGTTTCTGTCACTTCTGAAGATCGTAACCTTTTTTGAATCTCTGTAGAACGGCTCTCTTCTGCG 
TCACTGGGACAAAGTGGAACTGAGCAAGCGGGAGTACTGCGTCCAGCATCTTTCATTTAAGGATGATAGTATCCGAATTGCACCCCACTTCTGTCCACTTTCGTCTGAGCAC 
TCCAGAACGTGGAAGACCGTTGGTAAGCTGAATTCCCTGCTTGACTTTAAGGTTTTTAGTTAATGTCCTTTCTCACAGCGATAGTGATATCCTTGATATGCATAATCCTAAP 
GATCAGCGTGTACCTCTATGTCGAGAAGCTTCGCAACCTGCACGGAAAGTGTTTCATCTGCTACTTGGCCTCTTTGTTCTTGGGCTATTTCTTCCTGGTCCTCAACGTGTGG 
AAATACTCCTCCGGCTTCTGCGTTACAGCAGGTTTGCTTGTGGCGATTACAAGATAAAGGACGACTAATTTCACATTTCGATATCATGATCAGGGTTCCTGGGCTACTTCTC 
CGTCATGGCCGCGTTCTTTTGGCTTTCTGTGATTGGAATCCACTTGCGGATTAAGTTTAGTCTAGCCTCTAACTGTTTGCACCGATTGCTGCCAGAAAATCCGTTCCGTGCA 
TACAACTTATACGCCTGGGGTATACCGCTAATCATGACTGCAATCACCTATACAGCTGACCAGGTGGTTAAGAACGAGAAGTTGAGACCCCGCGTGGGCGTCGGCAAAAATT 
GTTGGATCTACAGTAAGTTACTATTCCTATAATTACAATTACGGTGTAACTCGTTTTGTTAACCAACAGCTGGCGATATGACTGTCATGATCTACTTCTATGGCCCGATGCT 
GTTGCTAATCGCTTTCAACATAATAATGTTTGTCCTTTCGGCGATTTACATATATAACATAAAGAAAAATGTGAAAGGCCTTGTCCACAAGCAACAAACAAACCAACAGATA 
AATGACCAACAAATGTAATTATATTATATTATAAGATAATAACTATAATGTCAAATGGCACTTTTTGTAATCTTTTATAGGTTTGCTATATTCCTGCGACTTTTCATCTTAA 
TGGGTTTGTCGTGGAGCTTTGAGATATTATCCTTTTTGTTAACCAAACAGCAAGCTTGGGCTAGGGCTTTAATGGTGGCTGACTACTTTAATTGGTCCCAGGGTACCATCAT 
ATTCGTGCTTTTTATTTTAAAGCCTAGCATTCTAAAACTTATAATAGCAGGGTAAGTAAAGTATTACAAATAACCTTGAAATTTGACCGTCTCATACGTATGAACAATCTTT 
TGATTTAGGGGACGTCAAAACCTCCCAGGAAGTCACCATAATTCGAGATCAAAAGCAGCTCGATATAATTCAACTCATACGGCTTGTGAAGGATCAATTGCGGATCCAAACG 
CTTACTGCTAAAAAAATGGAATTCAATAAGATGCAACGTCTCTGTTATTGATCACATTATTTTAATTTTCTATGGAGCAAAAGTTTAAAAAAAATATTTGTTATCAAGACAA 
TAAATAATGTCTTGGAACATATAAAGCAAAATATATTAAAGGGTACAAAAAACAGGAAAAGTATGGGGCAATACAGCAAAATATTTTTCGGATCATCAACTTTTTTGTAAGA 
CAAAATGTAGAAATATTTCTGGCACCGTTGTTTCAAACCAGGGTTTGAGTCAACAATGCTGGCCCACCACTGGCAAGTACCAAACGATTACAGAAAACTTTTTCTAATCGAT 
TAAGCATCGGGGGTCATCGATGTTTCTCCACCACTACAATTTCATGTTGTTTTTTTAGTCGCGGCTGTTTGACGTTTGTCGCTTTCCTCCGTTGCGAATATAATATATTACG 
TAGCTCATTTTTATACAAACGGAATTACGAGCGCAACGACGACAGCAACACTAGTAGCACTAATCGTAAGCGCAGGGGCCAAAAATTAAATTGCGTTTGCGGCCGCAAAGAT 
TTGATGACGTCGCATACGCCGTCTTCTAGGGCGTAAAAAGCAAAGCAAAGCAAACAAACGCGAAAGCGAAACGTGTAAACGGCGTAGAAGCGATAAACGCGACTCAAATACG 
CAGCAGATAAAATACAATACGCGAGAAGAGAAAAGTCACGGGAAATATTGTTCATA 
(SEQ ID NO: 55) 

Exon: 1001.. 1492 
Exon: 1552.. 1702 
Exon: 1759.. 1935 
Exon: 1998.. 2252 
Exon: 2627.. 2640 
Start ATG: 1001 

Transcript No. : CT20351 

ATGCGGATATTGTTAATTGCAGTTCTTTTTCTGTTGATGCCAAAATCAAATGCCGAAATTCCCGGTTGCGACTTCTTCGACACCGTAGATATTTCAAAAGCGCCAAGATTCT 

CGAACGGATCGTACCTCTACGAAGGCTTGCTGATCCCCGCCCATTTGACAGCTGAATATGACTACAAGCTCCTGGCCGACGATTCGAAGGAGAAGGTGGCGAGCCACGTACG 

AGGATGTGCCTGCCACCTCAGGCCATGCATCCGGTTTTGTTGCCCCCAGTACCAAAAGATGCAAAAGAGCAAGTGCTATGGCGACATGTCGGAGGACGAGCTGAACAAGCAC 

GATCCCTTCGTGAACGTGACGCTCAGCGACGGGTCGGTGGTCAGGAGACACTTCAAGGAGGATCTGATCGTGCAGTCGGATCTGGCCAAGCCCGGATGTCCCCGGATGTACT 

TTTTGAATCACGAACTGCCGGGCAATGAATTCACTCTGTTTGAGAACGGCTCTCTTCTGCGTCACTGGGACAAAGTGGAACTGAGCAAGCGGGAGTACTGCGTCCAGCATCT 

TTCATTTAAGGATGATAGTATCCGAATTGCACCCCACTTCTGTCCACTTTCGTCTGAGCACTCCAGAACGTGGAAGACCGTTGCGATAGTGATATCCTTGATATGCATAATC 

CTAACGATCAGCGTGTACCTCTATGTCGAGAAGCTTCGCAACCTGCACGGAAAGTGTTTCATCTGCTACTTGGCCTCTTTGTTCTTGGGCTATTTCTTCCTGGTCCTCAACG 

TGTGGAAATACTCCTCCGGCTTCTGCGTTACAGCAGGGTTCCTGGGCTACTTCTCCGTCATGGCCGCGTTCTTTTGGCTTTCTGTGATTGGAATCCACTTGCGGATTAAGTT 

TAGTCTAGCCTCTAACTGTTTGCACCGATTGCTGCCAGAAAATCCGTTCCGTGCATACAACTTATACGCCTGGGGTATACCGCTAATCATGACTGCAATCACCTATACAGCT 

GACCAGGTGGTTAAGAACGAGAAGTTGAGACCCCGCGTGGGCGTCGGCAAAAATTGTTGGATCTACACAAGCTTGGGCTAG 

(SEQ ID NO: 56) 



MRILLIAVLFLLMPKSNAEIPGCDFFDTVDISKAPRFSNGSYLYEGLLIPAHLTAEYDYKLLADDSKEKVASHVRGCACHLRPCIRFCCPQYQKMQKSKCYGDMSEDELNKH 
DPFVNVTLSDGSVVRRHFKEDLIVQSDLAKPGCPRMYFLNHELPGNEFTLFENGSLLRHWDKVELSKREYCVQHLSFKDDSIRIAPHFCPLSSEHSRTWKTVAIVISLICII 
LTISVYLYVEKLRNLHGKCFICYLASLFLGYFFLVLNVWKYSSGFCVTAGFLGYFSVMAAFFWLSVIGIHLRIKFSLASNCLHRLLPENPFRAYNLYAWGIPLIMTAITYTA 
DQVVKNEKLRPRVGVGKNCWI YTSLG* 
(SEQ ID NO: 57) 

Name: mth-like 4 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384544 

GATTGCCGTGGTCGTGCGCCGCGGGAGAAATGCCCTTTTCGTGTACGCAATTACTAGGCAAAAAGGTTTGGACGGCCCTTGCTTTATTAGGCCATAATTTACGCTGTCGAGC 
GGCCATTAGAGCGGCATCCGCTGGCCAAGTTGGTCTATAAAAGCCCGGCGGCTGCAAGTGGCGTTTTCCATTCCACGTGAAAGGACACTCGCAGTCGGGCCGACGAGATGCT 
GCGCCACCTGCTCCGCCACGAGAACAACAAGGTCTTCGTCCTGATCCTGCTCTACTGCGTCCTGGTCAGCATTCTGAAACTCTGCACGGCACAGCCGGATAGCTCTGTGGCC 
GCCACGGATAATGGTATTTGATAAGGCATTTGATAATATCATATTATTCGCTCTAATGGATTCCATTTACCATCCAGATATTACGCATCTTGGCGACGATTGTCAGGTGACG 
CCCGTCATCCATGTGCTCCAGTATCCTGGATGTGTGCCCAAGCCGATTCCCTCGTTCGCCTGCGTGGGTCGCTGTGCCAGTTATATCCAGGTGGGTGTTGATATACTTGAAA 
TATAAATTATAATTTGATATAATCTGATGACTTTAGGTTTCGGGCAGTAAGATCTGGCAAATGGAGCGTTCCTGCATGTGCTGCCAGGAGTCTGGTGAGCGGGAGGCAGCCG 
TCTCGCTATTCTGTCCCAAAGTGAAGCCCGGCGAGCGTAAATTCAAGAAGGTCCTGACCAAGGCGCCATTGGAGTGCATGTGTCGGCCATGCACTTCCATTGAGGAGTCTGG 
CATCATACCACAGGAAATTGCCGGCTATTCGGACGAGGGTCCACTCAACAATCACTTCCGGCGCATTGCTCTGCAATAGATTCCCCCATCAGTTTAGCACTCATACCCATGC 
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CCATTTGCTCATTAAAATAAATTAGAGTTGCCGTTTCTGCTTGTACACAGTGAAAAATGTGCCTCTGAAAGCTCATGCCAGTGAAAGGGAAACATCAGCAGATATATGATTA 

TTTTAAATTGTAATTTATTTCCCAACTATCTCATGTCATTGCTACTAGTTCACTAAATGGTGTGTGTCCTGCTTCACAGGGTATGTATGTAGATGCGTAACTCCAAAGAAAA 

CAAAGATAATCTTTACACTGCAACTGGATCTAACAATAGGAGTTTAACTACGGCATAATGCATATGACTCGCGCCGCCTACAAGTACTCGACGATATCGCATTTGATGGATA 

TATTCGGACAGCTCTGGCCCACGGCACTGCGGCCCCGATAGTTGGGACTCTTCTCGGTGAGGAGTTTCCGGAAGATGGGCAAAATTGCCGTCTCGCCCTCGAGCAGTTCCGT 

CTCGCTGCTGGCGCCGTAGATGCTGCTATCCTGCTTGCCGCGCTGCCTGCTCCGCTGCTTATCGGGCGCCACATTGGGCGAACTGTGGTGGTGGGCGGCGTGCTGGATGTTA 

GAGCTCCGGCTGACCTTACGCGGCTGGACGCACGGCTGACTGTTGGGCGGCGGATCTCGTTCCCGTTCGCTGTCCGTCTCGCTCTGGCAAACGGTGCCCTTTTGCGGTTTTG 

GCCGATGTTTGTTGCGGTTCTTCTCGGAGACGGACCGCTGGATGGGCGAGGTGGTGTTCGGGTCATAGGTCATCCCGTGGCCGCCATGATGTTCCGCACAGCGCTGCGGACT 

GGCGTAGTTCCGCGGCGACTGGCGCTGCTTGGCGTGCGACAGCGTGTGCGTGTCCCAGAGGTCACTCTGTGAAACCACGAAATTGGACTTCAGAGCGGAGCCCATGCTGGTG 

CGGCGATGGTCGGAGGCAGTGGGAGCTATTTTCGATTAGAAAATCGATAGTATTATAGTATTATAAGAGACGCGCAGAAAACTTACTCTTAATCACAGCCGGAGTACTGGGC 

TTTGAGTGTGGACACATGTTGCTCAACTCCGATTGCGATGTGGAGATGTTGGTGGAGGAGTTCCGGTGCGAGCTGCTGTACTTTGGCCGGGCGGTGGGCGTCTGGCTGCCAC 

TGTCGTAGTGCCTGGAGGCGGTCGATTGACGCCTCTCCCGCATGCTGGAGCAGTCTCCTCCGCTCCGCTTGCCCGCCGTACTGGTGATGCTGCCTATATTATCATCGGTCTG 

AGCGGACACCGAGTTTCTCTTCTCCAGATGGCGATGATGATGCTGCTGCTGCTGCTGTTGCTGTTGCATCTGCTGGTGTTGCTGCTGCTGCAGGTGCTGCTGCTGCATCTGC 

TGATGTTGCTGTAAGTGCTGCTGCAGTTGCTGCTGGATGGGCTGCATCATAGTTGGAGCAGGGGCATGCGAGTGGTGCTCTACAATCGATTGCTTTTTCTTTTTGCTGTTGG 

AGAATATCCATAGATTAGAGTGTATTTAAGACCAAAATTTAAGAACTTACGGAGGAAGCGATGGCGCCATCACTGTAGACTGCACCGACACAAAGGAGTTATCCATCTCCAC 

ATGAGTCGAGGTCACACTATCCACGGACGTCATCTCGGTAACTTCTATGACAAACGGGTTGCAATGCATGAACATATCTACGAAATCCCCCTCCTTAGCCACTCACCTGTGG 

TGCTGGGCATCTCTCGACGCACCGGTGGAGCTGAGAGCCTAACAATGTCATCGTTCAGTATGGGCTCCAGTTCGCTACCGACGTTGGAGCCACCTGTGCAGGTCACCCCATC 

GATCCATTTGCGTGCCTCGGGTCCCAGCTTGCGGATTAAGGCCTGCAGCTCGTTCTCCTTCTCCATCAGCGCCTTTCGGAATCGGCAGTTTGTGTTCTTTACATCTCGCAAT 

CGTTGCTCCAGTTCGCACACCGAGGAATCCCGGCGTCCGTTTTTGGACATGGGTCTCAACGTGGCCCTAACGCGTTTGTCCACCACGCCCTGGGGATTCCGCTTCAGCTCCA 

CCAACTGCAAAACGATTCGCATGAATATTAACCAAAGATTATACTACGCGGTCAACATCTAGGAAGGATCAACTCTACGGACAGTAATATAGGATAGTTAAAGATATTCCAA 

CAGCAAGGAAGTGGGATTTGCCTACAACCTTGCTGCTGCTCTTAGTGGGTTCTATATAGTTCAAGTTTATAGTTAGTCTGTAGTGTTATATAGCATTTTAAGATCTGTTTTC 

GAGTACATGTTGTTTCTGTATCAAGGTGTATCAACGCTAGACGAAAGATCAGCTCAGCTGGCGGTAAAGTACGATCAGTTAAGAAGCTCTAACCACAACTGATCGGCCTATT 

ACCGCCGACAAACAACATTCATATTACACGATTATTATATGTAACATATAACAGCTACAAAAACTACAAAACTTCCTTACTACCAGTCTTACTTTCGGTACGAACACCAAAC 

ACAAAGTGGCTGTCGTACAAAAAATGATAAAAAACGAGAGTAAGACAAAAACTAAATCCTTTCGATCCGATAGCACCAGGGATATAGCCGCTCCGGCCAGACAAGTGATGAA 

CACGTTATAAACGGAGAAACCAATATGCTTGGAATCGTTCAGAGCCGGTATAGAAACATGTCGAGTTTCCCAGGCCAAAAAGGCGCCAAAAACCTATTGAAAAAAATTATTG 

CTTTATTATAATGATATCTTACAAATTAATGTTTATTATATACTCACCAACAACAGTCCCTTGTAGGCATAAATAATGCTAACGAATATGGTCATGTGCTCAGACTGGCAGT 

ACTCGTTTTCGGGGATCACCAAGACATCATCAATATTCTCGTGATGCTGTAGTTAATAATAATCATTTTAGATTTTTCACTTCAAATATGTTCTATCAATGCACTCACCAAG 

GGTTCCAACTGTTTAGTTTCGCGGTAAAAGGGATCGGCAATCTGCCAGGTGGTTATAATGGCTATATCAATGGCCAAAAGCACGCCCACAACCATAAACAATTGATAGTCCT 

TGATCACCTTCTTATTGAGCTTCAGATCGGTGAATATCGAATGCACCCGCCACGTCTTCGAGAACATGGCTCCAAAACTGAGACTGAATCCAGCCATCAAGATCCAGGCTCG 

AGCTGTGCAGATATAGGGAAAAGCTGCCACACTACTTAATGTGGTATCGAGACCCAGGAAAATGATGCTCAAATAGGTCATCATACAGCCCACAATGATCAGATTGTTCAAA 

TGGGGACTGGACATCTTGATGTATCTATAAAAACATCAATATTTTATTAAACGAAATTGTTTAATATAGCATTGTTAAGCTCACCTTTGATTGCGATACTTAATGTTAAAGG 

CCAGAAAAACTGTGGCAATAATCACTCCAATGACCGAAGCACTAGCCGATACAATATATATGGTTGGATTGACCTGACTGTGCTCGATGTAGATCAAAGTGCGATCCTTGGG 

AGGAGTTTTCCCCACCCATTTGACTGGTTTTCCCAAGCTTAAATCCAAGTGTGACTTCTGTGAGTGGTATTCCCCGATCTTTTCCATTTGTCCCAGCTGAAACTGATTGATC 

AGGATGTTGGCCTTGCGCTCGTTGTTGTAGAAACGCACGGGTCCCTTGAGAATGTAAAATATGTTTTAACTGCAAAAACTGGATTGTTTATCTATCTACCGTCACACCCTCG 

AAGGATGTATTACGTAGAGCCTCAAGGAAGACACTCTCCCAGTCCTTCACGCGATAATCAAAATGTGTTAGCAGATCCTCTCGCTTTTCGGCCACATACTGAATGGCCAGGG 

CAGCTGCCCAGATACCATCGTAGGTATAGCCATGAAAGCGGGAATATTCAGTGCCTCGCAGTCTGTCGTACTCCACAAGATACTCATCAGCAGTCTAGTAAAATATGAAATT 

TATTACAAGAATTTTAATAGACTTTACTGGTACTAACAATGCCAGCCACTGTGATGTCACCACTGGTGGACAAGGGCAAAAGATCCACTAGAATGGCACCTTCCAAGGCTGT 

AGCGATCTCCTCCACACTGCACTCGCTGTCCTGCGTGACATTCCACCAATCCGTGGAATAGGTAGCCATGATCAGCCATTGATAGGCTCTGCCATACATATCCAATCTGGAA 

GGGAGTTTATTGCAATAATTTTAAATTGAGACTTAAGTTAGTATATCCAACTCACTTATAAGCCTCACAGAATGCCTTGCGTGCAAAGTGCTCGTTAAAGTTGCCCAGAATG 

ATCCTCACGTCCTTCTCGCGCAGTTTCTTCAATGATTCAGCCACATCGTTGACGAAGCTCTGCGTTTCCACCACCTCGACCTCCATGGCATCCAGGTCAGCCACCATGTGAT 

TGTGGGGCAGCGAATAGCGTGGCTCATTCTGGTAGACAGTGCCCACTCTGGTCCAATTGAACTCCTTCAGCAAGGCCAGTCGCGGCGCATTAAAGGCATTCTCCGAGGGTAC 

CACGCGAAAGAAATTCGGAAACGCATCCTTGGTGAACATGGGATGGGTGTCCGCGTAGCTGAGCTGGGTGAGGTGCCAGTGCTTGCTGGCCTTGGCTATGGGATCGGTCACA 

TGGGTGCACGCAGCGCCGAAGAGCATCACTTTATTGGGACCCGAATGCATCATATCGAAGAAGGACTTTACGCCCACAGCAGCATTGCACTGGAAAGGAAACCAAAATTACA 

TTCCAGAATATATGGAAATATGCATAGGTAAAGGTGTGATATTACAATATGAACCAATGAACATTAAAAAAAATACCTAATTGAACTTTAAAAACCACAAGTCCATCATCAG 

AGATTCACTTAATGAAATCTTCAAGCTAAACCCTAACTCACCTGAGTGTCGTTCCACCACATGTGCAGCCTGTAGTTGGCCAGTATCTTTCCATGCTCATTAACGTGACCCA 

AGGCGAGCTTTACACTGGGCATAACGCCCCGACCGGTGTAGGAGTTTTCCACGCCATCCCCGTACGGGAAGAATCCCGCTATGTAGACGTCCGATCTCTTGGCTGTCCTGCC 

ACAGGCGGTGCTCCAAAGGAGCAGGAACAGCAGGCTGGCGAATGGAAACCAACTTGGCCGGAACATGTTGACCGCGTGGGCTGTAAAACGATTCTAAATGTGAATAGCGCAT 

ATCGAGGACGTGAACTTGGAGCCAAGTTGTGGGTGGTGAATGGCAGTGGTTGGTTTCGGGTGAGCCAAGGCTGCCGAGGCAAGGCTGCGCCCAGTGGGTGTGTATTATTTTC 

1ACGTTCTACCTGTGTTTGCTGGCACTCGCATGCAATTTTACAGGACAGGTGGAGCAAGGCTAAGGCCAGTCCAGGCCAGCAAAATTGCTTCAAATTGCTGCAGCAATGCTG 

GCACGTTTAGTAATTATAAAGCAATTTAATGATCTAAAGGATTCGTGCATTATTTAGTATGCTCGTTAACGCGAAGGAGTATCTTCCAGGTGTTGCATCCTCTTAAATCTTA 

TGTGTACATGTTTTTAGGAACCTACCAGCTTGCTTAGTCCCAGAACTGGCTTTTATTTCACTACCGATGCCATTTGAAACACTTATTTTACATCAAATCTCAGCCAACGAAG 

AACTTTTGCAGTGCGAGGGTCCGAAAAATTATGTATCCCCCTGACTTGACATTTGTCTGACAGGAGGAATTCGAGGCAGGAAGTGCGCATGTCCTTAAACCGAAAGCTGTCA 

CTGCTCCGCCTTGGCCTAGGCTTTCCAACCCCCGTAATTACCATAAACCCACTGCGCTTGTAAACATCCTGCAACAGGTGGTCCCATCAGCTGGTACTCCGGCAATTGTTGT 

GCCCGTTGCCATGGACTACTGGGGTCGAAAAACGAAGTAGATAACTGCTCGGAAAATTACCAAATTGAAAATTGTGAGTAACCATAATGGAGGACTATGCGCCCAGCCGCTA 

TAAAATGCTTGAGAAAATCGGCGAAGGCGTCCATGGCTGCGTGTTTAAGGCCATCGATCTGCAGCGCAACAAGGAGGTGGCCATCAAGAAGGTGGCCCTTAAGAACAAGTTC 

GGAAACATAGCCCTAAATACTTTGAGGGAAATCAAGACCCTGCAGCTTTGCAAGTCTGAATATGTAGGTATACCATATATCTATTTTCCATTTGTTAATTATGTAACTAATT 

GTAGATTCTGGACATCATTGATATATATCCTGACCTAACTGGTCTCTCCTTGGTGCTGGAATACCAGCCGGACACTCTGTACAACCGCCTCAAGAGCGAGGTTAATCCCCTG 

AGCCGCCAGCAGGTGCGTAAGTTTGCACACCAAATGTTCAAGGGGATTGCATATCTGCATGAAGCGGGTCTCATGCACCGGGATATTAAACCCGCCAATCTGCTTATCAGCG 

ATACAGATATGCTTAAGATTGCCGATTTCGGACTGGCCCGTCTGTACTTCCCTGAAGACGAGTCGCGCCTTTATTCGCCTCAGGTCTCAACCCGGTGGTATCGAGCGCCAGA 

GATTCTATTTGGCAGCCAGAAGTATGGAACCGGAGTGGATATGTGGGCGGCAGGTTAGCCATGACTCTTTGATGGGTTTACATATATCCTTATGGCTGACATTATTGCTAT 

(SEQ ID NO: 58) 
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AAAAGTTCTTCGTTGGCTGAGATTTGATGTAAAATAAGTGTTTCAAATGGCATCGGTAGTGAMTAflAAGCCAGTTCTGGGACTAAGCAAGCTGCCCACGCGGTCAACATGT 

TCCGGCCAAGTTGGTTTCCATTCGCCAGCCTGCTGTTCCTGCTCCTTTGGAGCACCGCCTGTGGCAGGACAGCCAAGAGATCGGACGTCTACATAGCGGGATTCTTCCCGTA 

CGGGGATGGCGTGGAAAACTCCTACACCGGTCGGGGCGTTATGCCCAGTGTAAAGCTCGCCTTGGGTCACGTTAATGAGCATGGAAAGATACTGGCCAACTACAGGCTGCAC 

ATGTGGTGGAACGACACTCAGTGCAATGCTGCTGTGGGCGTAAAGTCCTTCTTCGATATGATGCATTCGGGTCCCAATAAAGTGATGCTCTTCGGCGCTGCGTGCACCCATG 

TGACCGATCCCATAGCCAAGGCCAGCAAGCACTGGCACCTCACCCAGCTCAGCTACGCGGACACCCATCCCATGTTCACCAAGGATGCGTTTCCGAATTTCTTTCGCGTGGT 

ACCCTCGGAGAATGCCTTTAATGCGCCGCGACTGGCCTTGCTGAAGGAGTTCAATTGGACCAGAGTGGGCACTGTCTACCAGAATGAGCCACGCTATTCGCTGCCCCACAAT 

CACATGGTGGCTGACCTGGATGCCATGGAGGTCGAGGTGGTGGAAACGCAGAGCTTCGTCAACGATGTGGCTGAATCATTGAAGAAACTGCGCGAGAAGGACGTGAGGATCA 

TTCTGGGCAACTTTAACGAGCACTTTGCACGCAAGGCATTCTGTGAGGCTTATAAATTGGATATGTATGGCAGAGCCTATCAATGGCTGATCATGGCTACCTATTCCACGGA 

TTGGTGGAATGTCACGCAGGACAGCGAGTGCAGTGTGGAGGAGATCGCTACAGCCTTGGAAGGTGCCATTCTAGTGGATCTTTTGCCCTTGTCCACCAGTGGTGACATCACA 

GTGGCTGGCATTACTGCTGATGAGTATCTTGTGGAGTACGACAGACTGCGAGGCACTGAATATTCCCGCTTTCATGGCTATACCTACGATGGTATCTGGGCAGCTGCCCTGG 

CCATTCAGTATGTGGCCGAAAAGCGAGAGGATCTGCTAACACATTTTGATTATCGCGTGAAGGACTGGGAGAGTGTCTTCCTTGAGGCTCTACGTAATACATCCTTCGAGGG 

TGTGACGGGACCCGTGCGTTTCTACAACAACGAGCGCAAGGCCAACATCCTGATCAATCAGTTTCAGCTGGGACAAATGGAAAAGATCGGGGAATACCACTCACAGAAGTCA 

CACTTGGATTTAAGCTTGGGAAAACCAGTCAAATGGGTGGGGAAAACTCCTCCCAAGGATCGCACTTTGATCTACATCGAGCACAGTCAGGTCAATCCAACCATATATATTG 

TATCGGCTAGTGCTTCGGTCATTGGAGTGATTATTGCCACAGTTTTTCTGGCCTTTAACATTAAGTATCGCAATCAAAGATACATCAAGATGTCCAGTCCCCATTTGAACAA 

TCTGATCATTGTGGGCTGTATGATGACCTATTTGAGCATCATTTTCCTGGGTCTCGATACCACATTAAGTAGTGTGGCAGCTTTTCCCTATATCTGCACAGCTCGAGCCTGG 

ATCTTGATGGCTGGATTCAGTCTCAGTTTTGGAGCCATGTTCTCGAAGACGTGGCGGGTGCATTCGATATTCACCGATCTGAAGCTCAATAAGAAGGTGATCAAGGACTATC 

AATTGTTTATGGTTGTGGGCGTGCTTTTGGCCATTGATATAGCCATTATAACCACCTGGCAGATTGCCGATCCCTTTTACCGCGAAACTAAACAGTTGGAACCCTTGCATCA 

CGAGAATATTGATGATGTCTTGGTGATCCCCGAAAACGAGTACTGCCAGTCTGAGCACATGACCATATTCGTTAGCATTATTTATGCCTACAAGGGACTGTTGTTGGTTTTT 

GGCGCCTTTTTGGCCTGGGAAACTCGACATGTTTCTATACCGGCTCTGAACGATTCCAAGCATATTGGTTTCTCCGTTTATAACGTGTTCATCACTTGTCTGGCCGGAGCGG 

CTATATCCCTGGTGCTATCGGATCGAAAGGATTTAGTTTTTGTCTTACTCTCGTTTTTTATCATTTTTTGTACGACAGCCACTTTGTGTTTGGTGTTCGTACCGAAATTGGT 

GGAGCTGAAGCGGAATCCCCAGGGCGTGGTGGACAAACGCGTTAGGGCCACGTTGAGACCCATGTCCAAAAACGGACGCCGGGATTCCTCGGTGTGCGAACTGGAGCAACGA 

TTGCGAGATGTAAAGAACACAAACTGCCGATTCCGAAAGGCGCTGATGGAGAAGGAGAACGAGCTGCAGGCCTTAATCCGCAAGCTGGGACCCGAGGCACGCAAATGGATCG 

ATGGGGTGACCTGCACAGGTGGCTCCAACGTCGGTAGCGAACTGGAGCCCATACTGAACGATGACATTGTTAGGCTCTCAGCTCCACCGGTGCGTCGAGAGATGCCCAGCAC 

CACAGAAGTTACCGAGATGACGTCCGTGGATAGTGTGACCTCGACTCATGTGGAGATGGATAACTCCTTTGTGTCGGTGCAGTCTACAGTGATGGCGCCATCGCTTCCTCCC 

AAAAAGAAAAAGCAATCGATTGTAGAGCACCACTCGCATGCCCCTGCTCCAACTATGATGCAGCCCATCCAGCAGCAACTGCAGCAGCACTTACAGCAACATCAGCAGATGC 

AGCAGCAGCACCTGCAGCAGCAGCAACACCAGCAGATGCAACAGCAACAGCAGCAGCAGCAGCATCATCATCGCCATCTGGAGAAGAGAAACTCGGTGTCCGCTCAGACCGA 

TGATAATATAGGCAGCATCACCAGTACGGCGGGCAAGCGGAGCGGAGGAGACTGCTCCAGCATGCGGGAGAGGCGTCAATCGACCGCCTCCAGGCACTACGACAGTGGCAGC 

CAGACGCCCACCGCCCGGCCAAAGTACAGCAGCTCGCACCGGAACTCCTCCACCAACATCTCCACATCGCAATCGGAGTTGAGCAACATGTGTCCACACTCAAAGCCCAGTA 

CTCCGGCTGTGATTAAGACTCCCACTGCCTCCGACCATCGCCGCACCAGCATGGGCTCCGCTCTGAAGTCCAATTTCGTGGTTTCACAGAGTGACCTCTGGGACACGCACAC 

GCTGTCGCACGCCAAGCAGCGCCAGTCGCCGCGGAACTACGCCAGTCCGCAGCGCTGTGCGGAACATCATGGCGGCCACGGGATGACCTATGACCCGAACACCACCTCGCCC 

ATCCAGCGGTCCGTCTCCGAGAAGAACCGCAACAAACATCGGCCAAAACCGCAAAAGGGCACCGTTTGCCAGAGCGAGACGGACAGCGAACGGGAACGAGATCCGCCGCCCA 

ACAGTCAGCCGTGCGTCCAGCCGCGTAAGGTCAGCCGGAGCTCTAACATCCAGCACGCCGCCCACCACCACAGTTCGCCCAATGTGGCGCCCGATAAGCAGCGGAGCAGGCA 

GCGCGGCAAGCAGGATAGCAGCATCTACGGCGCCAGCAGCGAGACGGAACTGCTCGAGGGCGAGACGGCAATTTTGCCCATCTTCCGGAAACTCCTCACCGAGAAGAGTCCC 

AACTATCGGGGCCGCAGTGCCGTGGGCCAGAGCTGTCCGAATATATCCATCAAATGCGATATCGTCGAGTACTTGTAGGCGGCGCGAGTCATATGCATTATGCCGTAGTTAA 

ACTCCTATTGTTAGATCCAGTTGCAGTGTAAAGATTATCTTTGTTTTCTTTGGAGTTACGCATCTACATACATACCCTGTGAAGCAGGACACACACCATTTAGTGAACTAGT 

AGCAATGACATGAGATAGTTGGGAAATAAATTACAATTTAAAATAATCATA 

(SEQ ID NO: 59) 

Start ATG : 109 (Reverse strand: CAT) 

MFRPSWFPFASLLFLLLWSTACGRTAKRSDVYIAGFFPYGDGVENSYTGRGVMPSVKLALGHVNEHGKILANYRLHMWWNDTQCNAAVGVKSFFDMMHSGPNKVMLFGAACT 

HVTDPIAKASKHWHLTQLSYADTHPMFTKDAFPNFFRWPSENAFNAPRLALLKEFNWTRVGTVYQNEPRYSLPHNHMVADLDAMEVEWETQSFVNDVAESLKKLREKDVR 

IILGNFNEHFARKAFCEAYKLDMYGRAYQWLIMATYSTDWWNVTQDSECSVEEIATALEGAILVDLLPLSTSGDITVAGITADEYLVEYDRLRGTEYSRFHGYTYDGIWAAA 

LAIQYVAEKREDLLTHFDYRVKDWESVFLEALRNTSFEGVTGPVRFYNNERKANILINQFQLGQMEKIGEYHSQKSHLDLSLGKPVKWVGKTPPKDRTLI YIEHSQVNPTIY 

IVSASASVIGVIIATVFLAFNIKYRNQRYIKMSSPHLNNLIIVGCMMTYLSIIFLGLDTTLSSVAAFPYICTARAWILMAGFSLSFGAMFSKTWRVHSIFTDLKLNKKVIKD 

YQLFMVVGVLLAIDIAIITTWQIADPFYRETKQLEPLHHENIDDVLVIPEHEYCQSEHMTIFVSII YAYKGLLLVFGAFLAWETRHVSIPALNDSKHIGFSVYNVFITCLAG 

AAISLVLSDRKDLVFVLLSFFIIFCTTATLCLVFVPKLVELKRNPQGVVDKRVRATLRPMSKNGRRDSSVCELEQRLRDVKNTNCRFRKALMEKENELQALIRKLGPEARKW 

IDGVTCTGGSNVGSELEPILNDDIVRLSAPPVRREMPSTTEVTEMTSVDSVTSTHVEMDNSFVSVQSTVMAPSLPPKKKKQSIVEHHSHAPAPTMMQPIQQQLQQHLQQHQQ 

MOQQHLQQQQHQQMQQQQQQQQHHHRHLEKRNSVSAQTDDNIGSITSTAGKRSGGDCSSMRERRQSTASRHYDSGSQTPTARPKYSSSHRNSSTNISTSQSELSNMCPHSKP 

STPAVIKTPTASDHRRTSMGSALKSNFVVSQSDLWDTHTLSHAKQRQSPRNYASPQRCAEHHGGHGMTYDPNTTSPIQRSVSEKNRNKHRPKPOKGTVCQSETDSERERDPP 

PNSQPCVQPRKVSRSSNIQHAAHHHSSPNVAPDKQRSRQRGKQDSSI YGASSETELLEGETAILPI FRKLLTEKSPNYRGRSAVGQSCPNISIKCDIVEYL* 

(SEQ ID NO: 60) 

Name: GABA-B receptor like 
Classification: G_protein_linked_receptor 
Gene Symbol: BcDNA :GH07312 
FlyBase ID: FBgn0027575 



Celera Sequence No. : 142000013383823 

GTTCCTTAAACGTGATCAAAAATATAACTAGAGCTACTACTAAGCAGCGAGCACAGCGTGCGATATATCAACAAGTCCAGCTGGCAACATGTTGCCGCGCCTGTGGTGAGTT 
GTGCCTCTATGTCCTTTGGCAGCATGTTGCGGTCGCTTGGCGCCGGCTGGTGGTCACAAAAAGCGCCGACGCTTGCCGTCAGTGCTTCGCGAAGATTGCGAGGCGCGACACA 
CATCGCGGAACCCGAACTCCATACGGCTGTGCCGACACTCAATCCCGACCTAAGCCCAATTTTTTGTTGCGTGAAGTTGACGAAGTGTGTTGTACTGCTGCAAGTGCAAGTC 
CCCGATTGCTAGTGCTATTCCGGGATCACAAACGAGCATCCTTCTTCGGCCTCACGATTGACGCCTTCTACCACTACCTGCGCCAGGCGCTGCCGCTGGCCAAGGAGGCTGC 
GATCCACCTGAATGCCAGCAACGAGATCTCCGCTGTCGGTGACGGGGTCACCATCACAGGGACACCCGGTGATCTTCTCAACTACAGCGGGCTGGAATTGGATCTCGGACTG 
GATTTGGATCTTAATCTGGACATGGACCTAGCCACTACTCCCAGCAGCAGCACCTTGGCGCCAGCTGTAACCGTCAGGACACCGGGGAACAGGAGCGTGGTGAGGGTCAGTG 
CCGATGTGCCCATCTGGGTGGTGCCGTGCTATTCGGCCATCCTTCTGTGCGCCGTGGTCGGGAACCTGCTGGTGGTGCTAACGCTCGTCCAGAATCGTCGCATGCGCACCAT 
CACGAACGTTTTTCTGCTCAACCTGGCCATCTCGGACATCCTGCTCGGCGTATTCTGCATGCCGGTGACCCTGGTCGGCACTCTGCTAAGGCATTTCATATTCGGCGAGCTG 
CTCTGCAAGCTCATCCAGTTTGCTCAGGGTGAGTGGTGGAAGCCAGTAGGCCAGTTCATTTATGTCTTGTCCGTGTCCAAATTAAAAGGTTATCCCAAACTTCCAACTGTGA 
ATTCCGCATTTATTAGTCCCTATCCCCGTGACCGAGCCCCATAATCGCTTGTCAAATGGCCACGAACCAGCACGACCTCGAGCGGATTCGCCAGCAGATCGTGTTGGCCAAC 
ATTCAGGAGCTGATAAAGAAGATGACACGTCGCTGCTTCGACGTATGTATCGCTATGCCGGAAATGGAGTTGCGCTCCACGGAGCGCGACTGCCTGGCCAACTGTATGGATC 
GATTCATGGACTCGGTTCAGGTGGTGTCGAGCCAGTACTTCCGTCGCCGGCGTCGCCATCAGCAAATTCGTTTGTCCCGTTCGACCGCCTCCTCCGCATCACCACCAGCATC 
CGCATCCGCATCCATGCCCAAATCCGCAGCAGCAAATGAATCTGAATCCGCATCTAGGGCCTCCAATGACGAAAAGGTCAAATAATGGTTAAGTCTAGACAGATAGCCACGT 
GTGGAGTTTATTCCAAGTCGAAAACGAAGGCTGTTCCGTCGTAGCCCCAGATCCCCAGTGTACTTGGGTGATGGCCACTTGTCCACATTGGTCATCATCCCAAAGCTTTCGA 
GGATGATTTCATACTTCGAGACCTTTGGGAAGGGCAAGAAGGGGTCGTCTAGCTGGAAATGAAAGTGGATTTTCTGCAGGCGTGGCCATCGATGATTTCTTGCTATTGGTTC 
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TTAACAAAATCCTAGTAGAGTCTTAAGTftAATGACGATTGGCAGRAAATAAflGATGTCGTTCAGTGTACATTCACTATGCTTTGAGATTCTCTTTAAAGTCflCCCTTTTGAA 
ATCACTTAGCTAAGCTTAATATTGTATAATATACAAAGTAGTGTAGTGGGGAGAAGATAAGTTTTAAATGACCATACTGTATATTTTAGATTGTGCGCTTGTGCACATCACT 
TTTCCACTTAGGAAAACTGAGTCTGCCATCGATTTTACGCACAATTTCCGCGCTTATTTATGGCCTAAATCATATTTCGTTGGCAAGAACGCACTATATCTCGTCTCCCAAT 
CTGCATCCGCCTGAGAATGGCCGTATTTTATGATGGCCGCCCGCATTTGCGTAACGACAGTCAGGGTTTCCCCGACCAGCGAGCACCGGGTTTTGGGTTCCTCAAGGGTTCA 
AAAGATTAAATAGGCCCACGCACACGACACACACACGCACACACACGCACTCGCACACTGTAAATATGCGGGGGAAGCTCTTTAAAACCATTTCAACTCAATGACTGTGCCC 
AAAACTCATGTTTGGTCCTTGAAAAGTCCAGTCAGATGGGTAAACATTCTCACCAGGCATTAATTTGTGTCTCCAGCTCAGATAAGGGTTGGTGATGTGGGTGGCGATATTC 
ATCAGCTGAGCCCGGTTCAGCCATCTGTGATATGGTTCGCCTTCGGCGTGATTCCATGTCCCGTCGGCGGCAGTGGAACAGAATGAAGCAGAAATGTGGCCCTGACACCTGT 
CACAAATGCTAATCGGCATTTCGGTTTCAAGGAGCCGAGCGGCATTAATTTGCTTATTTAATCAAAGTGCATGCCATCGGACATTCTAATTATTTTCATAACGGTAAGGAGA 
GGTCCTCAACCTGTTTCACCTCCTAGGACTCCTTGGAGGCGCATCCAAACGGATGATGAAGCTCAGTCCAGGCCAGTCATCCTCCATTCCCGCAAGATTAATCCAGGCTGAT 
CCCCATTTCCAATGCAAATTGCTCGCCTGCCGGGAGCTCTGCTTCTGCTCAGCATTCTGGTCAATTTCCATATGTACGCAGTGTCGATCTGGGAGCCTACTCACCCTTCTCA 
CCCAGCTCCTGTTACCCTGTTCCTTGCTCCTTTGGCGTTGTAGCACTCAAGTCGCAGCCTAAGCAGTCTGGAATTGCTTTTAACCGTGGCAATATTGCGAAAATCACTCTGC 
TATGCAGCGTCACGTCGCGATTTTCTTGTAGTGTTCCGTTCCGAAGCCCCTTTCCCCCATCAGCTCCATCAGCTCCGCTAGCTGCTCCACATCCTTTACGTCCCCTCGCCGA 
TCCTTCTGACACCCTTCCTTGCATTTTCATGACTTGTTTCCTGTTTCCTGCGTTTCCGTTTCAGACGTGCTGGTGATGTTGTTTGTCTTGCTGCAAGTTTGTTTTCGGTACT 
GCCGGAACCGCTGTTCTCTCCTGGCCTCCAACACCACCCCCCCCCCTATCCACTGCCCCCGGCATCCTGACGTATCCTGGCGTACCCTGGCATATCCATCTTGGCCGCCCCC 
GCACTAACTTATTAACAACATAATAACAATGGCTGGCAAACAAGTTGAGCGCTCAGGCACACACATGCCAATTTGTTCAGCAAGCACTCGGCGCTGTCTCGAGCCCCAGAAA 
GGCCCAGAAAAGGAGCTGGCATTCAGTTTCCAAGGATTCCGCGCCCCCATCCTCATCCGCATCCGCATCCTTGTTCGCATTCCCATTCCTTTCCATCTGTTGCACGGGGCGT 
CGACTCGTGGGTCGTAAATCTGAAAATGTCATCCCTGGCTGGGCTGAGGCACCAATTTATGGCCATCATCACGTCAAGGTGCAGCGATGTCCTGACACGTCCTTCCTGGCTC 
CCTGTCCCACCTGGCTCCTTCCGGAGTGGGCGTGAACGCGCCCCTGGATTTATGATCGTAATTATGCATTTGTGGCCAGTGGCATTTGCATTGGACTCCTGCCAGTGGCAAC 
CAATCGCCACCCACTGCCATTCGATTTCATCATCTTGCTTGCAAGAGTAATGTTGGCATTGTCCATTACTTATGTAAATCGAAACCGAAGGATAAGCTCCTGTAAGTCCATA 
TTGGTTATAGTTTCTGCATATTCGATCTACCAAGAATTTCAACGAAATTCACAAGCGAGGCAGCAATAACAAGCTCAAAGTTAGTCGGATTATGGGGAGCTACTAAAAATAC 
ATAGTTGTTGACTTTTGGCTCAGAATTAGAGGTAATAAAACACAAACCGATGGTCTTTTTTTATGGTCCTTTTCTTTTCGCAAGAAATTCCTCCTCGAAAACGAACATCTTA 
GCAACTAATATTAGCAACTTTGTATGTTGCCCATACCAAAAAGAGTGTGAAAGGAATGTTTTCTGTTTGATGAACACTTTAAGGCTAGGACATAATCTCAGATAAACTGTTG 
TCTAATTAACTTAATTTGATATCCAATTAAAGTAACTATGTGTGCTGGTAAGACAAAGAAAAGGTTTCTTTGTGGCGCTTGAAAAGGACATTTGTGTGCGGAAAAGGGTCAA 
ATTGACATCCTTAGACCAAAGTTTATGGCTTTGGGCTTTAGAATTTTCTCTATTCCACTGCTCGACCTGGTTGCACTTTTTGTACCCCAGGCGGGACGGCCTTGAACCCTGA 
CCTGTGGATGGGCCACCCATTGGGCGCGTCCAACTCGTCCAAGTCGCTGACAGCTCGGCAAATTAATGATTAATTGAATTTTCCTTGGCTCTGTTTTGGGTTTTTTTTTTTT 
TGTTGTCCATTCTATTATTTCTTGTTTTCTTGGCAACACAAATACAAATCAGCGCCACACAGACGTAATCATTAATTAGGCGCGCAGGGAGCTAAGTGTCTCAAGAAATGTG 
GGCGCAGTGCAATTCCAGCTAGTTTTTCAAGGGCCAAAGGCACTCGTAATGAGCAAATTTATGCACAAACACACAGTTTCTACAGTTTCTGATGGAGCTTGGCAAACATTAA 
GGGGCCAAATGGCATAAGTAGAAAAACGAATTGAAATAAGAACTAACCACGATGCTCTCGTTTTTCTTATTTCCTATCCCCCCATGTGGGCGGTACATGTTATTGACGCCCA 
AAGTGCCGCCCCCATAGACCCATGCCACCGTGCCACGCGCCCACTCTTTTTCTTTTGCTGGCTCCAAAGTTTGAACGGCATCTGCAGAGAAACGGGCTTATTTGACTGAGAA 
GGAGGTAGGAGGTCGGCGAAGGACGCATTTATTTGAACTTATGTACACAAGTGCACATATATACATATATATATATACATACATACTGCGCTGTGTATGCCTGCCATATTCA 
TGTTCATGTTGTTTTCGCCTTTTAATGAGAGTTAACTGAGCGGGGGAACTGGAAACTCCGCGGCCAACGATGAATATGAATGCGCCACTCAACTTAATCCCACTCAATCATT 
ATGTGGCAATCCGCCCAGTTCTTTTTGACCATGGGTCTGCAATTGTCCTCTCGGCTCGTACTCGTACACAAGCCCCAACATTATTTGCACCGAACGTATCTCTGTACCGATC 
TGAATTCGTATTGCATGTCGAACTCGACACTTTATGGTGCTCCGAAATTGTTTTGTAAGCGAAATAATGGCAAGTTATGACACCCAGTGTTACACACAATTGACATGGTTAT 
TGATTTTCAATTATATAAATATTTGCAACTAAGGAAAAAAGCCTCTGCTGCCACTGCTGCCACTGATGTCCTGTGGAAAAAGCATTTAAATACCGTGTTCATCGGCAATCTG 
TGTGCGCTTCTGTGTATTTTCAATTTGCATACACCTGCCCATGAAGAGGCGAAAGTGATGTGTTAAATGAAAATATGCTCCTAATAAACGTTTTTCATCGGTTTCCCGTATA 
ATTGTTTTCTGGCTTTCAACACTACACTGCACGAAATTTAAGGCTAACGCGCGAGTGGGGAGCGTGTGGCATGAAATCAATTTCATTCGTTGGTTTTAAAGCCCCGTGTCCC 
ATTGATTTCGAGGCAAAAGCTTATATTTGTCAAGATCATAATCAAAGCTGTCGGCCATTAAATAAAATGCTCAACGAGAGAAAAGAAACCCATTAAAAACTGAGCGAATTAA 
AGCGATATAAAGCACCATTTTTTGCGTGTGAGTATTATTTGCGTGGCGAAATGAAATATACTCCTGAGCTGAATTAATGAGACCCGGTAGGCGGGTTGAAGTGGGTGGGCTT 
TGCGGACAGGCATCTTAAGGGCGCAAGGACATCGGGAACATGGAACACGGAACACGGAACACGGAGCATGGAGCATGGGCACAAAGGGAATACAAATGAAAGGTGATTTTTC 
AGCTCCCTTTGGCCTTTTGAGGATTTCCATGGGGAAACCACAAAGTGCGGT TGACAGGGCC'f AATGTGAAATTAACATGGCTTATATGCAACGCATACTGCCCTGTTGCATT 
CATGTCAGCCACACACACACACACACACATCCATGCACCCAAACACCGACAGCTCAACCGTCCATCTTTGTGTAGCGGGTCAAGTTGCCATCAAGAACACGGTTGTGCCACT 
GCCTCTGTTCATCCTGTTTAACAGCCCGGTATTGGATCCCGAATACCAGCCACCACCCTCCGCTCCCTACAACTTGCCGCAAATTTCCTTGGACCCAATTTATTTTGCAATT 
TATCGCTGGATAAAAAGTGACTGCGAAAAGCATTTCAAAGCGGAGGCGCAGGACATTGAAAAAACAGTGCCATGACAACAATTACGGCGTGTTTAAGTAGCCCGTCTGGGTG 
AAAGTCACGCCGTGAACTGCCACTTTCTCCCCTTCGTTGGATGGCATAAAAGATAGATTGTTTATAGTTGGTATCAAAAGGAAAAAGGCCTGGCCAATGTCGCAGATTGAAT 
TGAGTGCTCTGGAGGGAATCGGGGTCACCAAATGCTGGCCTCAGAGTCCAGAACGCCAGCTCTAAGACTTGTATTATTATGGTATTAAATGGTCAGGGACACCAGAGTACAA 
CTTTAAAGTATCTTGGTATAAAGTAAAGTATGCAGTTTTAATCCTAAACTTGGCTCAAATGACCTTATATTCCACCATTTTTAAGTAACAGTCCAGTAGATTTCAATGCATC 
ACAATTAGAAATGAATTCAGCCCTGAAGGCTGAAAAATAAACCCAATTAATAATTTAAAGCTTCGGCTGAACGATTAGCATGCATTATTAACACTCACAAATATATGCCATA 
ATATGTACTTTTATGTAGAGCACTCTCAAATTGTGTCACATTCAGCACATTCCTTTCAGTTCGTTTAATTAAAGTGTCCAATAAATTAACGCTCGACAAGAAGGACAAACTT 
AAATGCACTTCAAATGGCGCAGAACCCATCAAGACTTTCCATCAATTTGCAAAAGTGTTGGCCAAAAGTGAGGCTCCCCACCTGGGAACTTTGTTTCCGTGAACTTTGCACC 
GGCACGCGGTGAACGCGAAATGCATTATTCACTTTTATGTCTTTTTCCTCAACTACTGTCCGGGACAAAAAAATCAGTTTGCAACTGAAGAATCTTACCTCGACTCGAACTC 
CAATTGCGATGTCCTTGTTGCCCAGCTCTCAGCTCCTTGATGATGCAGCGCAGCGGCAGTTGGGCTCTCCGTAAATCGAGCGACATAAAGCGGTCACCCAACATTGGCCATT 
CGATGGGAGTGTGACAAAGCTCCCAGCTAAACTTCCCCGGCCATGTCATGCATGCGAAATGGAGTTCTCGAGAAGCAGCTAGTGAATCTTTAATGGATTCATCGATTGCGAA 
GCGAAGGAAGCTGCAAAACTTTCGGTTTTTAAGCCACGCTCGGTTTGGTAGTTGGACACTCGATAAATTTACCCAATCACCAGAAAAATATGCAACAGCTTTTGGGCTTCAT 
TCTACTCTAGTTTTCCAATTAAAATGCAAAGTTCTAGAGTTTCCAAAACTCAATGCTCCTGGACCCCCATCCTCGCCGGCGTCGTCATCCTTGGGAATGGAGATCTCACTCG 
ATTTGGCCATAGAGTGTGTAGGACATTTTAATTTGTTGTGTTTTACAATTTGCCCCCATACAAATTCATGTCGATTGAGGCTTTTTCGCCTGAGTCCGCTCAGTTTATGGCA 
CTTTTATAACTCACATGTATATATGTATATATGTATGTACAGACAGGCGAGGACCCCCGGGGCGGTAAAAAGTTGCTCTTTTTTTTCGATGGCCTTAACTTGGCGTATCAAT 
ATGTGGGGGGGAAAACAATGGGAAGCAATTTTTTCAAATAATTTCACAGGCGAAATCAAAAGGAAGCAGCCCGGGAAGAACAAGAACGCGCGGCACATCGACCATCTGATAC 
CTCTTGTCTATGCTGGTGCGGTGTCCATTGTGGGCGCCACAAGTGGAGGGTAGTGATCTGTTGATACCATCGACATTCATCCGAGAAAAAAGCGTATCCTTGTGCCTTTTCA 
ATCTGGCCATCTTGCCATCCGGCCATCCAAGCATCGGCTGCCATCGAGCGTCGAAAAGGCCAAACTTTTGCACAACCGTTTTTTATTGTTGACTCAAATTTGGAATTATTTT 
TATTTGAGCACCGTGGAGGGACACACGCACTGAGTCCATAAAAATCCACTTTCAGTTATGTGGCAGTGGGTGTAAGTGGAAATGCCGAAAGATGGACTTATTGTCATTGAGT 
ATCATCGCATCGAATGGGGCGCTAAATCAAAAACAACGACCAGAGGGCAATCCGGAATTGGATTGGTTTCGAAATCTGGAATTCGCGGTGGGGGGAAATTGAAAATGGGCCA 
TATTTTGGCAATCGATTTGCATGCGCCGCTTAAAGTCATTTCCAATTCGATAATCATAATTTATGAAGCTGTTTTGTATTGCAATTTTCGCATATAAATTACATTAAATATT 
TACGTCAGCCGTGAAAAATACATAAATAGTGCGTCTCTGGTGTGTCCGCCGATTCCGCGCGAGTCCTTTGTATCCGCATATCCTGTTCAAACAATACTGCGCTAGAGTCGAA 
ATCCAAATATTTGCGCAGCAAATGAAAAGCATTTAAATCCGGTCGCAGAGGTTAACCGTAGTAGTTTTTAAGCCGCCTGGCTGCCGCAACCGGCAAACTTTTCGATCCACTC 
TCGCGACTAGCGAGCAATTAGCTGCGGTCGCATTTCATTGTGCTACCCGCTATTTGCACTAAACTTTACACTCATTAAACAGGCGAGAAATTATTGCAGCGAAGTGTCCGGG 
ATTAAAGTGTTGCATCCCCATCCCGCTGCCAGTCGGATGATGATGAGGCATGATGGGTTTCTGCCAGGTGGTCGCACTTTTAATTGGTATTCAACTTTCTCCTTTGACCCTC 
ACAACTCGAAACATGCTCGTAATTTACCAACCAAAATAAGCGAGTTTCCCAGTTCCGTTCCAACTTCCTTGGTCACCTCAAAAGCTGAAGGCGAGAAGCCTGTTCGCTAAGC 
TAAGCCGCACTCAAAGGTTGGCTCTGTGTTTGCCTTACCTTTGCAGTTGAGATGAAGATAAACAGAAATAAATGAAATGTTACATTTGTGTACCTTGAAAATATTCAGAAGC 
CATGTTTATAAATGAAATTATACATTTATACTATATTTTAATGCAGTGCCTGGGAAAACTCTAAGACATTGGTGTTGCATTAAAAATAAAGTGCTTAAAAGTATGCAACACA 
GAATTGATAGTACTGCATTAGCATGTTGATTCATCTGAATATTGAGTTACATACCCCACATGTTGTACTTATCAGATAATTATTTGATTTACATGCTTAAATCATATCAGTG 
CTAGCTTACCCAGAAGCCCTAAATTACTTCTATCCCAAGACATTCTCTCAGTGTACATGCGGTGGGTATTCCCAAAAGTTTAGGAAAATTGCCCGTGGCATTTGGCTTGCCA 
GAAACGGATGCCTGTCGAAGTGCAGCTAGTTCTTTTGGGCAAAGGCCAGTTATACTTTTGCGGTTGTTAGCTTAATTTAATGTAACTTTCGACATGGTCAGATGGAGTCGAA 
AGGTGAGCGGTGTATCTGGAGAAGATCGAAAAGCAAATAACCAGCTGGAAAATGTTTGGCCACAACCCTCTGAGAATCCACTTGGCCTTACGAGTGGATGGATGGTTGATGG 
GTGGAAACCGGAGCACACGCACAGCACAATTAGCAGTTATGAAAAGTTGAAAGTTCATTTTGTGCAGCAACAAAACGAAAATGTGATGAAAAGTTCGCACTGACCTTTTGGA 
ATTTTTCATTGTGAACGAAGGAGCTGGCCGAAAATGATTTTGCCAAAAAGCCGTGGGCGACAATAAACAGATGAAGTCTGAGGCATGTTTGCGGTCCACTTAATTGCAGGCC 
AAGAAGCTCATGAAATACCCTGGTAGCCACAAACTGGTTGCTCCATCTGCAGACCAAACGAAATTATGGGATTCATTCGCAGAGGGAATGTGCAACAAAGCAACTAAAATGT 
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GAGATTTTCCRTTTGAAAGCCCACACTTTCGTTCGATTTGCCACAATTTGCAGCATGTATAAAGTTATATCCGTACTGCCATGATATTTCTGCATGGCAGZ^ACCATTTCAAG 
TTTGTCACTCGAACACGCACCCAAGTGCAGAATGCCGGCATGTGAGGATGCGAGGATGCGAAAATGCCAGGACACGCGGAGGTGGCCGGTGGCTTTTCGATAGCGGAGGATG 
GTCAAGCCACGAGGGGCACCGAAACGGGAACCCGGCTCATTGTCATTTTCCGACCCGAGTTGGCTTTGAGCACCTGAGAACCAGTCGACCCAGTCGACCCAGTCGACCCAGT 
CGACCAGTCTACCTGTCTACCTGTCGACCAAATGTGGACACAGGCTCCATTCGAGATGGCTGCTCGTCAACAGAGCCATCAAATTGTCATTTTCCGAACAAACGCGTTGGTT 
TTCACTCGAAATTGGCCGAAAAATAAATGTCCTTGTAAATCACTCGACTTTTATCTTTAAAATATGATTTAAATTCAGCGAATAAAATACCCAATAGCTGATAATTTATGCC 
CAATTTATTATGAATTAAAAAAAAAAAAAAGATATGAAGATTTGATATTAAAATGCCCAAAAATACCATATATATTACCAATTTTATTGATTAACATTTGCTCATTTCAGTC 
ATAAACAAATCCTTTGATAAAGCCAATAAAATAATATTTTAAAAAATTTTAAACCAAACCCAAATCATAATATTAATCTTATGCACACCAAACTAAATGTTGAATTGGCAAG 
ACGAGAATGCTAATAGTACATAAATCACAACTTATCACTTTGGGAAACAACCATTAAAAATGCTATTTAAAAGCAGACAAACAGCGTAGGATGGCTGGTACATTTTTACTGG 
GATTTTGCTCTCCTTTGGCAATCTGTCATGTTTTTGAAGTAGGTCCTCGTCCTCGACAGCATCGTCTGCCAGATAAGAAATGCATATTTAATAATTTTATTCTTTCACTCAG 
CAGGGGCCCTAGGAGCTTAGCTTGTTTTATGACTGCGACAAAGTAATCCTGTGCAAAAGGTAGGAAGTGGGAAATCACTGGTTCCAAAGGGGCAGATTATCAGGGATACAAC 
AGAGATGTGCTCGACTTAGCTGGCATTATGAGAAGCTCGTGTGGAAAAATTGGCCCCAACCAACACACAACACAAAAATGGGCGAAGCCAAAGTTCTCGATGTCGATGTTGT 
TTCCCTTCAATCAGCGATTCAATTTACGAGTGCCAAAATTCGACACAGTTTTCCCCTTAGCCATTTTGCCACTTGCCACTTGCCACTTTTCACTGCCGGCTGACACGCCCCA 
TCCGGAATGCCGCGAAAATGGGCGTGCGAATATTTTGCCTTTCGCTGAATTCATTGTTGGCCCATGGCCCACGTTGCGGTCCGGTTTCTATTTTCAGTATATCCATAAAATT 
AACATTTTTGTTGCGGACTCTCAAAATCACTGAGCTATGGTCCTCGTGGTCCTCCAGTTTGCATTTCCCAGCGAAAAGGGAAAGGGGCAAGAGGTAAAGTGGCAACGGGGTA 
GAGGGTGCGTATATGGCGGCGCCACGCGACGGAGCTAAATTTAATTTTGTTAATGAGCTCGGCCCCTTTCCGTTTCATCATCCCGCCAATGGGGGCAGCCAGCCAGCCAGCC 
AGCATGGCGCAGCCACACTCTGCCGTCCTATTATTTTGCATTCGCAATTCTAAAAGCCTCAACCCGTTTGTGAGGTGGTCCGTGGTTCCAGAATCCACTTGCCAAGTGGAGC 
GAGTGTGTGTGCGTGTGTTTTGGCCTTTATTGCGCTCTTGGTGCCCGCCAGGATTAGACTGTGGATAGGCATCGGGATTGGGATTGGGTTGGGGATGGGGAATGGGGAATGG 
GAATGGGAATAGGGGATTGCCATTGGGATGGCGATTAGAACGACAATGGACCATTGCCGGAAACACTTGAAAACTGTTTCGTGTTGGCTGGGGACTTTGATTAAAAGCGGCA 
CCATTGCTATCATAGCGGCAAGCCGAAAAGCAGAAAACAAAAACAAAACAAAAAAAATGAGGGAAAATAAACAAGTTGGATCTGACAAAGACATTAACATTTTAATTGTTCC 
GGCAACGCAATGGAGTCGTGGACCAAGCGTAACAATTGCCAATGAGTGAGCTCGAATAAAAGTTTATGGGGTGGAGGCGAAGGGCAGTCGGGAAAATGGCCATTAATATTGG 
AGCCATAAAAGTCAAAACTTTTCGACGGGTGGCAAATGGGAGGCGTGGATTAGCAATATATCCCATGCCGCCAGGAAATTGACCCTAGTCATTATTTGCCACAGAGATGATT 
ATGTTAGGCAATAAACAAACTTAAGTTTTGAATCTGCTCGATTTGTGAATGAAAACGCCAAAAATATACATATATAATCTAATGTAAATATTTTAGTTATGGATTTTGAATG 
GCTTGGATTTTTGTATTGGTTGGTAACTTCTCTGTTTTTCCAATTCATTGCAGCTGCCTCGGTGGCCGTTTCCTCCTGGACACTGGTGGCCATCTCTTGCGAGCGCTACTAC 
GCCATTTGCCACCCGCTGAGGTCGCGCACCTGGCAGACGATCAACCACGCCAACAAGATCATCGCCATCATCTGGCTGGGCAGTCTGGTGTGCATGACGCCCATCGCCGCCT 
TTAGCCAGCTGATGCCGACCAGCCGACCAGGTGAGTGTGCGCCAGGATAACACTAATTCAGGAACTAACCCCCTGTCTGGTCAGGACTTCGCAAGTGCCGCGAGCAGTGGCC 
GGCGGATAGCCTCAACTACGAGCGGGCATACAACCTGTTCCTGGACCTGGCCCTGCTGGTCCTTCCTCTGCTGGCTCTGAGCTTCACCTATCTTTTTATCACCCGCACCCTG 
TACGTGAGCATGCGCAACGAGCGAGCCATGAACTTTGGCAGCAGTGGGCCGGAGGTCACCACCTCCTCCTCTGCCGCTGTCGCAGAAGCCGGTAGCCAGCGACGCGCCAACG 
GAAGCCATTGTCAGTCCCTCGACACGATTGTGCCACACCAGCACAATCCTCACCAGCAGCACCACCACCATTCCCAGTACTACTATGGTAAGGGGTTTATGCTTATTTTTTG 
AGATATTCGTAGATTTTTTTTGTCAGCCAGCTGGGTCAAAGGGAATTCTGGGTTCAGAGGGTTCAGTGGCGTAGCATTAATAGTTGCGACTGTTGCCGGGGGTTGGCAGGGG 
GCGTCATTCGGGATTAGCTCAACTACTGAGCTTATCCGCGGACCCAAAGCCCCCGAACTGCGTCACACACAATTTGCACGGCAAAGTTTCCCACGCACGAAAAGATTCGTGT 
GTCTACGACTCCTGCCTAAATCGCGGAAGTGAATTGAATGGCTGGCTGGGGGATGGGGAAAGGGGGCTAAAGCCTGGAGGAGCACTCTGTCCAAGTTTTTCAAGCGTGTGTA 
CGAAGAGACAATAGCAAGCGAAAGCTCTCCCGGCGTTTTTACTTTCATTTGCATTCGCATCGCCGGCTTTAAGAGCACCACCCATGGCTAGATGGCCCCTTCTCCCATCTAC 
GGCGTCCTCCATCCTCTCCAGCTGCGGATGCTGTGAATAATCCCCAGATTATGGGCACAAAAATTCTAGGCAGCCCAAGGCAAAGCCGGCGCAAATCAATTCTGCTAATTCG 
CCTTTAATGCCGCCTTAAAGCGTGCTCCCAATATATACATATCTCTATGTAAATTTAGAATATATTGTAGTATATGCTCGTCTGCAGCAGACATGCTTGACTGTAAATATAG 
AACAGTCCCAGTGGAGAGGCCTGCTGAAGAGGAACCTCCTCGGTGCTCCACCATGGATGCGAGTTGATAACGTGACGTGTATGATAAATGCACACCCACAAAGCGAAGTCAA 
CCGTAGAGTCCCCGTGGAAATCACAGTGAGCCTACCGAGCAGCGAACTTACTTTAATCTTCGAGAATATTACTCTGCCTCTGGTCCAATTAATGTTGAGAAGAGGAGCTCCT 
GCCACGGAAAGATGGAAGCACTGAGGCAATGGAGCAGTGGAGCAGTGGAGCAGTGAGTCCCTGGGCGACTCATTTGCATGAAAGGGTTGGCGCCTCCAGGTGGTAAGTCCTT 
TATGCGAAATGGAAAGTGTGCATTAGCACCTTCACTTGCGCCGCACTGAATTATCGCAGCACTCGAGAATTCATCGAGCGATTCATTTCCTAGGCAACTGACCCCGCTTGAC 
CCATGGACCAACACGGACATGCCAGTCCACCCGCTGCTGAAAAAGGTCAATTTAAATGCCGTAAATTATGCACGAGACGAGCATTTAAGCGCTCGTTAAAGGCCGTTATCCC 
ATTGCCAGGATTCTCCACTGGAGCCCGTCGGACGCGTGGCACTCATTTTCACACATACTCGCACTCCGGTCGCCTTTCTGTGTGACAATAAATTTGATTAAATTTTTAACTT 
TTTCGTTCCGTTTAAGGCCATCTCGTTTGACATAACCCCGTCATAAATTTCCAAGCCAAAGAGCATTTATGTGTGTTGGCTGGATCGCAAATGTCCCATCGCCGAACAAGCT 
GAACATTTTCATGGAAATTGCCATATTATTAGCAGCTTCACATGGCCCCCACTATTCGTGCCAAGAATCCGCATGCTATCCATCCCCAAACTGCGCGCTTGATGGCAATTTA 
AATTATGCTCAGCTGAACTACTCGCCCAACCGCAAACGGCAATAAAATTGCGAGGTATATGGTCTCATTCCGCACTTAAGTATCCATTTCCCCCTTACATGTATCCCAAATG 
ATTCCCATCACATGCTGGTAGCTACAAATATAGTATCCTTTCCATGTAACATCTACAATGTAGCACAAATTTTGTGATATATGGATATAATCCCTTCATTGCATCATTAATA 
TCTACGTGTCCTTCTCTTGAAACTCAGCCAAATGCCATTCCTGCTGGCGGTGAGGAAGGTGAGCGGAGACACCTCCACCCTGCACTGATCCTCATCCGCCAGCTGGACGCAG 
TGGTCCTGCAGTAGTGTGGTCAGTGCAGCCTTCGTCTGGAGCAGGCCAAGTCGCATTCCCGGACACATTCGTGGTCCAGCTCCGAAGCCCAGAAACCGACAGCCCATTGGTG 
GCTGATTTAGAAAGCGTTCTGGACGAAACTGGTTGGGTGCCGGATACAATGCGGGATCTCTGAAATCGAAAAAGACTGTAGTTCGTAGACAATCTGAAGTCTCACTGAAGTG 
TCACTCACAAATGAATGGCCTGAACCGGCAACACCAGCACGGTGCCCAAATGCACCTTAAGTTCGCTGCTCGCTCCTGATTTCTGATCAGGAAGGGTGAATGTCTTGGTGCA 
GCGCTTCTGCAGAGCCTGCATGGCCGGATGAAGACGCAGTGCCTCCAAGAGCGCAGCTTCGCTATAGCGAAGTTCACCCAGAGCCACTGGATCAATCAGATTGCCAGCGTGG 
CGCTGGGCCACTTCATCCAATTCGATGTGTAACCGTCGCTGTGCATCCTCATTGAGGGCCAATTCGTAGAGGGCGAAGGCGAGTAGCATCGCCGAGGTTTCGTAGCCCTCCA 
GTAGGAGAGTGGTGGCATGTCCAGCTAGCTCCTCCTTTCCCAAGCCTCTTTTGCTTTCCGCCAGCCATTGCAAGAGGTTGTCGCCACCACTTCGGGCTTCCACCAATTCCCT 
GAACCAGTGCTGCAGGGGCAGGGGTACGTATCTGTATAAGTGGATAGTGGATCAAAGGTTACACATTGGCGAGTTCCACATTACCAACCGGTGGCCAATGAGTCTACCCAAT 
CGAGGCGTGTGCAACAGTGACATGGTCTCCAGCAAACTCCAAACGCTTGGCTGAAAGAGCGGAGCGAGCCATTCCAGCCAGCGACTTGGTTCATGAGCCACCCTCATGTGAA 
TCCCCAGGCAATGTGCATCCAGACCGAAGATCGCGGAGGCAACCACTTGTAGCGTATAACGCGTGGCCAAGTCCTTGGCCTCGAAGCGTCCAAGGGGCACCTGATCCCGGAG 
CAACTGACATGCTCTGGCCACATGCGGCAAGGTCTGACGCACTCGGCTGGGTGTGAAGAGCGGCACCAGGTCGGCCCGCAAGACTCGCCAGCGGTCTCCGTTGGCGATGAAG 
GGATTGTGGCTAGCCAGGACATCGCGCTGGTGGTCCACATAAACGGCGTTGTCCGCACAGTCGGCAAAGTTACGGCCCACCAGGATCTGGCGCAGCAACTCCTGGTCACGCA 
CCAGAATGGCGGGTTCGTTAAACAGGCGATAGAAACCCACATATTTCAGGCCCGGATTCCGACTGTCAAGAAAAGTTCATTGCACGTGGCCTTGAAGGACTCTTAAGATTAG 
TGTATCTTACGTATAGATTTCCTGGTAAATCTCTCCATATGAGCGGCGGCCCAGGGCAAAGTCCAACATATTCCCAACCAATGGCCAGCCAAAAGGTGCCTCCAGTCCCAGG 
CGACGCCAGTGATTGCCCTGCCATTTCCAAAACAGGAGCGTGGCGGCCAATAGGATAAATAGCACCAGTGGCAGCATTATCGATGAGCATCTGAAAAGCTTGAAAGGCACAA 
TCCCACTCAACTTGCATTCCCAGTTGAGTTGAATATCTCAAAGCAGCTGAGAGTGAACCACGTTGCGTGATTCCATGATTGCGCCTCATTAGCCACGTAATTGCTGTATATC 
AATCAACTGACGGGTATGTTTCAATCACGCAAGATTCGAGACAGCTAGCATTGGGCTTTGAATTAATTTCTATAACCACTATGACTCAATATGTTTTCGGATAATAATCTTA 
GTATTTCAACTTATATGCATTGTACACATCCATTCAGTTTTAATTTATTTAGTGCATTTCTTCTGTTCAAAATTTTGTTTTTTTTTTCTGTGCTTAACATTCAATTTGACAA 
TCGTATCCATTTGCGAGTCAAATCAGATGTGTTTTGTGGATACCCTGTAACTTTCCATCTTGCAGATTACGGCCACTGTGGCAGCAAGCGGCGATTGATCAGCGGAGGAGGA 
CCCTGCGAAGGAAGGAGGCATCTATACTGCATGCGGAGCGCATCCGTGAAGTCCCTGCGCCATCAGCAGATCAACGGAGGAGGTGGAACCCTAAGTGGAACTGGAGCTGGAA 
ATGGCGAGTGCTGCAGCCGGGTGCACAGGATGCGCCAACAGATGCAGCTGCAGCAGCAAGGCTACGTGAGCGACAATGAATCCCGGCGCAAGTCATTGTCGCAGCCCAGTCT 
GCGTATCACGGAAGCGGGACTGCGGAGGTGAGTGCTTCAGGGTATTCTCATCCACATATGACCGACCGATTCGTGTGTCGATTTTCGCTTTCACGGTACCTTGCAGATCCAA 
CGAAACCAAGAGTCTGGAGAGCAAGAAGCGTGTGGTCAAGATGCTGTTCGTCCTAGTGCTGGAGTTCTTCATCTGCTGGACGCCGCTGTATGTGATCAACACGATGACCATG 
CTCCTTGGGCCGACGGTGTACGAGTACGTGGGCTATACCTCCATCAGTTTCCTCCAGCTGCTGGCCTACTCATCCAGCTGCTGCAATCCGATCACCTACTGCTTCATGAACG 
CCAGCTTCCGGCGCGCCTTCGTGGACACCTTTAAGGGGATGCGGGTGTGCGAGCGCCTGTGCGCTCCCTGCTGCTTCTGGCGGCGACGCTCCAAGAACGAGACGAATCTTTC 
GGTGGCCGGTAACTCCATTGCGCTGGCCAACTCGGTCATGTCCAGCCACACGATCCTCGAGAGTCCGCGACTCTGAGCCAGCCGCCGCCGACGGCAGATGGCCGAGTCGGAG 
TCACTTTAGGTTAGCAATGGGTACGGATCGGGAGCGCTAATATGTGATGATGCAATGTAACTGCAATAGAAACTGTGGTTGTCAACGACATTCCGCCGCGTTCTGAATGTGT 
GTGGAAGTGGCCAGATGGATAGGTGGGTGGATGGAGGAGGTGCCATGGCAATTGCCCCTCCAAATTGCTCTTCATCTTACCGATAAGTATTTTAATATATGCGTATGTGTGT 
ATGCTTAGAGTTGCGCTAAAGCGCTTTATTCCGTGTGTGTTATTCGACTTTGACCTCGCGTTCCATGTGTTTACACCTTAAGGTGGGTGCAACCAGTCATGCCAATCTAAAT 
TGAGATCGCCACACGCTATGGGGAATGGAAAGTGCTCAGCTGGATTAAGATTTTTAAACAAACAATAGGCTAAGAAATCAGCACAATGGTTTCACGACATGGTCACAGTATG 
TCGTGAGCCAACAAAATGACTTCCAAATGGGGGGCGCTCCATAAAATTGTTTTAATTTATGGAAAATGGTATATGTGTTGGAAGGATTTGTTGGTTCAAATCTGGAGTTGGG 
TCAAAAGTTGGTAAACAACTCATTTGTCACACATGTCGTATGAGTGATATTTACGCAAGCTATCCGACCATTGTGCTGCACTTTCTAATTTTATTTCCACACTTTACGTTCC 
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AAAGATCAATCCCAAAAGTAAAAGGTATTCTGATCACGKTGCAGrGAAAGGTTTCAATGAATCAAGAAGCATTTGACAGCAAGCTGATTTTACAGGGTACACAGGGAAGTCC 
CACTCTCAAAAGGATTCCCATCATTCAAAAATGGTTGTCAGACGGAGCTTATGGAGTGGCAGACACTCCGAGGTCCAAAACTAAATTAAGAAACCCAATCGGGTGCCAAAAT 
CCAATTTTAAGTAGACGAAAGTAGCTGGAAATGCCAATCGGATATATTCGGGCGGGATTAGCATTGTG 
(SEQ ID NO: 61) 

Exon: 89.. 105 

Exon: 176.. 924 

Exon: 1025.-1252 

Exon: 12710.. 12910 

Exon: 12965.. 13303 

Exon: 17202.. 17499 

Exon: 17579.. 17996 

Start ATG: 89 

Transcript No. : CT21155 

ATGTTGCCGCGCCTGTGCGCCGACGCTTGCCGTCAGTGCTTCGCGAAGATTGCGAGGCGCGACACACATCGCGGAACCCGAACTCCATACGGCTGTGCCGACACTCAATCCC 
GACCTAAGCCCAATTTTTTGTTGCGTGAAGTTGACGAAGTGTGTTGTACTGCTGCAAGTGCAAGTCCCCGATTGCTAGTGCTATTCCGGGATCACAAACGAGCATCCTTCTT 
CGGCCTCACGATTGACGCCTTCTACCACTACCTGCGCCAGGCGCTGCCGCTGGCCAAGGAGGCTGCGATCCACCTGAATGCCAGCAACGAGATCTCCGCTGTCGGTGACGGG 
GTCACCATCACAGGGACACCCGGTGATCTTCTCAACTACAGCGGGCTGGAATTGGArCTCGGACTGGATTTGGATCTTAATCTGGACATGGACCTAGCCACTACTCCCAGCA 
GCAGCACCTTGGCGCCAGCTGTAACCGTCAGGACACCGGGGAACAGGAGCGTGGTGAGGGTCAGTGCCGATGTGCCCATCTGGGTGGTGCCGTGCTATTCGGCCATCCTTCT 
GTGCGCCGTGGTCGGGAACCTGCTGGTGGTGCTAACGCTCGTCCAGAATCGTCGCATGCGCACCATCACGAACGTTTTTCTGCTCAACCTGGCCATCTCGGACATCCTGCTC 
GGCGTATTCTGCATGCCGGTGACCCTGGTCGGCACTCTGCTAAGGCATTTCATATTCGGCGAGCTGCTCTGCAAGCTCATCCAGTTTGCTCAGGTCCCTATCCCCGTGACCG 
AGCCCCATAATCGCTTGTCAAATGGCCACGAACCAGCACGACCTCGAGCGGATTCGCCAGCAGATCGTGTTGGCCAACATTCAGGAGCTGATAAAGAAGATGACACGTCGCT 
GCTTCGACGTATGTATCGCTATGCCGGAAATGGAGTTGCGCTCCACGGAGCGCGACTGCCTGGCCAACTGTATGGATCGATTCATGGACTCGGTTCAGCTGCCTCGGTGGCC 
GTTTCCTCCTGGACACTGGTGGCCATCTCTTGCGAGCGCTACTACGCCATTTGCCACCCGCTGAGGTCGCGCACCTGGCAGACGATCAACCACGCCAACAAGATCATCGCCA 
TCATCTGGCTGGGCAGTCTGGTGTGCATGACGCCCATCGCCGCCTTTAGCCAGCTGATGCCGACCAGCCGACCAGGACTTCGCAAGTGCCGCGAGCAGTGGCCGGCGGATAG 
CCTCAACTACGAGCGGGCATACAACCTGTTCCTGGACCTGGCCCTGCTGGTCCTTCCTCTGCTGGCTCTGAGCTTCACCTATCTTTTTATCACCCGCACCCTGTACGTGAGC 
ATGCGCAACGAGCGAGCCATGAACTTTGGCAGCAGTGGGCCGGAGGTCACCACCTCCTCCTCTGCCGCTGTCGCAGAAGCCGGTAGCCAGCGACGCGCCAACGGAAGCCATT 
GTCAGTCCCTCGACACGATTGTGCCACACCAGCACAATCCTCACCAGCAGCACCACCACCATTCCCAGTACTACTATGATTACGGCCACTGTGGCAGCAAGCGGCGATTGAT 
CAGCGGAGGAGGACCCTGCGAAGGAAGGAGGCATCTATACTGCATGCGGAGCGCATCCGTGAAGTCCCTGCGCCATCAGCAGATCAACGGAGGAGGTGGAACCCTAAGTGGA 
ACTGGAGCTGGAAATGGCGAGTGCTGCAGCCGGGTGCACAGGATGCGCCAACAGATGCAGCTGCAGCAGCAAGGCTACGTGAGCGACAATGAATCCCGGCGCAAGTCATTGT 
CGCAGCCCAGTCTGCGTATCACGGAAGCGGGACTGCGGAGATCCAACGAAACCAAGAGTCTGGAGAGCAAGAAGCGTGTGGTCAAGATGCTGTTCGTCCTAGTGCTGGAGTT 
CTTCATCTGCTGGACGCCGCTGTATGTGATCAACACGATGACCATGCTCCTTGGGCCGACGGTGTACGAGTACGTGGGCTATACCTCCATCAGTTTCCTCCAGCTGCTGGCC 
TACTCATCCAGCTGCTGCAATCCGATCACCTACTGCTTCATGAACGCCAGCTTCCGGCGCGCCTTCGTGGACACCTTTAAGGGGATGCGGGTGTGCGAGCGCCTGTGCGCTC 
CCTGCTGCTTCTGGCGGCGACGCTCCAAGAACGAGACGAATCTTTCGGTGGCCGGTAACTCCATTGCGCTGGCCAACTCGGTCATGTCCAGCCACACGATCCTCGAGAGTCC 
GCGACTCTGA 
(SEQ ID NO: 62) 



MLPRLCADACRQCFAKIARRDTHRGTRTPYGCADTQSRPKPNFLLREVDEVCCTAASASPBLLVLFRDHKRASFFGLTIDAFXHYLRQALPLAKEAAIHLNASNEISAVGDG 

VTITGTPGDLLNYSGLELDLGLDLDLNLDMDLATTPSSSTLAPAVTVRTPGNRSVVRVSADVPIWVVPCYSAILLCAVVGNLLVVLTLVQNRRMRTITNVFLLNLAISDILL 

GVFCMPVTLVGTLLRHFIFGELLCKLIQFAQVPIPVTEPHNRLSNGHEPARPRADSPADRVGQHSGADKEDDTSLLRRMYRYAGNGVALHGARLPGQLYGSIHGLGSAASVA 

VSSWTLVAISCERYYAICHPLRSRTWQTINHANKIIAIIWLG3LVCMTPIAAFSQLMPTSRPGLRKCREQWPADSLNYERAYNLFLDLALLVLPLLALSFTYLFITRTLYVS 

MRNERAMNFGSSGPEVTTSSSAAVAEAGSQRRANGSHCQSLDTIVPHQHNPHQQHHHHSQYYYDYGHCG3KRRLISGGGPCEGRRHLYCMRSASVKSLRIIQQINGGGGTLSG 

TGAGNGECCSRVHRMRQQMQLQQQGYVSDNESRRKSLSQPSLRITEAGLRRSNETKSLESKKRVVKMLFVLVLEFFICWTPLYVINTMTMLLGPTVYEYVGYTSISFLQLLA 

YSSSCCNPITYCFMNASFRRAFVDTFKGMRVCEPXCAPCCFWRRRSKNETNLSVAGNSIALANSVMSSHTILESPRL* 

(SEQ ID NO: 63) 

Name: GASTRIN/CHOLECYSTOKININ RECEPTOR like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384801 

AGCTGTAAAGTACTACTACTACATATGTGTACATTCAAGTTTCGCTTACGAATGCAACGTTCCATTAAAGAAGCCACCAACTTTAAGATACACGCATATAACAACTCCTGAG 
AATTGTGAAAACTGATCTACTTTGATGATCAACTTTCATTGCTGCTGATTGAATAATTGAGTAATGAGTAATTGGTATGCTGATAAACAGGAAAACCAAAATGTCAACAGTT 
GATAAGGAACATTAAAACACAAATTAAGAGGACGGTAAGTCCTTCACTCGGACCTTTTCAGCTGAGCACCGTAAAGTGCTGAAGGGATTGCCCCCACACCCAGACCGCGCCC 
ACTTCCACTTCCACTTTAAAATAATCATCATGGCCCTCTTTTTTGGCCAGGCTGCGGTGGCTTTGGGCCTTAATTGTCGACTCAGTTGCTGTGCTGTGCTGTGTACTCGACA 
AGGCGACGAACCGAGAGCCTTGCAGCTCCATAATGACAGGCGGATGGACTCGAACACTCGGACCCGGACCCGGACCCGAACCCGGACCCAGTCCACAGCCAGGACATAAGCG 
GCTTGTGCTCCACTCAACTGTCGCCTTTCGGAGCTGCTCCTAGCTTGCAATTAAAATACCCACTTGGGCCCTATAAATTGTGAACGTAGGAAGGCAGGCGGAAAAAAAAAAA 
AATATTCCAGCACTTTGGTGTATGGAGTGGGGTCCACTTAATTCCCAGGTACTCACCCAAAAGGATTCCAACTATAAAGACATACAGCGAGAGTGCTTAAGTTTTACTCTGT 
TTATTTGCAGCAACCTTAGGAAGTTCAATTGAAACACCAATAGACTAGCGTCTGAGTTATTTTACGAGTACGATCATAATCACAAGTAGGTAAAAAAAATCCATATTCGCCT 
TACTTAAAACAAAAAAAAAAACCACACGCAGATCGCTTATCATACATATTTACACCGCATCTCGTTGATCCTGAATGGGTCCTTGGAGGCGAATCCTCTACAGCTTAGAGCT 
GAGGACTGTTGACGGCGGCGGCGGCACTATTGGTGGTCACCACCGCATTGAGAAACTCGTGGCGTGAGCGACTCCGCCAGGTGCCCATACCCATGGCTAGGCCGGGATTGAG 
ACTAATGTTGGTGTTGGCACTCGCATAGGCGCCCGGGCCTGCGCCCGCCTGGCTGGCGGAGAGTCCTCCACCAGCAGCACCACCGACGCCTCCGCTGGCACCTGCTCCACGA 
CGCCAGGGCAGACCCTTGAAGGTGTCGACAAAGGCGCGCCGGAAGCTGGCGTTCATGAAGCAGTAGGTGATCGGATTGCAGCAGCTGGATGAGTAGGCCAGCAGCTGGAGGA 
AACTGATGGCCGTGTAGTCGACATACTCGTACACCACCGGTCCGATCAGCATGACCATCGTGTTGATCACGTACAGCGGAGTCCAGCAGATGAAAAACTCCAGCACCAGGAC 
GAACAGCATCTTGACCACACGCTTCTTGCTCTCCAGGGTCTTGGCCTCGTTGGACCTGGATAGTGGATTGTGGATTGTGGAGAAATGGGAATAGGGAATAGGATTGGCAGAC 
CGGTTATCGGACACGAATCCTACCTGCGAAGTGCCGCATCGTGGACGCGAATGCTGGGCGAGGAGGTCTTGGCCAGCGTCACCGTGGTCGTCGTGGTGGTGGTGATCACAGT 
TGGAGCCGTTGGTCTCGTTGTCAAGGTGGTCGTTGCCATATTGGTTGATCCTCCGCCTGCGGATCCCTCTGAATTTCCATTATTATTGTTACTATTTTCTATGTGAAAAAAA 
AAAAACATGTGAACACAAGCCAAATATAATCAAACAAATTAACGTTATTAGCAGCAACAAATAAAACAGTTGCCATAACTAATCTAAACTAATACATAGACGTAAAATAATT 
AAATTTTCAATTTGGTTTTAAGAGTCTCAAAAATAATATAGCATATTATGTTAAGTGGATTTGTACCTTCTTATCGAGCATTTCACCAACAATCTATATCTTTAAAGTATTA 
TGCCATATTACATGTCATGTAACATTAAATAACTTATGAGTTATGATATTTAGATGCAAAATAATACTTGAATGCATTCATAAATGCTCTTAAAGTTGTTTGCATATTCATG 
GCTTTTAAGGATTGAAAAAGGATTAGTTTCGTGCAGTTACCAACCATTATAGACTGCGGTGGCGGTCAGGACCAGGATGCAGTTACTACTGCTGCTGGTGCCCGGATTCGGT 
GCGCTTCCGCCGGCCGTTGTAGCGGAAACAGGCAGCGATTGCTGCAGGATGCGTCCGCTGTCCTTGGCCATGCCTACGTACAGGGTACGCGTGATGAGGATGTAGGCCACGC 
AGAGGACGAGAAGCGGCAGGACGAGCAGCAGGAAGTCCAGCAGGATGTTGTAGAAGAGCTCGTATCCCTGGTCGGGCCAAAACTCACGGCACTTGCAGTAGCCTGTAACGTA 
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AGTTGGTATAACAACCRGftCGGTGTCTGCTGGCCAGGTTGCTATGCACTCACCCGGTCGACTGGTGGGTATCAATTGACTAAAGACCGCTATGGGCGTCATGCAGftGGftTGC 
CGCCCAGCCAGATGAAGCCGATGATCTTGTAGGCGTGACTGATTGTCTGCCAGGATCGCGAGCGCAGTGGATGGCATATCGCGTAGTAGCGCTCACAGGATATGGCCACCAA 
GGTCCAGGACGAAACGGCCACGGAGGCGGCTGAAATAAATAAATGAAAGCGGATTAACTTACAACAAACCGAACACAATGGACACACCCAGCAGCTCCTGTCGAGCGCCGTG 
TCAAATCGCTTTCATTAGAAGGTCCTTCGCCCGCGTTCGTTGCATTTTAATTACACAATTTGCATATTGGGACCAGACGAACCGTACCAACCAGACGGACGGAGACATCGCT 
CTTCGCACTTCTGCCTGTTCTTGTAGTATGACAAACACGATGCACGATGCAATTTGCAGAAAGTGCAATTAAGCAATTCCCCAGCCACTCCTTTTCTTGTTGCCCGGTCTCC 
TTTTCATGTAAGCCTGTTAGATGATTTCCCATTGTAAAGCAATTTGTACCCACACCCTGCCCATCCCTCATATTCTTTTGGGCTACCACTATCGATGGGGCTACACAAGTGC 
AAGTCAATAGGGTGGACTGCTGTAGGTGAAAAACTCGACACCGAAAGAAAAAGACCCGATTCGAAAAATTGGTCAATTTTAGCGCAATTAAATGAGAAATAATTTCTAGTTT 
TTCAATCGATTGTTTTATAGCACCAAATCATATATCTTAACCGTAAAGGCATCAAATAAATATGATAAACTGATCAAATGAAAGCATAATTTGCTTGTATGTATGCGTGTTT 
TTGGAATTTATAGCTTACCAAAACACTTGTGAAATTTGCAAATATACATATTTGTTCATACACAATTTAGACTTCAATGCAATAAGCTGTAAGGTCTATGGTTTCAATTATT 
AATACTCGGAAATTGTTGTGCTAGCAGATACATTTGTGGTTTGTTAGATATACATATTTGCTTATAAAGATATATATGTATGTACGTATTGCATTCAAAAGCAATAAGCAGT 
AAGGTAAATTGGATCAAGGGACGAGTATTTAAGACAGCCTGTATATGTTCCCTTTACTTTAATGCGACACCTCATTCAGGCAATTAAGGTCCACCCTAATGTGCTTATCCCT 
TCAGCCATCTGCAGCTGCATTGTATTATATTGGAATGGTATTTTCTCGCAAAATGATATGCAAGTTTTAGATTGATACGTACAATTGCAGTGATTACTACTTTAGTTACGAA 
TGATGATAGGTGATGTAGTAGATATATATTTGCTACCAAAGATAGTTACACCTTAGAATCCTAAAAGAAATAAGTGTTTTGTAGTCGCAGTTCGTTTCTCTAGTTCAACACA 
TCCAGCCAACCAAGGTCCACCCTAATGTGACTGCACTTCAGCTGCACTGTATTATATTACATTTTAGCTGCACAATTGTTTACTTTCTATTTTAATTATGCATGCAAACTAA 
TATGAGCAGTAAAGATGGATTACCTCCACCTGTTCCGTTGGAGTGCATCCCGTCCAGGCAACCAAGGTCCACCCTAATGGGGTAGCCCTTGCCCTTCCAACTGCATTGTCAC 
ATTGGTCTCGCCCrCTGTCGCGTTTGTTGAGTTAAACCCCTGAAATTGAAGTTTACTTTTGATTGACGGCGGTGTGTGATGCTCTAAGCCAGAAATGCTAGCATGTGTGCGC 
TTAAACTTAAAACAAACAGCAGCATGGCATTTC 
(SEQ ID NO: 64) 

Exon: 4080.. 4056 
Exon: 2717. .2517 
Exon: 2454. .2169 
Exon: 1736. .1001 

Start ATG: 4080 (Reverse strand: CAT) 
Transcript No. : CT21314 

ATGCACTCCAACGGAACAGGTGGAGCCGCCTCCGTGGCCGTTTCGTCCTGGACCTTGGTGGCCATATCCTGTGAGCGCTACTACGCGATATGCCATCCACTGCGCTCGCGAT 
CCTGGCAGACAATCAGTCACGCCTACAAGATCATCGGCTTCATCTGGCTGGGCGGCATCCTCTGCATGACGCCCATAGCGGTCTTTAGTCAATTGATACCCACCAGTCGACC 
GGGCTACTGCAAGTGCCGTGAGTTTTGGCCCGACCAGGGATACGAGCTCTTCTACAACATCCTGCTGGACTTCCTGCTGCTCGTCCTGCCGCTTCTCGTCCTCTGCGTGGCC 
TACATCCTCATCACGCGTACCCTGTACGTAGGCATGGCCAAGGACAGCGGACGCATCCTGCAGCAATCGCTGCCTGTTTCCGCTACAACGGCCGGCGGAAGCGCACCGAATC 
CGGGCACCAGCAGCAGTAGTAACTGCATCCTGGTCCTGACCGCCACCGCAGTCTATAATGGTTGGCGGAGGATCAACCAATATGGCAACGACCACCTTGACAACGAGACCAA 
CGGCTCCAACTGTGATCACCACCACCACGACGACCACGGTGACGCTGGCCAAGACCTCCTCGCCCAGCATTCGCGTCCACGATGCGGCACTTCGCAGGTAGGATTCGTGTCC 
GATAACCGGTCTGCCAATCCTATTCCCTATTCCCATTTCTCCACAATCCACAATCCACTATCCAGGTCCAACGAGGCCAAGACCCTGGAGAGCAAGAAGCGTGTGGTCAAGA 
TGCTGTTCGTCCTGGTGCTGGAGTTTTTCATCTGCTGGACTCCGCTGTACGTGATCAACACGATGGTCATGCTGATCGGACCGGTGGTGTACGAGTATGTCGACTACACGGC 
CATCAGTTTCCTCCAGCTGCTGGCCTACTCATCCAGCTGCTGCAATCCGATCACCTACTGCTTCATGAACGCCAGCTTCCGGCGCGCCTTTGTCGACACCTTCAAGGGTCTG 
CCCTGGCGTCGTGGAGCAGGTGCCAGCGGAGGCGTCGGTGGTGCrGCTGGTGGAGGACTCTCCGCCAGCCAGGCGGGCGCAGGCCCGGGCGCCTATGCGAGTGCCAACACCA 
ACATTAGTCTCAATCCCGGCCTAGCCATGGGTATGGGCACCTGGCGGAGTCGCTCACGCCACGAGTTTCTCAATGCGGTGGTGACCACCAATAGTGCCGCCGCCGCCGTCAA 
CAGTCCTCAGCTCTAA 
(SEQ ID NO: 65) 

Start ATG: 1 (Reverse strand: CAT) 

MHSNGTGGAASVAVSSWTLVAISCERVYAICHPLRSRSWQTISHAlfKIIGFIWLGGILCMTPIAVFSQLIPTSRPGYCKCREFWPDQGYELFYNILLDFLLLVLPLLVLCVA 

YILITRTLYVGMAKDSGRILQQSLPVSATTAGGSAPNPGTSSSSNCILVLTATAVYNGWRRINQYGNDHLDNETNGSNCDHHHHDDHGDAGQDLLAQHSRPRCGTSQVGFVS 

DNRSANPIPYSHFSTIHNPLSRSNEAKTLESKKRVVKMLFVLVLEFFICWTPLYVINTMVMLIGPWYEYVDYTAISFLQLLAYSSSCCNPITYCFMNASFRRAFVDTFKGL 

PWRRGAGASGGVGGAAGGGLSASQAGAGPGAYASANTNISLNPGLAMGMGTWRSRSRHEFLNAWTTNSAAAAVNSPQL* 

(SEQ ID NO: 66) 

Name: GASTRI N/ CHOLEC YSTOK I N IN TYPE B Receptor-like 
Classification: Gjprotein_linked_receptor 



Celera Sequence No. : 142000013384546 

CAAAATAAATGTATATAATTTCAAAATTGACATTTGCTAATGATAAGCCTCTGCCTCATGCCCATGCTTATGCTGAAATATGGACAAAATTCGGGCACAGAGCAAGAGATGC 
GCAGAGCAGCTGCCATCCGGAATGGAATGAGTTGGAATTTGGGCAGCAGATGGCGCTGTCGCAAAAGGAAGGAAGCTAAATTTCACTGGGCCCCAACCAAACGGAAATTTCC 
ACCCACGGCTCAGAAATTGAAGCTGTTTATGGTAAATTAAACCAATTGAGTGGGATTACGTTTTGCGCACTACAACAACGCCACCAACCTCCGCATCAACTCCGCAATCTCC 
GACACCCCGAAGTGTTTCCAACTGGGTTTCAATTTCACTGTCTGGACGAGCCTGCGGCTAACGAAATTTATATACCCATACAATGCCATCAAAGCCACAAGGTGTTTATGGG 
CTTAACTCCGGCCCTGATTCCGCCTACTTGGCTGCCTCGCCTCACAGATTTGGATTTGCATGTGCCACGATTTGGGGCATGCGGCTGAGAAGGCGGCTGAGCAGGCGGATAG 
AAGACAAATAGCCAGCACGTTGCGGTTCATTTGGCTGTCACAACAAATGCCGAATTGATGCAAATGTAAATGGATTCGATATCCAGGAACGCGTCGTCGGAATCCCAAGCAG 
CAGTCAAGTGATGAATACACGGCTTAAAAGATACTGTATTAACTTACAGTAGCTTACCTTCAAAGCAAAAATTGTAGATTTCATTGTCTAGTAACGAGTTCATATCCATACC 
TCTTTAATATTGGATACTATTTATTACCCAGTTTATTACTTAGTTTTTATTTCAGAATCAAAAACAAATATAAGCAAGCTTACCCCATTTAAGTACTTGGCAATTGCAAATG 
TGCACATACCACTTTCTTTTCCATCCACAAACTGGCTTTCAAACTGCTGTTGGCCAACTCATAAATATGGCAGACAATAAATCTAGATTCTTTTTCAATTGCAGGGTTTGGC 
AAACGATTGGCAATAATAGGAGTCATCAAAACGAGAGCAAATGTCATACAGCATGTCGTCCTTATTCCGTGGCGTTAGTGCGACATTTGGAGTTGGAAACTAGGAGCCCACT 
TAGACTAACAACGCCCGTGACCATTACCATGACGCTACTGCAGAGGCTTCAGGCCATGTCGGCCACCACGACCAGGACAATACTGGAGGGCAGCATCAGCAGTTTTGGTGGC 
GGGACAAATGAGCCTCTGGCGAGCAAAATACCCGTTCTGGAGGAGTCAGCCTCACATGCCAGATATTTGAAATTCSTTGCCGACGGGCTCATCGACGAGGGACTGGGCAGTG 
CGGTGGGCAGTGGGAGCAGCATCGCCGTATCCGTTGAAGACGTGGTCGCCGGACAGGCGCAGGACATCCAGGCGAGCGAAGGATCCACCGACGACGCCGACGGCAGTAGCCA 
TTTGGCATTAGTCTTCGTCAAGTGTTTCATTATTGGTTTCATCATACTGGCCGCCATCCTGGGCAACATGCTGGTGATTGTGTCGGTCATGCGGCACCGGAAATTGCGGTGA 
GTCCTTGCCATTGTGCGTTCTAGCCATATCCGTTGGTCAGGACATGTGTGTCCTCAAATGTCTGGGTTTCATTAATTTGTCGCCATGAATCAGGTGGCTTTTTTCCCTTTTT 
GCGCCATGGTTTCAATTTCATTGTGTGCATTTGGCAGGAAGCTTCCAACTATAAGTTACTTAGATAATAATTTAACAAAGCATGAACTTCAACATCAAAACATTAGTGAGCA 
AATATTAAAAGAGCCCTTACTTTCGCTTTTGTTTAAGGTAAATTGCCCGAGCTGTAACCTGCCACATTGTCCACGAATCGTGGGTGTTTTAGCTGTCAAGGACATGACTTGT 
ACACCCCACCCCCCCCGCCCCTTTTTCCTTTTATTTCTTTGTTTGCTCCTTTTTGCCGAACGATTTTGTGCGATTTGACCGCGTTCGGCTTAACGAGTAGCTACCTCCTTTT 
TTTGCGGCCATTAATTTGTCATCATTTATCATATTTTCCATCATTTTTGCGTGTCAAGCGCACAGAACTCATCTCGTTCGCCCACCGAACACTTCAGAGGGGGGCGTGGTTG 
GTTGACTAAGTTAAATGCTAGATACGTGCGTAATTACATTAGAAATGTTAATAAGGGGCGAGCATAAAAATATTGTTGCCTACTTACGGGGGCGCCATTGGCATTTACTTTA 
TGTGCGGCCATATAATTTGTCAAATTTATGAATTGGGCGCCACCTTCACATGAGATATGGCCACGTGGGCGTTTGAGGGATTTCCAGGTATCCAGGTAGCAAAGTACGTACG 
TGTTTCAGTCGCAAATTCAAAAGCCCAACACCGCTGGCATTTAAAGAACCCAACATAAAACACTTGGAGAGAGGCGACAATGGTGTAACCTAACCCCGGGACATTCAATGTC 
AAGGCGCAGACTATTAAAGCAGCCCCGGGAAAACGTCTTCCACTGTCATTGCCGTATTCCTTTCGCCTTGGAACCAACCAACGCACACCTGACATTTGTGGCTGCCACCGAG 
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AATGAGGATGGGCATGGTGATGAGCATGGGGGATGAGGCTCATTTAGCCTGCCAGGGACAGGCftATTTGGGACCTGRCCCTGACTACTACGTGTATGCCATGTTGCCGTTTT 

TATCTCAACTGCCTTGCGCCAGCATCTCTTTCATTATTGTTGGGCTCATTGTCATGTGCGTTTTCCGCACCATTTTCACCTTTTCCGCCTCGCTAGGCAGTTTAAATCATCA 

CGGCAAATTAGACTTAATTACGAAACAACATGCATTCCCACTTAGGTCCCATGCTACCTAAAAACCTTGTGACCAAGCGGAGTAGCAGGTATCTATTTCAATGTTCTGGATA 

AGCCATCTGGGCGACAGTTGTGCGACTTTTTCACGACTGGCCACTCAACTTACATTATTCATGTCACACGCATCGATAAAGTGGGTGACAAATTAAAGTCAGACTGCAGATC 

AAGTACGAGGAAGAGCCGGGAAAGTGTTTCCTCACTTTTCCTATTAATTAGAGCTTTCCACGTTGGCTGCCTGCCGAAAGTGTCGACTCATAATTGTAATCAAACTTGACAC 

CCAACAAGTTl'CCTCCTTTCATTCCCGAGCACCCACTATATAATCGGCTTTTGGGTAACGCCTCAATAACATTTTCGTGTTGCCTCTGATGCGGGTGTTTTGCCTTTGTACT 

TTTTGGGCGCTTTAAGCTTGCTCCAAAAATGTAGACCTTTCGGGGCGTCAGTTTGTGGCCGTAATTAGGTTTGCGGGAGATGGTTTAATGTCCCAGTGAAATATGGCAAAAT 

GGCGAGCGATTGTTAACGAGTTTAATAGATAAAGGTCGGATACACATAAAAAGAATATCCATGATTGATGCAGTTTAAGAAGTTCCCATCCAAAGTTCCTTCATTCAAAAGT 

TTATCTGGCACATTAATCCGTTTAGACGGAAAGCAAAACTTTCATTTGCGATGTTGTGGTACGAAGAGTCAAGTTCCCGCAGATGTTATGGTTTTAAACATGAATTCCAACC 

GCCCATTCGGTTGCGGTTATATCCGTGGAATGGCGTTATGTACGCCTGATTGTTTGGAAATCAGTGCGGACGGCCGATAAACGCATTTGCTGCTCCACATAGCGTATACGCC 

GCGTGCACCTAATTGACTTAACGTTTTGCGGTGCCACGCAACGTTGATGATCGCTGGTGGAAGTTGCGGTGGCGTTAAGCTAATTATTAATTAACCCATTAAACAATGAAAT 

CCACTCAGAGAAGCGCCTTTTCGTACATGAAACGGCGGTCACACGCGGCGTATGCGCAATGTGCGCACCAAAGTTTCCAACTCAAGTTGGACAGCAATCCGAAGTAACGATT 

AGCGCTGCGAGCTCTGAGTGACTTGAAAGCTTAATGCACAAATTTAATTTGAGAACTTGGCACTCTCAATACCACTGGCACTCAGGCTTTAGGTTCATTCTGCCATTTTTGG 

TTGACTGTGTGGCTTCCGCCTTGGTTCCTGCTAAGTTTTAACCTAGCATTTACTCCTCCTCCTTGCAGCATCATTACCAACTACTTTGTGGTCTCTCTGGCCGTCGCCGACA 

TGCTGGTGGCCCTCTGTGCGATGACATTTAATGCTTCCGTCATGATCTCGGGCAAGTGGATGTTTGGTTCCGTGATGTGCGACATGTGGAACAGCTTCGACGTCTACTTCTC 

CACCGCCAGCATCATGCACCTCTGTTGCATATCGGTCGACAGGTGAGTCCCGGGTTTACTTTGTCATTGTGGAATAATTGAAGTTCCGAGTGCTACTGGGATGTCTTTTATA 

CCTTCCTTTAAATGCCCATCAAATTGAATAGTTGCTTGACAGTTTCTAGTGACAATCTTTTTTCCCCTTTACCCGTAGATACTACGCCATTGTGCAGCCACTGGACTATCCA 

CTAATCATGACACAGCGACGCGTGTTCATCATGCTATTGATGGTGTGGCTATCGCCGGCGCTCCTCTCGTTCCTGCCCATCTGCTCGGGATGGTACACAACGACCGAGAACT 

ACAAGTATCTCAAATCGAATCCGCATGTGAGTAGCTTACTTGAAGCCCTTAATTGGGACAATTAAATATTTGTACAATAGTGTATAACTCTATTATGGAGCCCACAGATTTG 

ATGAGTAGGTCGATCCGGTGTATGCACCTACATTATTTATTCAATTTTCTTGCACATCAAACTAATAGAAAATTCTAATTTAGTTGAAACGGTATCGACTTTGCATGCATTT 

TGCATGCCTTCGATGCCACGTGGGCGTCCATGTGGCAGTGGCAACCTTTGACTTTTCCCCCGACCCGCCTCAACGGAAATTGATTTATGTTTGTGTGGCTGAAGCAGTTTTA 

CGCCTTTCAGGCTGCCAGCCGACAGGTAGCATATTTATTACTCCATTTTATTGGTTTCCTCCGCAAATACTTGGCAAACTCCTTGGCAGCACCTCCTCCATCTACAAATATA 

TACATTTATATATATATATATATATATATGTTCTCTGCCCGTCAGCAGCTGCTGGTACTGATTCCCCCCGCCACTGCGGTTATCATTGCGCAATAAAATGCTGATGTAGAAA 

TAAATGTAGGTAATTCCATTCCGGTTTTCTGCGCTCCCATTCAGGCTGACTTCAGCATAGGCTGTGCATAATTTTCGGCTCTACACCTGGAAGGACGCCTGAAACGCCTGAC 

AGGCCTTTGCATACATTAGCATCCGCGGTGGGCTTTAATTAGTTTTACTTGTTTATGCTAATAACTGTCAGCGCGAAGCCGGCAGAATGTCCTGGCCGTAAATCTGATTAGA 

TGTCTTCTGGTGGCCTTTTGTCATGGCAGCGCTGTAATGCACAGCGTAATTGAGTTTTTATTGTCGCACTTGGCATTGCAGTAATCGCATTGCCATTGCAATTGCCGTCGCA 

ATAGCCGTTGCTCATACGCAACGTTGGCCGCCGCCAGCCGCAGTTGGCAATTGCCGAGCATTACGCTCCATTACGGTTCGCCACTGGCATCCGACTGCACTGCTCCAACGTT 

CATTTTGATTTATTGATAGTCGCTAATTGAGTTGCCCATCAAATCGCAACGATTTCGATTCCGATTTCGATGCTCCCCTCACTCCATATTTCAGATATGCGAGTTCAAAGTG 

AACAAGGCATACGCCATAGTCAGCTCGTCGATGAGCTTCTGGATTCCCGGCATCGTAATGCTGTCGATGTACTACCGCATTTACCAGGAGGCCGACCGACAGGAGCGTCTGG 

TGTACAGGTAATAGCTATTCGAGTGGCCCCTCCATTTGCCCGCTTCCAATTAACTCTAATTACTTACTGTGTGCGCCCAGATCCAAGGTGGCCGCTCTGCTGCTGGAGAAGC 

ATCTGCAAATTAGCCAAATTCCCAAGCCCCGGCCGAGCATTCAGGTGGAGCAGTCGACCATCTCGACGATGCGGCGTGAGCGGAAGGCCGCCCGCACCCTGGGCATCATCAT 

GAGCGCCTTCCTCATCTGCTGGCTGCCGTTCTTCCTCTGGTGCGTATACTTGATGGCTCAAATTGGGCAATTATCGCTAGCACTCTAAGGATCAAGTGCCGAGAAGTTTTTA 

ATTAACTCGGGCTCATATCAAATAATTTGCACTTTCCATGCTACAGGTGGGCGGAAAGTGTGTATAAACAATTAACAATACAAGTATGTTTCCTTGGGATTGAAATATAAGA 

AAGGCAATGTAGACTTAAGCAGTAGTTATTATGCACTATTCAAGTATCTATTCCACCTACTTAAAAGTAAAGAATCTAACACTTTCCCTTTCTCCAATTAAACCGCAGGTAC 

ATCGTATCCTCGCTGTGCGATAGTTGCATCACTCCGCGCCTGCTCGTTGGCATCCTGTTTTGGATCGGCTACTTCAACTCGGCCCTGAACCCCATTATTTATGC ATACTICA 

ACCGCGACTTCAGGGCCGCCTTCAAGAAGACCCTCAAGGTGAGTTGCGGGCTGGCCATCGAGTAATTTGCATATACCTTCCACTTCGACATCTTGATGGGACTTTTATTCTA 

TTCAATTTCGCTTTGTTCGGGCATCCAAAACTGAATTCTCTGCACCACCTTAGACTTAGTACAGGGCTTCGGCAACTGCTGCAATCGCAAAGGACAGGGCCAACGCCACCCA 

TTGCACAGAAGCGGCGGCGGCCACCATTTCGACAACAGTATCGACTGGCAACGCGGAGGCGCCACCACCAGCGCCCCACCGACGAGGCTCACGTTTAGCAATTATTTTGGTA 

GCTATTGCATAGCATCACGGGTGCACTGAGAAAAAACATGAACAATTTATCAAATATATTAAGTAACTTGGAGTAACTTAGTTCCAGTTACACGACTAACAAATAAGAAAAA 

AGATATGGAAAAAAGAACATTGTCTCCTCATTTTTTCTCAGTGCAGCTCAGTGTTCCGCGGAGAGCGTAGATAGCGTAGCGTTCATATTAGTTGATTGAAATGCTTCTTCAT 

CCACATCTACTGGTTGCTAAGCAAACAAAATAGCTTTTCCTCTGCCTGAAGAACTGAAAACTCTGCACGTTCTTTGCTCTACAGCTCTACAGCTCTGCCTGCGCCTGCCTCT 

GTCTCTCTATCAGTTTCATTCTCGAGGGTATCTTTCGGGCATGGTGAATGGGACAGTGAATGGGGGTTGCAATGCTTGGGCTTTCTAATCGCTTGAACACTCTGCTTGCCAC 

TTACCATCGTATACTACCATTTGCCATATATAGTGCCATATACCATATATCATTTCTCTTTGCACGCATAGCTGGCGCTTACTGTGCTGATTCGTTGCCTATGGCCCAACAA 

AAGCTAAGCCCGCCAAAGTATGCAACAGCAAAACAGAAAGCATAAAACGTAATATTTAAATTAAAACGGAACCTCTTCTTTGTATGTATGCCCAGCTACAACCAGCTACACC 

TACACCTTCTCCACATTCATACAAGCGTAAATGCTGATGATGAAATGTTATTCTTTCTCTAAAAATTCTCCCTTTTATGCTGCACACTTCGACACACCAGCAACACCAACAC 

CAACACACACACACACACTGAGGCACATGGACGGGTCCTTGTTCGTCCCTTTTTTTTATATGAATGCGCGGCCTTTCATTTCGTTTAGCATTTTTTGTGTGAGTCTGTTGTT 

TGTTTTTTGTTTCTTTCCCTTTAGCCTTTGTTTTCTGTGTGAAACCAAATTGTCACAGACATGTGCCGCCAGCACCCGAAAAAAAAAAACACCATACCCCTGTCGATTGTGT 

CGCCCGTCCCACGCGGCGTATGCTTAACCTTTGAGTGCTGTACAGTAGCGAGCTAACTCTATTAAATTACATAGCGTACTCTAACGGTAGCATGCGAGTTTATCTCATCTAC 

TCATAGTTGGGTGTCCTCTTCATTCCACAGAGTCTGTTTCCCTACGCTTTCTACTTCTGTCGACGTGGCAGGGGGCGAGACGATGACCGGGATCTGGAGTTCGGCGGTCCCA 

GCCGACGGGGAACCAATGGAGCCCAACGGACCGGATCCGGATCCGCCGAGATGGCCAACTGCGTCAACTCCACGGCCTCGTCGGAGATACACATGAGCGTGATGCGTGCCCG 

CCAGTATGCCGTCAATGTCACACCCACCACGGACGCCCAGATGCAGCAGCTGCATCCCCTGTACACCAACTAAACGAAAAACACATATGGCGTATACTTGATTTCAACTCCA 

GTGCTTCACAATGTCTGTTCATGAGATGAGTGCGCAGTGCACAGCAAGTGTTCTCCAGAAAACCAGATTGCCAGCTTACAAATACAAATCACTCGATAAAAACATTCCTCAA 

AATGGCTGCTAAGAGGGCTAAACAAGAATTTAACCACAAATAATAGCTAAAGCAACAGCCACAGCAACAATACGACAATCCCACAGATTAAATAAAGCCTTCAAGAGCAGCA 

AATTGACGTGCAAAATAAATTGAAATCTTATAGAAGTATGTATTTCGACATACAATACATATTTAGGTGTCATGGAGTCAGGAATTTCATAACGAGATGTTTATAAATGCGC 

ATGCAAATAGATATATGAATTTGAATGTTTGATATBCGTATATAAAAAGATAGATGTTAACCCACAAAAAAAAAAATAATAAAATATGAAATATGCATTGATTGGTAATCTA 

AAACGAATCGCAAATTAAAGCTACACTGTGCAACAGTGGTATGTAGTGAGAAAAAAAGTGTAGGTTCAAGGTCTGGTTGAAAGAAAAAAATTTTTAATCTGTTCAATTATTT 

GAAATATTAACTTTAAATACGATCAAACTTATCTGGTCCTTTCACATTTTATTCGATTAGCGAATAAAGACTGACAATTTTATTTTATCAGTGACTTAAAAATGATTTAATT 

AAAACATTATCCACATATTCAAAGTACTTAACCAGGCCTTAAATTCACTTTAACTCTCCAATAAACAAAGATAACTAATGTTAGTTGTAGCAGAGTGTTCGTATTGTTTGTA 

AAACTCATACTCTCAGTTCTGTTAGATGTAATGACAAAGCTTTGATTTGATTCGTTAAATACATTAAAATAATCTAGTTCATTGAAAAACAATCAAATTATATATGTCATAA 

TACTTACATACATAGGTATTCATATTCATAAAGAATACATATAGGGCCTAAAGCGCACGACTAAC 

(SEO ID NO: 67) 

Exon: 1001.. 1564 
Exon: 4101.. 4298 
Exon: 4447. .4618 
Exon: 5695.. 5831 
Exon: 5905.. 6087 
Exon: 6381.. 6534 
Exon: 7983.. 8249 
Start ATG: 1149 

Transcript No. : CT21432 

GGTTTGGCAAACGATTGGCAATAATAGGAGTCATCAAAACGAGAGCAAATGTCATACAGCATGTCGTCCTTATTCCGTGGCGTTAGTGCGACATTTGGAGTTGGAAACTAGG 
AGCCCACTTAGACTAACAACGCCCGTGACCATTACCATGACGCTACTGCAGAGGCTTCAGGCCATGTCGGCCACCACGACCAGGACAATACTGGAGGGCAGCATCAGCAGTT 
TTGGTGGCGGGACAAATGAGCCTCTGGCGAGCAAAATACCCGTTCTGGAGGAGTCAGCCTCACATGCCAGATATTTGAAATTCATTGCCGACGGGCTCATCGACGAGGGACT 
GGGCAGTGCGGTGGGCAGTGGGAGCAGCATCGCCGTATCCGTTGAAGACGTGGTCGCCGGACAGGCGCAGGACATCCAGGCGAGCGAAGGATCCACCGACGACGCCGACGGC 
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AGTAGCCRTTTGGCATTAGTCTTCGTCMGTGTTTCATTATTGGTTTCATCATACTGGCCGCCATCCTGGGCAACATGCTGGTGATTGTGTCGGTCATGCGGCACCGGAAAT 

TGCGCATCATTACCAACTACTTTGTGGTCTCTCTGGCCGTCGCCGACATGCTGGTGGCCCTCTGTGCGATGACATTTAATGCTTCCGTCATGATCTCGGGCAAGTGGATGTT 

TGGTTCCGTGATGTGCGACATGTGGAACAGCTTCGACGTCTACTTCTCCACCGCCAGCATCATGCACCTCTGTTGCATATCGGTCGACAGATACTACGCCATTGTGCAGCCA 

CTGGACTATCCACTAATCATGACACAGCGACGCGTGTTCATCATGCTATTGATGGTGTGGCTATCGCCGGCGCTCCTCTCGTTCCTGCCCATCTGCTCGGGATGGTACACAA 

CGACCGAGAACTACAAGTATCTCAAATCGAATCCGCATATATGCGAGTTCAAAGTGAACAflGGCATACGCCATAGTCAGCTCGTCGATGAGCTTCTGGATTCCCGGCATCGT 

AATGCTGTCGATGTACTACCGCATTTACCAGGAGGCCGACCGACAGGAGCGTCTGGTGTACAGATCCAAGGTGGCCGCTCTGCTGCTGGAGAAGCATCTGCAAATTAGCCAA 

ATTCCCAAGCCCCGGCCGAGCATTCAGGTGGAGCAGTCGACCATCTCGACGATGCGGCGTGAGCGGAAGGCCGCCCGCACCCTGGGCATCATCATGAGCGCCTTCCTCATCT 

GCTGGCTGCCGTTCTTCCTCTGGTACATCGTATCCTCGCTGTGCGATAGTTGCATCACTCCGCGCCTGCTCGTTGGCATCCTGTTTTGGATCGGCTACTTCAACTCGGCCCT 

GAACCCCATTATTTATGCATACTTCAACCGCGACTTCAGGGCCGCCTTCAAGAAGACCCTCAAGAGTCTGTTTCCCTACGCTTTCTACTTCTGTCGACGTGGCAGGGGGCGA 

GACGATGACCGGGATCTGGAGTTCGGCGGTCCCAGCCGACGGGGAACCAATGGAGCCCAACGGACCGGATCCGGATCCGCCGAGATGGCCAACTGCGTCAACTCCACGGCCT 

CGTCGGAGATACACATGAGCGTGATGCGTGCCCGCCAGTATGCCGTCAATGTCACACCCACCACGGACGCCCAGATGCAGCAGCTGCATCCCCTGTACACCAACTAA 

(SEQ ID NO: 68) 

Start ATG : 149 

MTLLQRLQAMSATTTRTILEGSISSFGGGTNEPLASKIPVLEESASHARYLKFIADGLIDEGLGSAVGSGSSIAVSVEDVVAGQAQDIQASEGSTDDADGSSHLALVFVKCF 
IIGFIILAAILGNMLVIVSVMRHRKLRIITNYFVVSLAVADMLVALCAMTFNASVMISGKWMFGSVMCDMWNSFDVYFSTASIMHLCCI SVDRYYAI VQPLDYPLIMTQRRV 
FIMLLMVWLS PALLS FLPICSGWYTTTENYKYLKSNPHICEFKVNKAYAIVSSSMSFWIPGIVMLSMYYRIYQEADRQERLVYRSKVAALLLEKHLQISQIPKPRPSIQVEQ 
STISTMRRERKAARTLGIIMSAFLICWLPFFLWYI VSSLCDSCITPRLLVGILFWIGYFNSALNPI IYAYFNRDFRAAFKKTLKSLFPYAFYFCRRGRGRDDDRDLEFGGPS 
RRGTNGAQRTGSGSAEMANCVNSTASSEIHMSVMRARQYAVNVTPTTDAQMQQLHPLYTN* 
(SEQ ID NO: 69) 

Name: 5-HTlA/Ocr-like receptor 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384515 

CTGGACGCCGGACACGAAAGCAGATCACCTGCCGTACAGTGCCAAGATCATCGAACGCATTTACAATGAGGAGATCGGTGATGGCGGTGGCCACAGTCCACGGAGGATAAAC 

ATGCTGGAGTTCAGTCAGTATCTGGAGCAGTACCTGTGGCCACACTACCAACGTGAAACCGCCACCCATGCCCACCTTATGTCCATCGTTATAATGGCCAACGAGAAGTTCC 

GGGAGCGCGTCGAAGTTTGGACCGTGTTTGAAAAGCTCCCGGATCAGTATCCAGCCTTCTTTCGCCACGTGCTAGAGAGCTGTTTGCCGAGCAAAAAGGCAAAGGAGGCTAG 

TAGCACGCTAAGGGAGCGGACAGCGCTGTTGATGTTCATTAACCACTGCTTTAACAGCATGGAGATAGAGCTGTGCAGAGAACAGGCCAAACGACTAGTCTCCTTGTCCATG 

TGGCATTGCCTGCAGCCTCGTAAGTTCTTCTTCGCAACTCTTCACATCTATCAAAGTGATCAAAAATGTATTTGCTTGCAGGCCGTCGCGAACAAGAGCTTCGAGAAGTTCC 

TGAGTGGCGAAAGTACTGGAAACGCTTGCTCAAGAAGGAGAAAGACAGCAAACCGGAGGTCCTCTGGGAGAGACACTTTATGCAGAACCTTATCATTGACTTCCTGCACATA 

CTCGAAAGCATTCCCGCCGAAGGCGAGGTGCCGCGCAACGTGGTTCACTACTGTGAGCGCTTCCTAGAGTTTATTATCGACTTGGAGGCACTTTTGCCGACTCGACGCTTCT 

TCAACACGGTTTTAGACGACTGTCATCTGATTGTGCGGGCTCTTCTGTCACCCTTGGTTCGTCGCGAGGAGGGAAAACTGTTTGGTCAGGTAGGTGTCAGACTTCTCATTTA 

ATAACTATCTCGTACAGCATAAATTAGTTAAAATTAAGTACAAATCTCAATGTTGGACATTGCTAAACTACGAAAACACTTATATTAGTGTGGACGGTGGCTCCTCAGTAAT 

CGTTGCCGTTCATATAATGCAGCTCATCGCCCCAGTCGTTTGCCGACGGTGGCTCATTGTAGCAGCAGGTCTTTTTCAGCAGAAATGTCACATGTCGCTGGCGCAGCACGCA 

TATGTATAGCAACAGCGGCGTCTGCAGGGCGTTAACTACGATGTGGGCGTAAAGCAGGCCTTCCATCTGCAACCAGGACATTATGAGGAATAGCCAGGCTATAGACATTACT 

AACAGCATCAGCGAGAACATGATAAAACTGCAAGGAAGATAATGTTTTCATATACTTATATGTAATGCCTAGATGCCGATCTACTTACTTGGCTTTCAATTTGTGCGCAATG 

CGGCCGTATACCGTCCTGCGGTTAATCAACTTCCTGGTGGTCACATAGAAGAATATGTTTACCAAAATGGTACAAGCAATTGGAGCAAAGAATATGCATATGCCTAGCCAGC 

CGATCGTCTCTTGCTCGCCCACCATGTGCTCCTGTTTGTAGGATTCGGCGTCGAGGAAGAAATGTGCGAAGACAGCCAGTGCAGCCATCGTTGCTGTGCATCCCCAAGCATA 

CGCAGAATAATAGCAGTACTTCCTGCCGTCCGTGACTCGCAGGAAGACGTTCCTCGAACGAAAGGTCTTCCAAATGTAAAAACCGAAACTGTTCAGCCAGAAGAAAGCTGCC 

AAGAGGCTGAAGCACAGGATAATGTCCGCCACAATGAAGCTCACATGGCTGGTCAGCTCGGTGAAAATTCTGACAAGGTCTGCTGCCTGGCTTACCATAAGGCACATGGCTA 

TCGTTGTCACAATGTTACCAACCAGATCTCTGCTGAATGGATTGAAAATTAACGCATGAAGCAAGTCAAATTCAAACTATTGCTCACCTGAGAGTAGGAAGTATAAAATAGA 

TGATGGCGATGACCAGCAGGATAACAAGCGAGATGCCATGGAAAATTGGGTTTAAGATTTTGCGGAGCAGAAAGTTTGAGTCACTCCACTTGATCTCCTTGCGGGCCAAGCA 

AATGTTGGCGAATAGCACATTTTCCTCTCCAGTACTGGAAGTGGCCTGCGAATGGATTTTATGATCATCCATGCAATTTATGTGGACTTTAGCTACTCTTACCTTGTCAATG 

CAATACAATCCCTTATCGTAGTCATGGTATAGCGGTTCCAGACTGCCGGCAATATCCTCCTCATAATCCGATTGTATTCCATGACGCTCCTCTGCTTCGTTTTCGGCATTGG 

TGTAATGCCTTAGCCTCCCATCATCCAGTAACACTAGCTTATCAGAGCTCTAGAAAATATAAAAATATCTATAAGTATCTAAAAGTTTCTATGCAAAGAACTTCAAACTCAC 

GCCAGCATAGTGGTATATTGGCCACATCTGCATGGATCCACAATTTGGAATCCCGATGACGAATTTGAACTTAACGGGCCTGTTTTGCTCTCCTCCATAGCTGGTGAACATC 

GGCTGGAAGTAGTCTGCAAATGGAAACGGACGATAAATAGAGAACACAGACCGTGGAAAATCATAAAGGCAGTTCAAGTTAACGCTGCTTTTGTTGCGGCTGGTGGATCAGA 

GGGGAGCCAGATTTTCTTTTCTCTCTGCGAATATAAATATTTACGCTTCTGGGCAAAGAACTTGAGCTGTTCCATGCGCCTTCATCAGCTGGTTTTGCAATTTGTGCAGATT 

GGTTCTTCTTCAATTATCGGAGTTCGGATACATTTTTTTCCACTCAGCCATTCATATGTTAATCATAGCTAAGCACACGGAAATTAGATTGCATGTTGCGATTATGTGCCCT 

GTTGTATTGTAGTATTGGGATTACTCAAATGACTACAATTAAAATCGGATGAGATCAGTACAATGCAATGATTCAACTATCAGTAATAGTTACATTTTTATGTGTCACCTAT 

ATGGCCGATCCATGGTCATAAATGTTTTATCCAGTTTTAGTTGTCCGAGCTAGAATATCGATATAGATGCTAGAGTACGCTTTTCTATTTCCCACAATTGTTGCGATTGTTT 

GTCAAACATTCACCCAAGGCAATATATCACTATATTCACCATTAGCTCCACAAACATTAATTACCAGGGCTCGCATTTATGGGGAACAGAAGCATCTTATATTACCTTACTA 

ATGGATTTTTTGCCTCATTTTACTTCCAAGAGATAATATTCCCGATTTGGGAATCCCTCATCAAGTGGCTACTTTATAGTTTAAACAGAAACATGCCCCACAGCATAAATCA 

TATTTCTCATCAGGATTGGGGCCACCCTCTGTTTGTTTTAAGAACTTTCTTGGGGGAGTGCGAACTGTGGAGTTGGGGATATTTAGCTGTTGGCTTCGTTCCACTGATCCAC 

TGATAGCCAACAAATGGGCCATTTAATGTTGGGCATACATATGTTGCATGCCAGAAGGGAAAATCCATAAAGTTGAATGGCCAACCAACTTCGTTGTGGTTGTTATGGCAAC 

TTTTTCGAGCGACTAGGAGTTTTCCGCCTGGCCAGCTGTTGAACTTACCCGTTTCATTGACCTGCTGACACTCATGGTCCACGTGTATCTCGAACTTCTCGCAACATTTGTT 

GACCAGCACTAGGTTGGGATCGGAAGATAGGGCTGTTGATGATCCGGCGCTAGTGACATGACTGGTT 

(SEQ ID NO: 70) 

Exon: 2651.. 2621 
Exon: 2477.. 2353 
Exon: 2289.. 2119 
Exon : 2061 . .1880 
Exon: 1824.. 1321 
Exon: 1259.. 1001 

Start ATG: 2651 (Reverse strand: CAT) 
Transcript No. : CT21585 

ATGGAACAGCTCAAGTTCTTTGCCCAGAAGCACTACTTCCAGCCGATGTTCACCAGCTATGGAGGAGAGCAAAACAGGCCCGTTAAGTTCAAATTCGTCATCGGGATTCCAA 
ATTGTGGATCCATGCAGATGTGGCCAATATACCACTATGCTGGCAGCTCTGATAAGCTAGTGTTACTGGATGATGGGAGGCTAAGGCATTACACCAATGCCGAAAACGAAGC 
AGAGGAGCGTCATGGAATACAATCGGATTATGAGGAGGATATTGCCGGCAGTCTGGAACCGCTATACCATGACTACGATAAGGGATTGTATTGCATTGACAAGGCCACTTCC 
AGTACTGGAGAGGAAAATGTGCTATTCGCCAACATTTGCTTGGCCCGCAAGGAGATCAAGTGGAGTGACTCAAACTTTCTGCTCCGCAAAATCTTAAACCCAATTTTCCATG 
GCATCTCGCTTGTTATCCTGCTGGTCATCGCCATCATCTATTTTATACTTCCTACTCTCAGCAGAGATCTGGTTGGTAACATTGTGACAACGATAGCCATGTGCCTTATGGT 



FIGURE SHEET 




WO 01/070980 



PCT/US01/09341 



36/89 



AAGCCAGGCAGCAGACCTTGTCAGAATTTTCRCCGAGCTGACCAGCCATGTGAGCTTCRTTGTGGCGGACATTATCCTGTGCTTCAGCCTCTTGGCRGCTTTCTTCTGGCTG 

AACAGTTTCGGTTTTTACATTTGGAAGACCTTTCGTTCGAGGAACGTCTTCCTGCGAGTCACGGACGGCAGGAAGTACTGCTATTATTCTGCGTATGCTTGGGGATGCACAG 

CAACGATGGCTGCACTGGCTGTCTTCGCACATTTCTTCCTCGACGCCGAATCCTACAAACAGGAGCACATGGTGGGCGAGCAAGAGACGATCGGCTGGCTAGGCATATGCAT 

ATTCTTTGCTCCAATTGCTTGTACCATTTrGGTAAACATATTCTTCTATGTGACCACCAGGAAGTTGATTAACCGCAGGACGGTATACGGCCGCATTGCGCACAAATTGAAA 

GCCAATTTTATCATGTTCTCGCTGATGCTGTTAGTAATGTCTATAGCCTGGCTATTCCTCATAATGTCCTGGTTGCAGATGGAAGGCCTGCTTTACGCCCACATCGTAGTTA 

ACGCCCTGCAGACGCCGCTGTTGCTATACATATGCGTGCTGCGCCAGCGACATGTGACATTTCTGCTGAAAAAGACCTGCTGCTACAATGAGCCACCGTCGGCAAACGACTG 

GGGCGATGAGCTGCATTATATGAACGGCAACGATTACTGA 

(SEQ ID NO: 71) 

Start ATG: 1 (Reverse strand: CAT) 

MEQLKFFAQKHYFQPMFTSYGGEQNRPVKFKFVIGIPNCGSMQMWPIYHYAGSSDKLVLLDDGRLRHYTNAENEAEERHGIQSDYEEDIAGSLEPLYHDYDKGLYCIDKATS 

STGEENVLFANICLARKEIKWSDSNFLLRKILNPIFHGISLVILLVIAIIYFILPTLSRDLVGNIVTTIAMCLMVSQAADLVRIFTELTSHVSFIVADIILCFSLLAAFFWL 

NSFGFYIWKTFRSRNVFLRVTDGRKYCYYSAYAWGCTATMAALAVFAHFFLDAESYKQEHMVGEQErriGWLGICIFFAPIACTILVNIFFYVTTRKLINRRTvYGRIAHKLK 

ANFIMFSLMLLVMSIAWLFLIMSWLUMEGLLYAHIWNALOTPLLLYICVLRQRHVTFLLKKTCCYNEPPSANDWGDELHYMNGNDY* 

(SEQ ID NO: 72) 

Name: mth-like 5 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013383774 

GCTGTTCGCACGGCATTTTTGGGCCAGCAAACAGGATGCGGAAAACCGGAAGCTGGGAACTGGAAATGGGCAAGAATCTGGTTTTTGGCTGCCATTCAATTAGGTATCGGTC 
AGCCCAATTCAACTGCTGCTAACTGGCAACTGGAGATTTGGCGATTGCAGCTGCCCATTGTCAACACTTCAATTAACGCAATAGCAGGTAAAAAGGGGACATCTTTTCATCG 
AGATCTCGACCTTGTTTTGCTATCCTTGCAAACAATTGATGGTATGTCGTAAGTTTGTGCGAATAAAATGGTTTTGGAAATATATATATATATTGTTTATTAGTTTATGGCC 
CCAAAAAATGCTACAACACAATATTGAAACCGTTGTTATCTGAAGGGAATGCAACGATTCGAGCTTCATATTGAGAAGGTCCTGTTTTCTCTCTGTTCCTGCATAGTTCCTC 
TTTCATTTCGCATCTTTTTCCACTGGCCGCAGTCTCGCAGGATATGCTAACTAAGGACTATGCCACCATATTCGACAATTTGTTGGGCACCAAGTTAATATGCATTGATGCG 
TTATTCATACGCCCCGTGGGACGGGCGAATAGCAAAACTCTCAATGGGCCTTTGTTAATTTGTATGTGTGTGTATGTGTATTTTGCGTGTATTTGATTTGATGCGTTTAGTA 
CTTAGGATAACAATTACGTTTAGGTTAACTAGCATAAATCTATAATCATCACATTATGGCACAATGTATGTTATGTAGTATATATATATCGTATATATAGAGAGACTGATAT 
ACATAGATATCGATTGATATAACAAATTAAATTTTCAGCTAACCGCACGCTTGAACTGAATCACTTTTGATTCAATCATCT(^CTTGTGATTCCGCGTGATTTGCTGGCAAA 
TTTAAAGCCCTCTTCTCGTGTTTCATGAGAAGTTTCATGATATCATCTATCACTCCTCTCATGGAGTTTAAAACCATAAATATTTTATTATAAATATTCACTATTCACGCTA 
TCAGGCGACCCGTATTACGATTTCGACGACTTCCGGCTCCGCCATAATTACTACCATAATGATCGCCATTTATTTGGGTCAGTGTAGTCGTTGTATGCGGCACATAGAGACT 
CGTATTGTTACGCGCATGTCGACGTCTATAGCGATATCCCGTTATCAGTTCTCGAACCGTCTTTCGGTACTTATCGGAAAGTACACAGTATAGAAAAAAATTGGATGCCGCA 
TGTAGGGAGGCCAATAGGTTACACACATTTCCCGCCACCACCTGAGTGCCAATTTGAATTTCCATATACTGATTGACTATCAAATAGATCGCCCAAGGCAATTGGCAAACTA 
TGAACATTAACACCACCGCAATAAGTGTGATCGTCACGCGATTTTCCTGCGATACGCTGCCTTTTAGTCCAGAATTTGATTTGCGCTAAGGATTCAGAATTTATACAACATA 
TCATCATCATCATCATCGTTTCGAATAACTCACTTTTGTGCGCCTTATGCTGCTGGCATTGGTCAGGTCACCACGCAGATTCTTGGACCGATGAACCAATAGAATGAGAATG 
GAGTTGAAGGTGGCCAATAGAAGTAGGGGGAATACCACGAAGACCAGCAGGGTGAACAGGTTCCACAGATAGCTGTATAGCTCATCATAGCCCAGTTCCGAAAGACCATGAT 
TGTAATAGGTCACCTAAAACATATAGATAGATACATAGAATACATACATAGATTTCCATTAAGAACAACCTTTTAAGCAATGGAATATCCCAAGCGACTAAGTAAAAGCATG 
TCTTACAAAACCCAAAGCACTGATATCATTTTAAGTTATCACCGAATTTATCATAGTCATGGCGCTTGTACACCGCTCGTGATTTTCACTGGTTAACAATTTTCTCATGTTT 
TTTTCTCCAGGGAAATCGACGGTATTTCCAACATTATTTTACTCACATTTTGACAGTATTCCGTTACGTTAAATGCGAATGCATCCGTATTGTTGATGTTGTGATTCTCAGC 
GCTGCGCTTGCGACGCCACAATTGCAGCAAACTCTCTGTGACCCCTGACCTCTGGGCCGCGTCTGCCACCTTCTGATCCTGCTCCTCCAGCGTTTTTCTCAGTGTGGGCAGC 
GTCACAAATCCAGATGACTGCCAGTGACGGCGATGTCGTGGAATGTTCTCTAGTTGAGGGCAAAGAAAGTAATGCAACTATCAACTACTTCACTTATAAAGCTGTCTAAAAG 
TATGCTTCGTACTTTTCAATGCATACTTCTAAGCATCTCTATTAATACAACTTCGATTTTTCCAAGTGTACACTACTGCTATACCCTCGTTACTCACCTGGCTCTCCATCAC 
CACTGCCACTGCCCCAGTCGACGAGCTGATTAGAGCTGCCGGACGTCGTGGACTCCATGGCGGAGTCCGGCGATTGCCAAATGGTCGCCGGCGTGGGCAGCGGGGGCGTGGC 
CAGTGGGGGCGTGACTGCCACGGGTGGCGGCGGCATGGGCGAGATGTTGGCCACCGTTTGGTTGCACGGCTGGTAGGCGTCATCAATCTGCCTAGTACCAATCGTAATTGTA 
TGCTCGAATGCCGTCGATAGTGTGGACAGCAAACAGAAGATGGCCACCGCTGGCAATGTAGAATATAAAGTTGCATATGGAAATGGATTCTATTGTTACATACGCCTATTTC 
AGCCAACTCACCTGCTATCACCTTTTTGGCCAGCGATTCCGTGCAGAATGTTTGCCTCTTGAGCGGATAGCGTATCGCAATGAATCTCTCGATGGTGAAGCACACGGCTATG 
TAAATTGAGATGTAACCTGTCCGGTAAAAACAGAGAAAGCAATTGTGAATGGGGGCGTGGCAGTACAATACGAACGCAATCAAAGATGCACTGCAAATATTTTGGGAAAATC 
CAATCCAAAAAGGATATATATACATATATATTTATTCAGGTTAAGTGTCTAATATAGATTGGTAAATTAAATTCAAACTTCGGTTAGACTGTGCCTTTTTTCCGTGTACTCT 
GGAATTATAAATAAACGCAGCTCCGCTGTTAATACCCCAAACCATCCATTTTTGACCGCACCACCAATTTATTTGCCGCGCGGGCTGTCAATTTATTCAAAACGCAGCGCAC 
TCTAAGCGATTGACGTCCGTGCTGTGCTGATGAGGTTGCTGCGGGAGGGTGGGATATTCAAGGGGGGTGAGGGGGGTGAAAAGTGGGGTGGGAAAGCGGAAAACGGTTCGGA 
AAAACGCTATCGCATGGCAGTGAGCCTCAGCTGACCGCTGGCATTTCGCAGAAAGCGCTGCTTACTATACCTCGTAAATTGGCAGAGCTCAATGAAATTGATGCACAACAGA 
AACAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCCGCAGGTGCAATTCGAATTGATGGGCCCATCGCTGGAATCCGGTTGATACGGGGGGAAATGGGGGAT 
ACATTGGATCCGGGGATCCAGACGAGGCTAACCCGGGGAACTGGGGAATTTATGGAATCCGCACCATTCCACTTAGTGCTCTCCAAACGTTGC^TTCGAACTGTTTTTGTTT 
TTTGCCCCCGGCAGCAATTGTTTTCACCGC^TGATATGTACGCTTTTCTTATTGTAGTTTTGCGGGAATAATGTTAAATATAAAAACACCCTTTTGGTAAATTTTTTCAAAT 
TGAAAGCTTATTTAAGCATAAAATACTCGTGTTTCGATTTAAAGAACACTAATCGAGTTACTTTCGGTCTCACAACCTTATTTATTGCATACTTTGCGACTGCTTTGAATTA 
TTGAATTATTACTCCATTGCTACTTAATTCTAATTTGGTCAGTACTCACCGAAACTGTCGCACAGCCAGACAAAATAGCCATAGAGCTGCCAGTAGAGCTTGAAATGATACT 
TGGGATAGTCATAGTGCTGGAGTGACAGGATTAACTGGCACGTCAGGTAGGCAATATCCGTAATGGCCAGTGCCGTTAGATATATGTTCGTCGTGCATCTCATGCGCTTTCT 
AAAAATACCGTCAAGAATATCGAAAAGAATTGCAAATGGTGTTGATTCTGTTGGGCACTCGACGCATTTGGAAATCAGGCCAAACCATTTCGGTGGACAATGCAAAAACGCT 
GCCAAGTGCAACACCAATTGAAATAGCAAATTGCCTGAAAGCCAAACGATCGGTGGGGTTGTGCGCAGAGGGGGTGGAAAATTCCAGGCGGTTGATGTGATGGGCGCGGGGC 
CTTGGGCATTTTCCGAAATGCGCACAAATGTCGGCACAGTCATAAAGCCAAAAGATGAACAATAAAAACTAACTGTTAGCATTTAGTTTTCTCCCTGGCATCGCCCGCAGCT 
CTGATAGTTAAATTCTCTTGTATTCCCGTAAAACAAAATAAAATAACGTAATGAAAATATTTAATTAAAAAGCCATTCATGCCGTCAACGCTCAATGTTAAGCTAGTTGCTT 
TGAAAAGTTGTAAATATGGTTTAAAATAATAAAGTTCATGCTATAAATATCAATAAATTTTGTTTAGCAAATAAATGAATTTGCTAAGATCTTAGCAAGGAACACAGTTTAT 
CTTTTTTTTTCACGTTTTCCTGGGAAATGGACATATGGTGTCATAGCAGTCAGCCTAACGCACAGAGCACTTGCAACTGTCGCATATTTTCCGGCAGACATAAAAAACACAC 
CCATCCACACAGCATATCGCCTATCTGTGTGCGTTTTTGCATTTGGCACATAAAATGGCTCTTATTGTTTCAGTTACTGTTATTGTTGTTGCACTTGTACCCAACGTAGTTG 
TTGCTGTTGTTATTGTTATCGTTGTTGGCTGTGTGGAAATGATTTTAAAGTGCAGCGGTGGCTACAATTAAATGTCGTGTTTAATGCGCTGCGTACCTGCTCGCCGCTTCAT 
CCATGTTCCACTTATTTCACACACACAGACACACACACACACGCACAACAGGACATGCAGTTGGATGCACAGAGAGAAAATCGTGCAAGTTAGCTGGCATTTTAAATGGAAA 
ATTGTAACTTAAAACTAGTTTAAAACAAGCTCAAACGAATAAGTTGAAGTTTGGAATAAAATTAATTTGAAAAGGCTCATATATTTTATATGCATTGCATACTTTTTAGTAC 
TTAACCAAATGCATGAAAATCCAATTAGTCGCTTTTGCATACCTAATAACTTAACATTCTTAACGTGGCAGCTAAAACTTCTTTCAAATGCCATCATCAGCTAAATGATTCA 
TATTTAATTCGAAAATCTTGTGACCCTGCTGCATG 
(SEQ ID NO: 73) 

Exon: 4299.. 3858 

Exon: 2B16..2700 

Exon: 2625.. 2338 

Exon: 2177.. 1951 
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Exon: 1693.. 1490 
Exon: 1189.. 1001 

Start ATG: 4299 (Reverse strand: CAT) 
Transcript No. : CT21642 

ATGACTGTGCCGACATTTGTGCGCATTTCGGAAAATGCCCMGGCCCCGCGCCCATCflCATCAACCGCCTGGAATTTTCCACCCCCTCTGCGCACMCCCCACCGATCGTTT 
GGCTTTCAGGCAATTTGCTATTTCAATTGGTGTTGCACTTGGCAGCGTTTTTGCATTGTCCACCGAAATGGTTTGGCCTGATTTCCAAATGCGTCGAGTGCCCAACAGAATC 
AACACCATTTGCAATTCTTTTCGATATTCTTGACGGTATTTTTAGAAAGCGCATGAGATGCACGACGAACATATATCTAACGGCACTGGCCATTACGGATATTGCCTACCTG 
ACGTGCCAGTTAATCCTGTCACTCCAGCACTATGACTATCCCAAGTATCATTTCAAGCTCTACTGGCAGCTCTATGGCTATTTTGTCTGGCTGTGCGACAGTTTCGGTTACA 
TCTCAATTTACATAGCCGTGTGCTTCACCATCGAGAGATTCATTGCGATACGCTATCCGCTCAAGAGGCAAACATTCTGCACGGAATCGCTGGCCAAAAAGGTGATAGCAGC 
GGTGGCCATCTTCTGTTTGCTGTCCACACTATCGACGGCATTCGAGCATACAATTACGATTGGTACTAGGCAGATTGATGACGCCTACCAGCCGTGCAACCAAACGGTGGCC 
AACATCTCGCCCATGCCGCCGCCACCCGTGGCAGTCACGCCCCCACTGGCCACGCCCCCGCTGCCCACGCCGGCGACCATTTGGCAATCGCCGGACTCCGCCATGGAGTCCA 
CGACGTCCGGCAGCTCTAATCAGCTCGTCGACTGGGGCAGTGGCAGTGGTGATGGAGAGCCAGAGAACATTCCACGACATCGCCGTCACTGGCAGTCATCTGGATTTGTGAC 
GCTGCCCACACTGAGAAAAACGCTGGAGGAGCAGGATCAGAAGGTGGCAGACGCGGCCCAGAGGTCAGGGGTCACAGAGAGTTTGCTGCAATTGTGGCGTCGCAAGCGCAGC 
GCTGAGAATCACAACATCAACAATACGGATGCATTCGCATTTAACGTAACGGAATACTGTCAAAATGTGACCTATTACAATCATGGTCTTTCGGAACTGGGCTATGATGAGC 
TATACAGCTATCTGTGGAACCTGTTCACCCTGCTGGTCTTCGTGGTATTCCCCCTACTTCTATTGGCCACCTTCAACTCCATTCTCATTCTATTGGTTCATCGGTCCAAGAA 
TCTGCGTGGTGACCTGACCAATGCCAGCAGCATAAGGCGCACAAAATACCGAAAGACGGTTCGAGAACTGATAACGGGATATCGCTATAGACGTCGACATGCGCGTAACAAT 
ACGAGTCTCTATGTGCCGCATACAACGACTACACTGACCCAAATAAATGGCGATCATTATGGTAGTAATTATGGCGGAGCCCGAAGTCGTCGAAATCGTAATACGGGTCGCC 
TGATAGCGTGA 
(SEQ ID NO: 74) 

Start ATG: 1 (Reverse strand: CAT) 

MTVPTFVRISENAQGPAPITSTAWNFPPPLRTTPPIVWLSGNLLFQLVLHLAAFLHCPPKWFGLISKCVECPTESTPFAILFDILDGIFRKRMRCTTNIYLTALAITDIAYL 

TCQLILSLQHYDlPKYHFKLYWQLYGYFVWLCDSFGYlSIilAVCFTIERFIAIRyPLKRQTFCTESLAKKVIAAVAIFCLLSTLSTAFEHTITIGTRQIDDAYQPCNQTVA 

NISPMPPPPVAVTPPLATPPLPTPATIWQSPDSAMESTTSGSSNQLVDWGSGSGDGEPENIPRHRRHWQSSGFVTLPTIiRKTLEEQDQKVADAAQRSGVTESLLQLWRRKRS 

AENHN1NNTDAFAFNVTEYCQNVTYYNHGLSELGYDELYSYLWNLFTLLVFWFPLLLLATFNSILILLVHRSKNLRGDLTNASSIRRTKYRKTVRELITGYRYRRRHARNN 

TSLYVPKTTTTLTQINGDHYGSNYGGAGSRRNRNTGRLIA* 

(SEQ ID NO: 75) 

Name: TRH receptor like 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384839 

ATAAATGGCCGAATTTGCAAGTCAGTTCCACTTGCTCTTTAAAAAGCGAATCTCACCGAAATTTAAGCAAAATTTATCACCGACCAATGTGAAATTCCACTAATAATTTTTC 

AGCGGTGTTGATGGCTTTATTGACATCGCCAGAAATGATTCGAGCTGTAGCAGCGAAAATTTAATCAAAGACCTAACCCACGAACCTGGACGTTCTCACATAGAAAATGAAA 

ACGAAAGCTGAAAGACAGATGGATCGTTCGATGTGAGCTCTTTTTACACTTGGCTTACACGACGACTGCGGCGTCGGTAAAATTAATATACCAAACAATTACACGGCCCGGC 

AAAACAATTTTAAGCCCAGTTAATGCCGTCGCACAGTGGCTGCAAGCCACGCAAATTCGCCGCGAAAAATTGCCCACAATTGCCAGTGGCCGAGAAATGAAAGCGCTCAAAT 

TTGCATACTCGTACTGTCAAATGAAGCGTAGCAAAAGGAAAATTGCTACACATACTAATTGGAAAAGAAATGCAAAACCAATGCAAATGAAAATAATGACAGGGCGCGGTTT 

GCGAAGCTGTCTAATTATTTATTTGACATTGGCAATGACAATGAAACCGAGAGTCCCACAGCAGAGTAATGGATTTTTCTTTCAACTGCTTAAGTTCGTTTTCCATTGAAAT 

CTTATTTGGACATATAAACCATACGTAGAATGATAAGAATAATAAAAAACAAGTATGTTGGACACAATTGATAAGGAATCTTTAGTACTTTCTCACCTAAAAAAGGTGATAT 

ATATTTGAATTCATGTTGTCATGATAGCTACTCGAACCACTGTGCGTTGGCTACAGTCTGCTGATGCTCACAGTGTTTGAAAGTTAAGACTTTTGCTTTCCTTCCCCCGACT 

TGGTTTTTGCTTTTGGAAAGACCAAGCAAATGGATAAACAAGAGTTCAAGAGGTTGTCAGCCCCGCCCCTTTGCGCACATGAGGCAAGAAAGAAAAGAAAGTTATTACAAAA 

TTAACAGAGGCTGACAGCTGACAAGGGCAGAGGAATGGGGCCGAAATGGAACTGTATCTAGCTTGAGTTGGGTCCGACCGCCTCGGGCAGAAAATGTGACAAATTTGGAGGG 

GAAAAGCTCGTATGAGCGGGACTTACCACCTCCATTGTTGGCACTGGAGCAGTGTTTGGAGCTGAGTCCCTGGCCCGTGTTGGATGCAGAGTTGGGCAGTCCGCAGCCGCTC 

ATCACGCCGTACTTGATGGGCTTGAAGTGGAAGAACGAGGGCGAGTAGCCGGAACCGGCGCGACTCAGGGAACTGAACTGCTCCTCGCGGTCCTCCACGTACAGGCCCTCCT 

TGCCCATGGCCGCCAGTTGCCTGCCCTTGGGCATGAACATCACCAAGAAGACCGTGGCGGATGTGGCTACCAGACCGAAGGCCACGCAAGCGTCCTTGTGCCGCTCGGCCAC 

GGCCAGGCCGCATAGCATCCATCCCAGCCAGATGGGGATGGCTCCTCCGATGGCCAGCCCAATGTAGGTGGCCTCACGGTAGTTGTCCCTGATGCGGCGCGACTTGATGGCT 

AGCACGGCGATGAAGACGATCAGAAAGACAATGTAGATCAGCGAGAAGAGCAGCTCCGAGAACTGCGTCTTGCAGAGCGGTATTAGTACGGTGCTAACCGCCGCTATCCGGG 

TGTAGATCTCCGGAGTCCCGTCCAGCGTTGTGTAGGACGTGGGGTAAAACAGTGCCGAGTAGTTGGTCTGTGAGGCCACCGTGGTGCTAAGGAAGCCACTGCCCATCACAGG 

CACACTGGTGGTGTATACCTCCGGTGGTTGGGTGAGCAGCCACTGCCCTCCGATCGCTACCTGGATGAGCAGCGCAAAGAGGAGCAGCAATCCCTGGTATGGAGCCGGCAGA 

TACACCCCTCCATTTAGACTAATCAGAAATACGCATTTGACCAGCAGGGCGGCAAAGACCAGGGCGTAGGCCACGCCCACTCCGAAGCGGATGGCCCCGCAACTGATCAACG 

AGGGCTGTGCGGTGATGATGGCCCCCAAACTAGCGCAGGCGAACAGACCGAGAAGCAGCATCTGGCCCAGAAAGAGGTGCCTACGCGACGGGGACGTCCGCCAGGCCTTGAA 

GAGCACAAAGATCTCGAACGCTGCCATCATGAGCATGGTGAGGGTGGCCAGGACTAGCACCGGCACCACCCACGGCTCCTTGCGTAGTCCCGCGAAGGGCAAGTGGGTGGAC 

ACTATACTGGGCGGTGTGCTGTTGGAGCTGCTCCCGGGAGTCACCCGTCCAGTGGTGCCGGCTGCTCGCGCCTTCGCCCCTGAACTCCGACGGGGCGTGGCTGGTGTGGTGG 

TGGACGTGGTGCTGCTAGAACTGCTGGCCGATAGTGTGGGTACCACAAAGAATTCCGTCGACATTTCGAGACGGGCAGCACTGGAGACATTGCCACCCGGAAAATTGAGATC 

CAATTGTTTGGCAGATCGCTGGGTCTGAGGAGCTGCCGGAGTAGTGGTACTAGTGGTGCTACTTGTAGTACTTGTAGTCGTGCTGGTAGTAGTGGTGGTGCTGGTGGCATTA 

CGGCTACTCGGCTTAAGCACCTGGTAGGCGTTGCGATCCGATTCCTGGCTGGTCACCAGGCGCACCCGCCGCCAAATGTTGTTGCCATTAACGCTGAGGCTGCGCGACGCCG 

GCGTGGTGGCGATCAACTGCATGATCAGCAGCAGGATCAGCAGCAGGTGGCGTCTACCGGCCATTGGATGTGGTGACCTTCGATGCCCTGGACCTCTGACAATATAATCAAA 

CGCGGCACATCTCTTGCAAACGCACACTAACCACGCACTCGCGATCGCGATTGTTCAGGTTCGTAAATTCTTCAGTCCTCGCAGGCCCTAGATCATGAATTCTTCTTCTCGA 

TTCTCGCGCTAAAACGTCCGCTCGCTCCGATCGACCAACTGCAACTGACTGGCGGCTCTGCAAGTTGGGCTTTGGACCGGCTTAGTCGTTTGAGACCAGTTTGACGGCAGCC 

AAGTCGAACTAACACTTGTTTGAGTTCGATGTGCGTTTGCTAACCCGGCTAACACTTACAGGAACGCAACATGATGCGTGGGAGGCAGTTGGGGTTTGGGGCGCTTTATATG 

ATTTTTTGCAATGTAGAGTATTGCACGAGCATTTACTAAAGCATCCTTTGAATTTGATACAACATTCGTTACTCGATCTTCTGGCTTTAAGGTTTAGTATATAGCTTCAACA 

ACTAATGTTCTAAATAGCTCAATGTCTTGCTGAAGGGCTATTGATTTCCTCTCGAAAGCTATGGTACTTAATTTCCTATTTAGAGCCAGTAAGAACTCAATTTATCTTTAGC 

TTCCGTGCTAATCCGCTGGAACATATACTTTAACACAACATTTACATACGTAAGTTTTTTTTCCATTTGATAATTATTCCAATTAGATTTCATATTGCGGGATATTAACATT 

GGACACAATTTTTTCCTCTGCGGTTGGTTGCAACATCAACTGCAGACGCGATCCACAGACCTAGCTCAGGATCCACACTCAAACCGACTGCGGGATGGAGAACACTTTCATT 

CGTTCTGCATATGCGGAATTTCGGTTTATGGGCTCTGATGCTCGAGTTGGCTGGTAGGAAAGTGCTATGGGCCATTATCTATTAGCCAACCGCTTTTACGGTCTCCTCGCGG 

AGCCTCTCTCCTCGAAAACGAGCCAACAAAAGGCCAAATAGCGAAATCCGAGCACAAAAGGTGATTGT AAATCAAAAGAGCGCTAATGGTATATCGATATACAAAGTGTTGA 

GGCGGGAAAGGCATCGCCAATAGCGTTAGAAACGGCAATAGCAATAGCAACCCGCTCTGATTCCTTAGGTCTTATTGTATTCGGCGCGCGCTTTCAGTCAACCGCAGTTAGA 

ATTCAGATCACCTGAATGAGTGGGCA 

(SEQ ID NO: 76) 

Exon: 2946.. 1001 

Start ATG: 2752 (Reverse strand: CAT) 
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Transcript No. : CT22H7 

TCGATCGGAGCGAGCGGACGTTTTAGCGCGAGAATCGAGAAGAAGAATTCATGATCTAGGGCCTGCGAGGACTGAAGAATTTACGAACCTGAACAATCGCGATCGCGAGTGC 

GTGGTTAGTGTGCGTTTGCAAGAGATGTGCCGCGTTTGATTATATTGTCAGAGGTCCAGGGCATCGAAGGTCACCACATCCAATGGCCGGTAGACGCCACCTGCTGCTGATC 

CTGCTGCTGATCATGCAGTTGATCGCCACCACGCCGGCGTCGCGCAGCCTCAGCGTTAATGGCAACAACATTTGGCGGCGGGTGCGCCTGGTGACCAGCCAGGAATCGGATC 

GCAACGCCTACCAGGTGCTTAAGCCGAGTAGCCGTAATGCCACCAGCACCACCACTACTACCAGCACGACTACAAGTACTACAAGTAGCACCACTAGTACCACTACTCCGGC 

AGCTCCTCAGACCCAGCGATCTGCCAAACAATTGGATCTCAATTTTCCGGGTGGCAATGTCTCCAGTGCTGCCCGTCTCGAAATGTCGACGGAATTCTTTGTGGTACCCACA 

CTATCGGCCAGCAGTTCTAGCAGCACCACGTCCACCACCACACCAGCCACGCCCCGTCGGAGTTCAGGGGCGAAGGCGCGAGCAGCCGGCACCACTGGACGGGTGACTCCCG 

GGAGCAGCTCCAACAGCACACCGCCCAGTATAGTGTCCACCCACTTGCCCTTCGCGGGACTACGCAAGGAGCCGTGGGTGGTGCCGGTGCTAGTCCTGGCCACCCTCACCAT 

GCTCATGATGGCAGCGTTCGAGATCTTTGTGCTCTTCAAGGCCTGGCGGACGTCCCCGTCGCGTAGGCACCTCTTTCTGGGCCAGATGCTGCTTCTCGGTCTGTTCGCCTGC 

GCTAGTTTGGGGGCCATCATCACCGCACAGCCCTCGTTGATCAGTTGCGGGGCCATCCGCTTCGGAGTGGGCGTGGCCTACGCCCTGGTCTTTGCCGCCCTGCTGGTCAAAT 

GCGTATTTCTGATTAGTCTAAATGGAGGGGTGTATCTGCCGGCTCCATACCAGGGATTGCTGCTCCTCTTTGCGCTGCTCATCCAGGTAGCGATCGGAGGGCAGTGGCTGCT 

CACCCAACCACCGGAGGTATACACCACCAGTGTGCCTGTGATGGGCAGTGGCTTCCTTAGCACCACGGTGGCCTCACAGACCAACTACTCGGCACTGTTTTACCCCACGTCC 

TACACAACGCTGGACGGGACTCCGGAGATCTACACCCGGATAGCGGCGGTTAGCACCGTACTAATACCGCTCTGCAAGACGCAGTTCTCGGAGCTGCTCTTCTCGCTGATCT 

ACATTGTCTTTCTGATCGTCTTCATCGCCGTGCTAGCCATCAAGTCGCGCGGCATCAGGGACAACTACCGTGAGGCCACCTACATTGGGCTGGCCATCGGAGGAGCCATCCC 

CATCTGGCTGGGATGGATGCTATGCGGCCTGGCCGTGGCCGAGCGGCACAAGGACGCTTGCGTGGCCTTCGGTCTGGTAGCCACATCCGCCACGGTCTTCTTGGTGATGTTC 

ATGCCCAAGGGCAGGCAACTGGCGGCCATGGGCAAGGAGGGCCTGTACGTGGAGGACCGCGAGGAGCAGTTCAGTTCCCTGAGTCGCGCCGGTTCCGGCTACTCGCCCTCGT 

TCTTCCACTTCAAGCCCATCAAGTACGGCGTGATGAGCGGCTGCGGACTGCCCAACTCTGCATCCAACACGGGCCAGGGACTCAGCTCCAAACACTGCTCCAGTGCCAACAA 

TGGAGGTGGTAAGTCCCGCTCATACGAGCTTTTCCCCTCCAAATTTGTCACATTTTCTGCCCGAGGCGGTCGGACCCAACTCAAGCTAGATACAGTTCCATTTCGGCCCCAT 

TCCTCTGCCCTTGTCAGCTGTCAGCCTCTGTTAATTTTGTAA 

(SEQ ID NO: 77) 

Start ATG: 195 (Reverse strand: CAT) 

MAGRRHLLLILLLIMQLIATTPASRSLSVNGNNIWRRVRLVTSQESDRNAYQVLKPSSRNATSTTTTTSTTTSTTSSTTSTTTPAAPQTQRSAKQLDLN FPGGNVSSAARLE 

MSTEFFVVPTLSASSSSSTTSTTTPATPRRSSGAKARAAGTTGRVTPGSSSNSTPPSI VSTHLPFAGLRKEPWVVPVLVLATLTMLMMAAFEI FVLFKAWRTSPSRRHLFLG 

QMLLLGLFACASLGAIITAQPSLISCGAIRFGVGVAYALVFAALLVKCVFLISLNGGVYLPAPYQGLLLLFALLIQVAIGGQWLLTQPPEVYTTSVPVMGSGFLSTTVASQT 

NYSALFYPTSYTTLDGTPEIYTRIAAVSTVLIPLCKTQFSELLFSLIYIVFLIVFIAVLAIKSRGIRDNYREATYIGLAIGGAIPIWLGWMLCGLAVAERHKDACVAFGLVA 

TSATVFLVMFMPKGRQLAAMGKEGLYVEDREEQFSSISRAGSGYSPSFFHFKPIKYGVMSGCGLPNSASNTGOGLSSKHCSSANNGGGKSRSYELFPSKFVTFSARGGRTQL 

KLDTVPFRPHSSALVSCQPLLIL* 

(SEQ ID NO: IS) 

Name: Metabotropic glutaraate receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384617 

AAATTTGTATAACTTTCAGTTCTTATACTCAGTTATTCCATTAATCTCTAATCCCACGACATTTAAATACACATACAAATTTTTTATACAAAGTTTGCTTTATTGACTATTT 
TTGAAGCACATTTACCATATCTGTATTATTAAAATACTTAGCATAGATTACAGAATTTTCAGCTTATTCCATGAAATACTGTTTGTCAAACAAACAGATTTCATCAGTTTTT 
GTTTTTATAATCCTTTTTTTTTGGGAAAGATTCTCATATATTTACAATTATAATTTAAAATAGTCTCTACTTTAGCTTACTACATATTCAATTAAATCTCAGGAGGCTGACT 
TTTCCGCGGACTTTTATGGTGGCTCTGACCCAACTGGATGCATCCTGCGCCTTTCCTCGCAGAAAAGTATTTTATTTCACATTTTACGGGCGTCTTGGCTGGCTGTCTTACC 
TCTGGCTAAGATGCTTATTTTATACAAATCCGACTTTTGCGATTTTAATTGCATAAATCAGCTTAGTTATTTAATTGGTTTATTAATCTACGGGAGAAGCAACCATTTTATT 
TTAGGTAAGCACTAAGAAAAAGATTGGATCTTAAGATTCCGATATTATTATATATCTTTATTTACTTAGTAAAAGGATGGGGATTTACGTATATTTTATTTAAGAGGGTATA 
TTGGCAGTAAAATATTTATTCAAATTGTGGGTTGTACAGTTTATATTAATTATTAATATGTTATTAGCTGTCAAGGCAATAAGAAAATGTAATCCATTTCAGCCCCCTTCAG 
TTAAATATATTGCTAACATATCGATGTGCGGTTGTACAGACTCATTTTCAAATCTTTGAATTTTAATTCTGATCTCGGTTTTTATTACGTAGACACATATTAATACTGCTTA 
GTCTTAAGTTCTTCATTCATATTTGGATAAGAAAGGATTATAAAATCATTTAAATGCATACTATAATATTGGCTTATTTAAAGGAAATTCCTACTTAAACTGGGTTACAAAT 
CTGTCTGCTGCACTGCGCCCGATCCTCTTCTCGCGGTGGTGCAAAATGCCTCCTGCTGCTCGCTCAAACATATAATCAAATTCTCCGGCGGAGCTGTCACTTTGCCATTGGA 
ATTGCATGATTTCTGTGTCCCCGTGGTTCCGGTCTTTTCCGCTGTCGTGGTCGAGGATGTGGTGGTCGATGAATTGTTGTTACCCGCATTCAGTGGCAGCAGTGGAGGAATC 
TGCGGATTGCTGGTCAACAGGCGCTTGGAGCCACCCCTCTTTTTACTGCCCTGTTTGGTGAAAACACTGGGCTCCAGCTGGAGTTGAGCGTTGATATCCTGCTTATTCATAC 
AGGTAAAGGCCTTGAAGAAGCTCTTCCGGAAGTTCTCACTTAGGAAGGCATAAAGTATGGGATTCACCGCCGAATTCGAGTAAACCAGTGCCCCCAGAAGTAGGAAAATGAG 
TATTTCCAGTCGGGAGAGGTCCCTTTGCGCGGGATTCGAGTGAATCAGGGCCACCTTGGAGATCCAGTGGTGTGTAAATAAAAAATCGAATATACTTTATAATTAATTACCT 
GAGAAATCCAGTGAGGGAGCCAACATAGAATGTATACACTTATCACCTGCGGATTGAAAAGAAGGTGGGAATCTACAAATGAATTGAACTGATTATTTCGGCATTATTACCG 
TCAGTACCAGTCGAGTGACCTTCCTGTGAGCCCGCCTCTTCTCCTTGGACTTCGTTCCTGGTTTGGGACCCACCGATCGCAGTTTCCTTATAACCAAGTAGTAGAAACTCAG 
GATAAAGCACAGCGGTGTGGCGAATCCTAGGAAAAATGTGTACAGTATGAAGGTGGTGCCCGAATGCTTCTTGTACGCATCTGGCCACATTATGTTGCACGAGTAATTGATG 
CCATCCTCCTGCTCCACAGTGCTGGCATAAAGGATCACGGGCAGCATGAGGACCGCTGAAGTTGACCAGGCAATCGCTGAGACCACTTTGGCAATATGCAGAGTTCGATATC 
GTGGCGAGGAAATCGGGTGGCATACCGCTATATATCGATCCGCGGACATGATGAGCAGAAAAATCGACGAGGTGAAGGAGGTGATGGATGTGCTCACCATGTAGGCTTTGCA 
CATAAACTCCCCGAATCGCCAGCTGCAAATTCGCATTGTGTACAGCAGAAAGGGTATTCCAATCAGGAAGCACTCGTCTGCCACCGCCAGATTCAGGATATATATATTCGTG 
ACCGTTTGCATTTTGGAAAAGCGCAACACCACGTAGATCACCAGGGTGTTGCCAAATAATCCGATAATGCACACAAATCCGTAGAGCACCACAGTGAACAAGTCAGCAAAGG 
AATTCCGCGTGGCTATGCAATGTTGATAGGTGGGCAAATCAGTGCCATAAAGGGACTCCTCTGGCTGAATTGTGGAACTACCACTAATCCATCTATGGTTCAGTTCCGTGGT 
ATATAAGCTCTCGTTCGTGTTATACCAATTTGTGGCATTTGCCTCTGAATCGGCTGTCATTTCCCCTTTCACAAACTGGAGGACATCCATCATCAGCCACGTAAACATATTT 
CCACCTAAGCGCTGTCCAGTTCAAATTTAATTATTTTAAATAGGTTTTACGCGTTGTTGCTCCCCGCTCGTTTTGTTCTTGCATTTTGGCGTTGCTGCAAAATACATTTTGT 
TATTACGTGAAATATTTTCAGACACAGTTGTTTTGGCCAAACTTGATTAGTTACTAATTGAAATACTGTGGTGCTATTAAAGAAAAAAACCTAAAACCTTCACTAGTCTAGT 
CAATATTTTTTATTCCAACGTAAGTTTAAAATGTACTGTAACAATTGGTGATAAGTATTCAAATGGAATTAAGTTAAGTCTACAAGTTTTATAAGATATTTTAATATAACTA 
GATTGCAGCTCTTTAATGCATAGTGTATATTTTTAATATATTATGAGTTTACTTTTAAGTGGCAGTCAATTTTATTTGCTGTCTTAACATCTTTGAATCGTATGTACCCCAT 
TCGTAATGCCACTTCACATACAGATTCTTTTGCAACTTTACTACAACTTTTACTGACTTTCTGAGAAATCCCCTCGAAAATTGAAAACTTTCCTGAACAACTTGTAGAAAAA 
CAAACCTTTAAAAATACGACGTATGAAATAAATCGTCTGTTTCCTGCTTTTGAAAAGTGAGTTACCACAAATTTGGTTTGTAAGGCGAAGGAAATTAACATTTATATTTAAA 
TTAAATTGCTCACAGCTTGACACGTCCAACAAACAAGAAACAAGAACTTTTTTATATTCTGACCTAACTCGACACTTTCTGTAAATAAATTAGGCGTAAGGAAAAGCTGGGC 
AGCATTTATGCAAATATCAGGTGAAAATATTTGTCTTCTCGGAAGTCGGAAAACTCCAGCTATACAGATCCCCATGGTTTTGTATTAGGAGATTGAAATTTAAGCCGATTTG 
TTTGTGCATTCCATGCACATCAAATATTGTTGACTGAGCCAAAACACATGTG 
(SEQ ID NO: 79) 

Exon: 2524.. 1679 
Exon: 1510.. 1001 

Start ATG: 2524 (Reverse strand: CAT) 
Transcript No. : CT22465 
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ATGACAGCCGATTCAGAGGCAAATGCCACAAATTGGTATAACACGAflCGAGAGCTTATATflCCACGGAACTGAACCATAGATGGATTAGTGGTAGTTCCACAATTCAGCCAG 

AGGAGTCCCTTTATGGCACTGATTTGCCCACCTATCAACATTGCATAGCCACGCGGAATTCCTTTGCTGACTTGTTCACTGTGGTGCTCTACGGATTTGTGTGCATTATCGG 

ATTATTTGGCAACACCCTGGTGATCTACGTGGTGTTGCGCTTTTCCAAAATGCAAACGGTCACGAATATATATATCCTGAATCTGGCGGTGGCAGACGAGTGCTTCCTGATT 

GGAATACCCTTTCTGCTGTACACAATGCGAATTTGCAGCTGGCGATTCGGGGAGTTTATGTGCAAAGCCTACATGGTGAGCACATCCATCACCTCCTTCACCTCGTCGATTT 

TTCTGCTCATCATGTCCGCGGATCGATATATAGCGGTATGCCACCCGATTTCCTCGCCACGATATCGAACTCTGCATATTGCCAAAGTGGTCTCAGCGATTGCCTGGTCAAC 

TTCAGCGGTCCTCATGCTGCCCGTGATCCTTTATGCCAGCACTGTGGAGCAGGAGGATGGCATCAATTACTCGTGCAACATAATGTGGCCAGATGCGTACAAGAAGCATTCG 

GGCACCACCTTCATACTGTACACATTTTTCCTAGGATTCGCCACACCGCTGTGCTTTATCCTGAGTTTCTACTACTTGGTTATAAGGAAACTGCGATCGGTGGGTCCCAAAC 

CAGGAACGAAGTCCAAGGAGAAGAGGCGGGCTCACAGGAAGGTCACTCGACTGGTACTGACGGTGGCCCTGATTCACTCGAATCCCGCGCAAAGGGACCTCTCCCGACTGGA 

AATACTCATTTTCCTACTTCTGGGGGCACTGGTTTACTCGAATTCGGCGGTGAATCCCATACTTTATGCCTTCCTAAGTGAGAACTTCCGGAAGAGCTTCTTCAAGGCCTTT 

ACCTGTATGAATAAGCAGGATATCAACGCTCAACTCCAGCTGGAGCCCAGTGTTTTCACCAAACAGGGCAGTAAAAAGAGGGGTGGCTCCAAGCGCCTGTTGACCAGCAATC 

CGCAGATTCCTCCACTGCTGCCACTGAATGCGGGTAACAACAATTCATCGACCACCACATCCTCGACCACGACAGCGGAAAAGACCGGAACCACGGGGACACAGAAATCATG 

CAATTCCAATGGCAAAGTGACAGCTCCGCCGGAGAATTTGATTATATGTTTGAGCGAGCAGCAGGAGGCATTTTGCACCACCGCGAGAAGAGGATCGGGCGCAGTGCAGCAG 

ACAGATTTGTAA 

(SEQ ID NO: 80) 

Start ATG: 1 (Reverse strand: CAT) 

MTADSEANATNWYNTNESLYTTELNHRWISGSSTIQPEESLYGTDLPTYQHCIATRNSFADLFTVVLYGFVCIIGLFGNTLVI YVVLRFSKMQTVTNIYILNLAVADECFLI 
GI PFLLYTMRICSWRFGEFMCKAYMVSTSITSFTSSIFLLIMSADRYIAVCHPISSPRYRTLHIAKVVSAIAWSTSAVLMLPVILYASTVEQEDGINYSCNIMWPDAYKKHS 
GTTFILYTFFLGFATPLCFILSFYYLVIRKLRSVGPKPGTKSKEKRRAHRKVTRLVLTVALIHSNPAQRDLSRLEILIFLLLGALVYSNSAVNPILYAFLSENFRKSFFKAF 
TCMNKQDINAQr.QLEPSVFTKQGSKKRGGSKRLLTSNPQIPPLLPLHAGNNNSSTTTSSTTTAEKTGTTGTQKSCNSNGKVTAPPENLIICLSEQQEAFCTTARRGSGAVQQ 
TDL* 

(SEQ ID NO: 61) 

Name: Somatostatin receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384531 

AGCATGCGGCTGAACAAAAAAAAACTCGAGGAATGGCCGCAATTTCGGTGAGCCATATAAACAACATAACATATAGTTTTCCCATCCATCTACGCTGTCTGCGCCACTGGCC 
AAGTCCAAGTTGTCTTGACGGCCCGGCAAAAAGCAACATCCAAGGAAAATGTTGCCATTTTAGATGCGCCGCCCGCTGAGCGGAAAAGTCGCGCGGAAATTACTAAATCGAT 
ACGTGCTGCGTATCAGATTTTCCATGGGTTCTGCATGCAAAAATACCTTGAAATCCTTGCCTGCAGTCGCACACATACACGTATTGCCATCTTTGTACAGACAAGCTTGACT 
AAGTCGAACTTTACTATGTTATTCTACTAAAAGGTTTTAAAGCAAATGATCTATACTATACTATACTATTAACTTCAATAGAATTCCTATTTTCTCTGGCTAGCAAACAAAT 
TTGGATCATAAAAAGATTTTCATGGCATTTATGAGATACGTAGATTGTGTGGCTCAACCTTTTATCTATATTATTTCCATAAATTTAAGCACCAATGGAAATGTTTATCTCC 
AAAGCTTAGGAAAATTTGGAAGTATATATAATAAAAACCTTTTATGTCTGCAAATAAGAGATATTACGTGTATACTGAGTTTCGCTTGAGTTTCGGAGTTAAGAGGCTTCCC 
TATAGACTGTTTGAAAATCCTATAATGCCTTGGCATTTCTAAAAGGTCAACTGAAAGCAGTGGAATTGCTGGTGGGCGAGGTTCGTCTCCTAAGTGGCAAAATGATCCAGAA 
TCGGAAGCAAACCCTGACGATGAAATAAACTATACGAGAAACTGGAGAACAATTACACATGGACTGTGGGTGCAATTAGGGACCACACCTTTTTCGCTGCCGACGCGTCTAA 
AGTTAATGAATATGTTAATAATATGTTATGCGCGTGCAAATTTGTTCAGGGCTTGAGGGGTTTTCTGTGTTTCTGTTTCCAGGCGAGTCCAGTCCAATCCAACTCTATCGCC 
TGATGGACATGGTGTTCGTGGGGAACTGTGGCTTCTGGCCGGCGCTGAAGAAGCGTTTGCAGGTGAGGCGCCAGAAGGCCGCCCGGAAGTCGACGCTGTAGAAGGCGTAGAT 
GAAGGGATTAATGGCGCTATTGAACCAACCCAGCCAGGTGAGGGCCTTGGCCAGCATTTGGCTGGCCTGGTGTTCGGCCAGGAACGGGGTGATCAGATAGTTGATAAAGAAG 
GGCAGCCAGCAGGCGATAAAGCCACCCACCACAATGCTCAGGGTTTGAGTGGTCTTGTTCTCCTTCTTCAGCGACGTAATCCGATTGGACAGCGATTTACTGTTGGTGTTGG 
CACTGGTGTTCCCGCTAGTCGATGTCGTGGTTACGCCCGCCTGGTGATGATAATGAGGATGGTGGTGATGGCTTCGCAGTCTCTCGCGGTCCCGCTCCCCATGCGAATGCCT 
AAAGCTGTGTGTTCGCGGTGGCTGCGATAGGGAGTGGGTCTGCACCTGGCCCGTTTGGCCCGGCGGCTGGGACTGCCATTGCGCATCCAGGAGACTACTGCCTGGGCCCATG 
CCACTCGTCATTGTGGTAATGGTCGTTGAGGCGGTGGAAGCGCGCTTCAGCGAGGAGGGTCGTGTGAGTTCGTAGCACCCATTCTTGGCGCAGCCCATGGAGCCACCTACTC 
CGCCGGAGGGAAGGGCCAGCAGGGATTGACAGTGTGTGCCGCCAGTGGCACTACTTGCTCCTCCACCCCCACCACCTGCTCCTCCAGCGCCCATCGCCGCACAATTGTCGCT 
GTCGCTGAGCATCATGTCTTGGAGCTCCTTGGCTATCGTCTGGTTGGAGAACGTTCGCGATGTGGCCTGTCTCTGGCGCTGGCCCACCGAGCTGTGCTGCTCCTGCTCCGAC 
AGCTCATTCTCCACGTCCGCTGCCGTGTAGCGCTTGAATTTCTAGACACGCAAAGTGCACAAGGCGGTTAAGTCAATGTCGATGATAAATAACTAAAATAGAGTCAATGGGC 
GGCAAAAGGCGGCAAAGATAAAAGAAACGGAAACGGAAGTCACGCTAGGAAAAGCCGAAGCTTTAAACACTCTTGTAGGCATCATCATATACTTTAAAGGGAACCACACTGG 
TAAGCAGATGGCTTTAAGCTGATTTAAGACTACATACCCAAATGCCATGCGGCCATTGTTCTATCAACTGCACGATGTGCTAATCTCTCTAAACATAAAACTGAATTTCAGA 
CCGGTTTTAATGCGTATGGAGCTGCCTGGACTTGATTCAGTTTAAGTGAATTTTAAGCCTATTTTAAAATGCACACCAGGTACACATACAAATGTATATATGCAGATGCAAT 
TTTCGAAGCGAGCATCGTTTAGTGTTAGCTATTTAGTGATGCTAACTTAAAGCTCGACATAAATGAGTCGAATATGTAATTACCCTGGTATTCAAGGATGTAGCAACATTAG 
CAACATTTCATAACTAACATTTTTGAAAAGTATTCAACTCTAAAGCGCCAAACTGCTCTCTCGTTAGAAAACGGTTAGTTGAAAACTCATTAATTCAGCTATACGGGCTGCC 
AGAGTTGAAGTGAAAAACTTGTTTTATAGCGGACACTGGACCGGAAATTGGCACATACCCCGATGCGAGGGTATGGTATTTAAATCTGTGTGAGCTCCTGGCATTTGGGAAT 
TATGAGGCAAACGTGCCACAGTTTGTGCCGCACGTGGAGTGTCCTTTAAGCCGGAGTGGTTCGCAATAGAGATGACTTTCGCTGGTCGGGCCAGCGTTTGTTGTTTGCAGCT 
GCCGTGGTTTTCCTATTCCATATTCCGTATTTGTGCACTATGTTGGCTTTGGGGTTTTGGATGGCGGGAGACGTGCGAATGTTTGCAGGCAAACAGAAGGACATGCGGACAT 
TGGGACATGGTTGCGGATGGCTGCGACGGAGCTTACCTTGTTGTGAACACTTATGTCGGTCATATTGTCGTGCCTGGATGCAATGACACAGGAAATTCTTGCATACACATAA 
ATCATGACTGCCATGGGTATGAAGAAGGAGCCCATGGCCGAGAAGATGACGTAGCCCTCGTTCTGGTTGTATCGGCACTCGCGCAGATCCCTTCGTCCTGGCTCGTACCTGC 
GGGAAATGGAAGTGAGCAGGATACGCTTAATGGGCCACCAAGTTAGTCAACTAATGCAGGAAGTTCCCAATTATTTGACTACGGAGCTGTGACCCTCGGAGGGGTCAGAGGT 
CGCAAGAGCAGCAGACAATTGCCAGCTAATTATATTCGTTTGTTCAGCTGGGTGTCGCCAGTCAGTCTGCTCGGAGGTCTCGAAATTTATTCTAATTAATTTACACTCACAC 
TCGGCGACATTTGTAATGCTTAAAATGGACCAAAAAGATACCCAAAGTGGTCCTTCAAAATTATGCAATGGCATCTTTTGACACAATTTTCTGCAGTCAACAAAATTGTTGA 
TTTAATTTCATTTAAGCAAATTAAGCTTTAATTATTTTTGTAAGTTAAATAAATTAAACATCTCTTGGTTACATTTCGGAATTCTGCTGCCTACAGGAACATTTAGTATATT 
TCTTCAAAAAATCTGTTGACTGTGCTAACGAAGTTAGAT-ATAAACAAAGCACTTTAATTAATTCTCCTTTTAGTTAGTTAGACAAAAGTAGAGAGAACAAAATTTGTTCTTA 
AAGCTAGCAGATGCGGTATACCAACAAGGAAGTTGATGTTTAGGTTAAAATTCGGTGCATCTATTTAGTGTGTGGACTGTTGACCAACTTAGAAGACTTCTTGCTGCACTCC 
TTCTTCGTTATGGAAATAACCCTACATACTGATTATTTCTTCAAGTGTACTTACCATCCCAGCATGGGGGGACAGGTGATGGCCAACGCGAGCAGCCAGACGATTAGGATCA 
TGATGAGGGCCAATCGTTTGGAGCGCCGTTTACGGGAGTACGTGAGCGGTCTGGTCACGGCCAAATATCTGTCGGATGGACGTGAAGGATGCAAAGATTTACGATATCAGGG 
GAATACTATGGATGTTTTGGATAGGCCTATGCTCACCTGTCCACACTGATGGCGCACAGGCTGAGAATGGATGCCGTGCAGAGGAGCACGTCCAGGGAGATCCAAATGTCGC 
AGAGCACCCAGCCCAGTTGCCACGAGCCTGCAATCAAGATTCCAAATTAAAAACTTTCCGCCAGCCTTTTGCAGCGCGGAGTGGGGCAGCCAACTTAAAATTCCGGTCCAAA 
GTCAATTTGATTAGCAACATTTTCCATGGCTTTCAGCAAATGCCTCACCTAAGGCCAATTAAAGCGGCCAGCTCGGCAAATTTTGGCAATGTTTCAGGTCCCCTTCAACCCG 
TTAGTGGTTTAATTCTATACCGAAAAAGCACCGATTTCTCCATTTGACCCCAACACTTCGAGCTTCTACAGTGCTTTCTCCTCGTCGGTGGCTTGTCAAGCGATTAATTTTT 
AAATCAAGTGCCCCTTAATGGTCTCCCGGCGCCAAATGAAAAGTTGTTTTTGTGAAAGGCGCTAGGAGCGAAGGAACCAAGGAGCtCATTACACGGATATTGCCTATGGGCC 
AGCACACTTGGGTTGGTCTAGTTCAATGTAAGGTTGCTGAATTTGCTGAATCTCCAAGGACTCGGATTGAAGCAGTTAAAGGCAAAATTTGGAGTGAATTAGAAAAACACAA 
GGATTACTTAATTAATTATGGGAATGCTTGTTCAGTTGTAGTGTATGTGTATGTTTTTATCTAAAAACTGGAGTAGGAACTGGCACAAAGCAGTCGTCATATCAGCACTTTC 
AGTGCCTACCCCAATTTCAACTGGATGATAAGGCTCCAAAAAGCAAACTATAAATCCCGACCGTGTGGGTGTATTACGTCCTAACf TAATGATATATGTGCGTATACTTTGG 
ACAAAAATGAACTGAATGTGGGGCTTGGTAAATTGTTTGCCTCGAAAAAACCTGGGGGCATGAGTAATTATGAATGAGATTCAAAAGGTATCTCCACAGAGAGAAACTAGTG 
GATGAAAATCGGAAAACAAGCTTAATGTTCCATAAGACAGAAGCCAAGTGAGCTCAAATCAGAAGGGTTTAGCATATTTTCCATTCAACTTCACGAGTTCAGCAACAAAGAT 
CAACTATTTTCTGACGAGAATAATTCATTCCGATTCGGATTGTCAGTGGTCCGGATGATTATGACAAACTTTCGACATGCCGTTGCCTTTAAAGTATTTCCACAAAGTTGGG 
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CTCGGGAATTTTCCCCTTTTTCGGCTTTTTTTTTCGTGCTGAGGAAATTGCACGRGAGCCGRGGCAAAAAAATGAAAGAAATTCTTTCAGTTCAAAGAATACAMTCCATAft 

GGCGAGAGAGATACATACTCGCATATTTCACCTAGCAGCATTTACATAAATACACGCCCCCTTTGCCCCGCCCCACCATCTGACACACCTCCTCCTCGAGTTCCTGTTCGTG 

TCCGTGCTGCTATTCCTGTTCGTGCTACTTTGTTGGGATACACTTGGAGAAAAATATATATATATGCTAGTCCTCCTAGAAATCCCTACACATTGCTCTAAACAGAATCTCA 

TTGATATTTACAGGTTCGAAAGTTCCTTAAATACCTCAAGAAACATTGCTACTACATTTTTACTTCATGAGATACAAATTACAAAGACTTGTATTGGATTTGTTCTCAGTGC 

ATTCGCTTGGTTCTGGGCGCCAAGCTTTTGTTAGCAATTTTTTGAGGGATTTACCGGAATGGCATCTTCATCTTGTAATCACGCGAAAAGCCCGGCTAGAGGCGCCTCCTTT 

CGCGGAAGGACTCTAATGCCTTAATGGCGCTGCCACACACTCACACACACACTTACACACTCACACTCATACACACATAGACTCACCTATGAGATGGACGGCGACGGCGGGG 

GGCATGACGAAGATGCCCACGAGAAGGTCGGCCACCGCCAGGCTCATCACAAAGCAGTTGGTAATGGTGCGAAGCCGGCGAGTGGTCAATATGGCCAGTATGACCAGGGTGT 

TCCCGATGACGGTGACCACGATGAAGGTGGCAAAGACTGCAATCAGGATGATGCCCTGCCAGGAGAGCACCAGATCGTAGAAGTGCGCCCATTCGGCGGCCGCCGCCGACTC 

GTTGTAGTAATTAAATGTGGTCACGGGCGCCGTGGGTCCGGGTTCGGTGAGAGCCACTGCATCCGTACCGTTCGGTCCGCTGGGATATCCACCTCCTACTCCTCCTACTCCT 

CCTCCACCTGTCCCATTGCCCGCATCCTCATATACGATGGATATGTTTTCCGGCAGACCACCCACTCCCACTCCCGATCCCAGTCCCAATCCCCGCAGGGAGGAGGAGGAGG 

CAGCCACATACATGAGCGACGGATAAATGGCGCTTAAATTATTCAGCTGCAAGTTGACAGCCATGGATCGGCCAGTTGGGTTCACAATCGAGTGTGGCTAGCTCCGGCAGCA 

GGCCGCCTAGGCCCATCCATATCGTGCCATATCCCTGGAGTTGACGGAGCAGGAACACGGCCCAACTGTGGTCATTGTGTCCAGTTGCACGTCGGGTTAACTGGAATTAAAG 

GAAAGTTCGACATTTAGCAAAGTCCTTTGCATGGTAATAAAAACGAAGTCAAGAGGAAACAATAACCATTTGCATATCCCTTTTATGCTGAAATCTATTTGTGTTTGTTTTA 

ATTTTTTATGTATGCACGCATTATCTGATTTATTCCTTGTGACAATTTCCGAAATTAAATTGATTTCACAGCATTCCATAGAATCTCATTTAAATTGCAATCGTCGAAATTG 

TCAATTTCATTCATTCGCTTGCAATTAAGCATTCAACAATCTCAAGGCAAAAAGTGCACAGCATTGAATATACCCGTCTGTGTAAATTACGTTGAAAGTTGTCAACGGTGTC 

AAAAGATTCTTAAGATGGCCTTTGTATTGGGTTGCCATTACCGCGATACAGTTAAATACTTTACTTTCTAAGTAGATAAACGTTAATCCTTTAGTTAGTCATGCTTTCAACG 

AAAATGCCGAAGTGCGCTACTGTGAGCAATAAATCAAATTAAGTTTACCTTATCATCTAGAAGCACCCACTGGCTTGATTTAATTTAAAATGTATGATTCCATTTGCTTTGG 

ATTAGGTACCGAACTTAGAGACTTAAGAATAAATAATTCGTGGAGGTATTTTCTATCAGAAAATTAAAAATTTTTATACAAATATTCACGGTTTGATTTGGTGAAGAAAATT 

CAAGAAAATCATCAACTTCGGAATTCCTGGCGCATTCTAATAATCATTATATGAAAATTACAAATATAATGAGCTTTATACAAAGTAAGCACTATTACTACAAAGTGGCGAA 

ATGTATTACCAATAAATTTGCCAAGGGAATGGCTACATTTTGGGAATATTTTATTT 

(SEO ID NO: 82) 

Exon: 6448.. 5911 
Exon: 4171.. 4069 
Exon: 3988.. 3663 
Exon: 3132.. 2949 
Exon: 1945.. 1001 

Start ATG: 6448 (Reverse strand: CAT) 
Transcript No. : CT22855 

ATGGCTGTCAACTTGCAGCTGAATAATTTAAGCGCCATTTATCCGTCGCTCATGTATGTGGCTGCCTCCTCCTCCTCCCTGCGGGGATTGGGACTGGGATCGGGAGTGGGAG 

TGGGTGGTCTGCCGGAAAACATATCCATCGTATATGAGGATGCGGGCAATGGGACAGGTGGAGGAGGAGTAGGAGGAGTAGGAGGIGGATATCCCAGCGGACCGAACGGTAC 

GGATGCAGTGGCTCTCACCGAACCCGGACCCACGGCGCCCGTGACCACATTTAATTACTACAACGAGTCGGCGGCGGCCGCCGAATGGGCGCACTTCTACGATCTGGTGCTC 

TCCTGGCAGGGCATCATCCTGATTGCAGTCTTTGCCACCTTCATCGTGGTCACCGTCATCGGGAACACCCTGGTCATACTGGCCATATTGACCACTCGCCGGCTTCGCACCA 

TTACCAACTGCTTTGTGATGAGCCTGGCGGTGGCCGACCTTCTCGTGGGCATCTTCGTCATGCCCCCCGCCGTCGCCGTCCATCTCATAGGCTCGTGGCAACTGGGCTGGGT 

GCTCTGCGACATTTGGATCTCCCTGGACGTGCTCCTCTGCACGGCATCCATTCTCAGCCTGTGCGCCATCAGTGTGGACAGATATTTGGCCGTGACCAGACCGCTCACGTAC 

TCCCGTAAACGGCGCTCCAAACGATTGGCCCTCATCATGATCCTAATCGTCTGGCTGCTCGCGTTGGCCATCACCTGTCCCCCCATGCTGGGATGGTACGAGCCAGGACGAA 

GGGATCTGCGCGAGTGCCGATACAACCAGAACGAGGGCTACGTCATCTTCTCGGCCATGGGCTCCTTCTTCATACCCATGGCAGTCATGATTTATGTGTATGCAAGAATTTC 

CTGTGTCATTGCATCCAGGCACGACAATATGACCGACATAAGTGTTCACAACAAGAAATTCAAGCGCTACACGGCAGCGGACGTGGAGAATGAGCTGTCGGAGCAGGAGCAG 

CACAGCTCGGTGGGCCAGCGCCAGAGACAGGCCACATCGCGAACGTTCTCCAACCAGACGATAGCCAAGGAGCTCCAAGACATGATGCTCAGCGACAGCGACAATTGTGCGG 

CGATGGGCGCTGGAGGAGCAGGTGGTGGGGGTGGAGGAGCAAGTAGTGCCACTGGCGGCACACACTGTCAATCCCTGCTGGCCCTTCCCTCCGGCGGAGTAGGTGGCTCCAT 

GGGCTGCGCCAAGAATGGGTGCTACGAACTCACACGACCCTCCTCGCTGAAGCGCGCTTCCACCGCCTCAACGACCATTACCACAATGACGAGTGGCATGGGCCCAGGCAGT 

AGTCTCCTGGATGCGCAATGGCAGTCCCAGCCGCCGGGCCAAACGGGCCAGGTGCAGACCCACTCCCTATCGCAGCCACCGCGAACACACAGCTTTAGGCATTCGCATGGGG 

AGCGGGACCGCGAGAGACTGCGAAGCCATCACCACCATCCTCATTATCATCACCAGGCGGGCGTAACCACGACATCGACTAGCGGGAACACCAGTGCCAACACCAACAGTAA 

ATCGCTGTCCAATCGGATTACGTCGCTGAAGAAGGAGAACAAGACCACTCAAACCCTGAGCATTGTGGTGGGTGGCTTTATCGCCTGCTGGCTGCCCTTCTTTATCAACTAT 

CTGATCACCCCGTTCCTGGCCGAACACCAGGCCAGCCAAATGCTGGCCAAGGCCCTCACCTGGCTGGGTTGGTTCAATAGCGCCATTAATCCCTTCATCTACGCCTTCTACA 

GCGTCGACTTCCGGGCGGCCTTCTGGCGCCTCACCTGCAAACGCTTCTTCAGCGCCGGCCAGAAGCCACAGTTCCCCACGAACACCATGTCCATCAGGCGATAG 

(SEQ ID NO: 83) 

Start ATG: 1 (Reverse strand: CAT) 

MAVNLQLNNLSAI YPSLMYVAASSSSLRGLGLGSGVGVGGLPENISIVYEDAGNGTGGGGVGGVGGGYPSGPNGTDAVALTEPGPTAPVTTFNYyNESAAAAEWAHFYDLVL 

SWQGIILIAVFATFIWTVIGNTLVILAILTTRRLRTITNCFVMSLAVADLLVGIFVMPPAVAVHLIGSWQLGWVLCDIWISLDVLLCTASILSLCAISVDRYLAVTRPLTY 

SRKRRSKRLALIMILIVWLLALAITCPPMLGWYEPGRRDLRECRYNQNEGYVIFSAMGSFFIPMAVMIYVYARISCVIASRDDNMTDISVHNKKFKRYTAADVENELSEQEQ 

HSSVGQRQRQATSRTFSNQTIAKELQDMMLSDSDNCAAKGAGGAGGGGGGA5SATGGTHCQSLLALPSGGVGGSMGCAKNGCYELTRPSSLKRASTASTTITTMTSGMGPGS 

SLLDAQWQSQPPGQTGQVQTHSLSQPPRTHSFRHSHGERDRERLRSHHHHPHYHHQAGVTTTSTSGNTSANTNSKSLSNRITSLKKENKTTQTLSIVVGGFIACWLPFFINY 

LITPFLAEHQASQMLAKALTWLGWFNSAINPFIYAFYSVDFRAAFWRLTCKRFFSAGQKPQ.FPTNTMSIRR* 

(SEQ ID NO: 84) 

. Name: Octoparaine receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384651 

GTGCTCGACATTTCGCTTGCGTCCCGTAAAGAAATTGTCCACCACGATGACCTCGTGCCCCTGGACCATCAGATCGTCGACCAAGTGGGAGCCCACGAAGCCAGCTCCTCCG 
GTAATTAGTATGCGTTTGCGATTCTTATAGTTCAGGTATTTAACCTTCGGATACTTGCGCGGTGTGCTGGTCTGGAGGCTACGAACTTGTTCCTCCAATCTGGCCAAATTCT 
CCCTTGTGCGCTGGAGTTCGGCCTTTTGTTCTCGAATAAGAGAGCTCTGCTCCTCGTACGCCATCTGAAGGCTCCTTTGTAGCGGACTTTCCGTGGGCGGCCATTTGGCCTG 
AACCTCCTCGACGCCGGGAACGGAAACGGCGACCTTTCCGCTTGGGCAGAAGCTGGCCATTCGGTAAAGGTAGACCAGGAGCAGGAGGAGCAGCGAGATGGCTGCAACGATC 
TTCAGACGCTTTTTGGTGGCAGTCATCGTCGGTTATGGGTTGTGTACTGCAGGGTTCCAGTGCTTATCAGCAACTGATATCGCCACATTGCATTCTCGTTCGGCTTTAGAAA 
CTATATAAACAATACAATTGCGTTATCAGGAGGTGCCAGAGTGGCCAGATGGCTCAAACGGAATTTCTAAATGGATTAATAATCTAACGCCAGGGAAAAGCTCCCGCCTAAT 
TTTAAAATTTACATCAAAAGTATCGACAGGCGAACATTTTGATTTTGGATTATAAATTATAATGAATTGATAATGAATTAGACATTTTTGTTGGTGATCTTCCCCAGGGACG 
AATATTTTTCGTTTTAACGATAAAAGGTCGGAAATTGCCAAAAAATGAGTTTGAAAACTAAATTTGTTTGAGGGCTTTCTTAGGCTGGTCAGCAGGGACTACTTTCCGAATA 
AAGTGTTTGCCCAAACGGCTATTGAAGGCTATTTAAACTGCCAACGATGAGCATCGATCGCCAGTCGGGCAAAGTTCTTAGTGCCAAGTGTTGTAGTGCTCAGCATGAGGCT 
TCCTTGGGTGATTTTCTGTACAGTTCTGCTTTTAATTTTTACGAACAACTCAAATGCAGATATTCCCGGGTGCAACTACTACGACACGGTTGATATCTCATACATTGAAAGG 
CAAAACGATTCGTATTTATATGATGACATCGAAATTCCTGCTAGCTTAACTGGATACTACGAGTTCAGGCAGTTTGGGGACGGTTCGATTACGCCGATTGAAAAGCATTTAA 
GGGCCTGTGTTTGCAGCGTGCGGCCCTGTATTCGAATCTGCTGCCCAGCCAAGAACTTTTTGGCCAACGGAAAATGCGATGATGGTCTCAAAGAGGAGCTCGCCCGGTTCAA 
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ACCCTATATATACTTCACATACATGGACCTACAGGCACGAGTACCACTCACCGATATGGCTATTATCAGAGATGAGTTCTTTGATTGTGATGAAATGATTTACATCAGCGAC 

TTTAACTATTTCTTAGAAGAGGTTAGCATTCAAATCTTCAATAAATGTGGACTTATAGTTTGGTTCCAGGATGGAAAATTTTGGGTTACCGTTGACCTCTTCATGGAAAAAC 

AGGACTACTGCTTGTACCGGCACAACTTTGATTCGGATTTTCCAAAGTCCATGTGGATAATACGACACCGCTGCACAAGCCACATATCTCCTGGATCCTTAGAGAGTAATTC 

AATTATTATTTATAACAAAAGTCATCAATATCGTTTTCATAGCTATATAGAAAGAAACGAAAATAACTCCTTATTGGTGTGCAAACTGCTGACTGCATTCTGTGTGCCCTTT 

GCAAGTTTTTTAATTACAATAAGCTGGTTTTTTATTTCTTATATTTTCTTTTTCAGTTCTAATTATAACAATGATATGCTTTGTCCTAACAATCGCAGTATATCTATACATT 

AAGAAGCTGCGAAATGTTACTGGCAAGTGCATTGTATGCTGTATAGTTTCGAGGTTTATCCAGTGCTTGATCATGATACTAGATCATTTAAATCTATTGAATGGCATTTGCT 

CTCCAGCTGGTTAGTTCGAATTGGAGATATCTCTGGAAGCTTTAAATGTTATATCGTTGACTTCTTGCAGGTTACAGCTCGCACTTTTTCCGGATGGCTTCCAACCTCTGGC 

TCTCCGTCATCAGCTACCATACGTGGAAAGTCTTGACGTCGCTCAATCGAGTCGACCCTAACTATCGGTTCCTGCGGTACAACGCCTTCGTCTGGAGCACAGCCGCAATCAT 

GACGGGAAGTATTTATATAGTTAATCAGATTTGGGAAAACGATCCCAGTAAATGGAACTGGTTGCCTCTGGTCGGTTTTATTCGGTGCTCGGTCAAAGGTAAACTTTCAATG 

CGATTGCTTTTAAATCGTTACCATGGTTACGTATTGCTTTTCCATATCAGATTGGCACCCATCCGTCTGGATCTATATAAGTGGACCGTCGCTGGCCCTGAGCACTTTCAAT 

GTCGCCATGTTCGCCCTGACAGCCATTTACATTAGGAAAGTGAAGGGGGGTATAAATAAGTTCACAAATGAGGAGGAGGGAAGGATTAACTGCATAAACTTTGACAGCCAGA 

CGTAAGATATGATAAAACCTTGGACCTTGGTAAACTAGTTATCCTTGCCGTATTATCCACATAGTTACCTACAGTTCCTGCGGCTCTCCATCGTGATGGGCCTTACTTGGAT 

ATTCAATGTCATTCCGTATTCTGCACGGCTCCACATTTTCTGGGAATGGGTCGGGATAATATCCGAGTATTTTCACAGCGCGTTTGGAATTGTTTTGTTCGTTCTGCTCGTC 

CTGAAGCGCAGCACATGGACTCTAATGATGGATTCTTAAGCTTGTTTCTATATAATTCTAATAATTGTTTACAATTAAAGCTTTAAACAATTAAAGCCCCTCATTTTATTAA 

TATCCTTTGCTGAACGAAGGAATTGAATTGAATTTTATTGAACTCAATAAATAAATATAGTTTTAGAAAACGATGTACTTTAAACTGCCAAGAGGTCACCGCGACAAGCTTT 

TCACTTCGCCACCCTCTGGTAACACTGATTCTCCAGCCACTTTCGATTCTTTAAAAAACATCGATAGGTATCGCCAAAAAAATCAGGTGCTCTGCTTGTAGTTTTGCGATTT 

CATTGCAATTTTAATTTGCTTTACTGTTCAAAAAGAAATACAGTGCAAAATATGCCGAACGAAAACGCAGAGGACCTCTCGGGTCAAGAGCTAAAGCAGAAGGCCAAGGAGG 

TGGCCGACGCATCGGAGGCAATGCTCGAAAAGGTTGTTGCCGGCTTAAATATCCAGGACACGGCATCGACGAATGCAGCAGGAAACGAGGATGCGGAGCAGCCTGATGGTGC 

CAAGAATGAGGCTTCAGTGTCTGCGAATGCAAGTAAGTTTTCAATATGCCTGTGAAAAGTCATTTAGAGGCTCCTCCTGCGGCTGAAAAAATGAGGTTAGGTTTTGTCGGGA 

ACTACAGTCCAACCTCTCTAACTTGAACTGCTGTGCTATTCATTTCAATTCAATTTTCAATTTTCATTTTCTGTGCAATTACTCCAAAGATTAGCCCTAACGGAAAAATAAA 

CAACTCATAAATCATAAATACATATTTGATTTTGATGAAATATGAATATACTTACATAGATACAATTGAGTTTTTTATTACGAATAAAATATGTGTAATATAAGATTTCTAC 

CTTGTTGATTCCATTGATTGATGCTTCATTGAAACCCCAAAATTGGGCAAATTTGTTGTCATGGGAAACTAGTCCACTGTCGTGATGCCTTAAGCTTTTTTTGGGGCAGATT 

AGTCATTCCAGTTTAAACTAATCGAAAA 

(SEQ ID NO: 85) 

Exon: 1001.. 1673 
Exon: 184 9.. 2025 
Exon: 2087.. 2338 
Exon: 2403.. 2836 
Start ATG: 1001 

Transcript No. : CT22963 

ATGAGGCTTCCTTGGGTGATTTTCTGTACAGTTCTGCTTTTAATTTTTACGAACAACTCAAATGCAGATATTCCCGGGTGCAACTACTACGACACGGTTGATATCTCATACA 

TTGAAAGGCAAAACGATTCGTATTTATATGATGACATCGAAATTCCTGCTAGCTTAACTGGATACTACGAGTTCAGGCAGTTTGGGGACGGTTCGATTACGCCGATTGAAAA 

GCATTTAAGGGCCTGTGTTTGCAGCGTGCGGCCCTGTATTCGAATCTGCTGCCCAGCCAAGAACTTTTTGGCCAACGGAAAATGCGATGATGGTCTCAAAGAGGAGCTCGCC 

CGGTTCAAACCCTATATATACTTCACATACATGGACCTACAGGCACGAGTACCACTCACCGATATGGCTATTATCAGAGATGAGTTCTTTGATTGTGATGAAATGATTTACA 

TCAGCGACTTTAACTATTTCTTAGAAGAGGTTAGCATTCAAATCTTCAATAAATGTGGACTTATAGTTTGGTTCCAGGATGGAAAATTTTGGGTTACCGTTGACCTCTTCAT 

GGAAAAACAGGACTACTGCTTGTACCGGCACAACTTTGATTCGGATTTTCCAAAGTCCATGTGGATAATACGACACCGCTGCACAAGCCACATATCTCCTGGATCCTTAGAG 

ATTCTAATTATAACAATGATATGCTTTGTCCTAACAATCGCAGTATATCTATACATTAAGAAGCTGCGAAATGTTACTGGCAAGTGCATTGTATGCTGTATAGTTTCGAGGT 

TTATCCAGTGCTTGATCATGATACTAGATCATTTAAATCTATTGAATGGCATTTGCTCTCCAGCTGGTTACAGCTCGCACTTTTTCCGGATGGCTTCCAACCTCTGGCTCTC 

CGTCATCAGCTACCATACGTGGAAAGTCTTGACGTCGCTCAATCGAGTCGACCCTAACTATCGGTTCCTGCGGTACAACGCCTTCGTCTGGAGCACAGCCGCAATCATGACG 

GGAAGTATTTATATAGTTAATCAGATTTGGGAAAACGATCCCAGTAAATGGAACTGGTTGCCTCTGGTCGGTTTTATTCGGTGCTCGGTCAAAGATTGGCACCCATCCGTCT 

GGATCTATATAAGTGGACCGTCGCTGGCCCTGAGCACTTTCAATGTCGCCATGTTCGCCCTGACAGCCATTTACATTAGGAAAGTGAAGGGGGGTATAAATAAGTTCACAAA 

TGAGGAGGAGGGAAGGATTAACTGCATAAACTTTGACAGCCAGACGTAAGATATGATAAAACCTTGGACCTTGGTAAACTAGTTATCCTTGCCGTATTATCCACATAGTTAC 

CTACAGTTCCTGCGGCTCTCCATCGTGATGGGCCTTACTTGGATATTCAATGTCATTCCGTATTCTGCACGGCTCCACATTTTCTGGGAATGGGTCGGGATAATATCCGAGT 

ATTTTCACAGCGCGTTTGGAATTGTTTTGTTCGTTCTGCTCGTCCTGAAGCGCAGCACATGGACTCTAATGATGGATTCT 

(SEQ ID NO: 86) 



MRLPWVIFCTVLLLIFTNNSNADIPGCNYYDTVDISYIERQNDSYLYDDIEIPASLTGYYEFRQFGDGSITPIEKHLRACVCSVRPCIRICCPAKNFLANGKCDDGLKEELA 
RFKPYI YFTYMDLQARVPLTDMAIIRDEFFDCDEMI YISDFNYFLEEVSIQIFNKCGLIVWFQDGKFWVTVDLFMEKQDYCLYRHNFDSDFPKSMWIIRHRCTSHISPGSLE 
ILIITMICFVLTIAVYLYIKKLRNVTGKCIVCCIVSRFIQCLIMILDHLNLLNGICSPAGYSSHFFRMASNLWLSVISYHTWKVLTSLNRVDPNYRFLRYNAFVWSTAAIMT 
GSI YI VNQI WENDPSKWNWLPLVGFIRCSVKDWHPSVWIYISGPSLALSTFNVAMFALTAIYIRKVKGGINKFTNEEEGRINCINFDSQT* 
(SEQ ID NO: 87) 

Name: mth-like 7 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384621 

TGAACCTGCACCACCAGCCAAGACCCCAGCCAAGACTCCGACCAAGGCAGCTGCTCCAGCCGCTGTTGCCAGTCCCGCAGCTGCGGCAACGTCGGCAGACGCCTCTCCATCT 
CCGGCCAAGAAGGCAACGCCTGCTCGCGCTGCCGCCGGAGCCAAGGCCACGCCGCAGCGACAACGCGCCCGCGGTCGCTACAATCGCTACTAAGTAGACGTCGAAGGAATTC 
GTAATTTAACATAACATAAACCTTAATATACAAGAAAAAATTTACATTCACGCCCAGGAGGAGGATGAAGAAAGAACAGATCAAATCGATCTCTTCCCGCAATTTGACATTT 
GCATCCCAATTTGCATATACACTGAATGAATGGAAACTGAGAAGAAGACGGTCAACTTTTAGTTGAAACAGTTGAATATATTGTACGACGAAGCTTCTTGAAACTGACGAGG 
GAGAACACAAACAAGTTAATTGAATCGAGAATCGTATTACCTACCCGATTCCCCCCTCACAGAACGCATACCATGCCAGCCAGCGAACCAACCATTCAACTTATGTAATCAT 
ATAAATTCTATAATAAACAAAGAAACAAAACTAGTTGTAAAACAAACACGATTCCTGTGTGTCATTGCGGGATATGGAAATTGGCTTTCAATCTGTTGCAAAATGAGATTCA 
GTAGCAGTGTATATATCTATGAGGGCTGTACGATTTCGTTTTAGCGCACAATTAATACATTTAATACTTGATCTTTTTACACAAGCGTTATATTTAATGCTACAAATCGGTA 
CAGAGATTAAGTGAGGTGAGAATACATACATATAAGGAAAGCCTTGATATAGTAAGATACGAAGTGGGATATATTTATTATTTACAGCTTACGAGACTAGGCGCTACTCCCA 
GGAATGCTCCGAGGGAGGCGGAAACGCTTTTTTGAGTGAGATCCCACTGAGTCGGGAAATGGGATTAAGTACTTGGGGGCGCGCGGTATGAAGATCGAGTGGAATCAGTTGA 
ATTCCAGAGTATTCGCCTCCGTCATCGTCGTACGCATGCGGCTCTGGTCGCTCTGGGAACGACGAAGTCCGCCCGGACGCCATCCGCTCCTCCAGCGCTTAATGCATCCCAG 
CAAGGACCAGTTGATGGACTTGGAGCGACTCAGCACATAGACATACGGATCCGAGGTGAAGTGCAGCGCCGTAAGAACATCGGCGATGATGAAGAACTTGTTCGATGCGGGC 
ACCCGATTTGGAGCTATAGCCAACGGGATGGCGATCTAAAACGGAGATAATTGTTGATAAAGGCCATGTAAATTTATTATATTATATTAGATTTGCATAATACGGAATGGTT 
GAAGATACTAGACGTTTAGTAGTTACTATCTGCAAATTTAGAATGGTATCCACCCAGTACTCACCATCTGTGGCATCCAGCAGATGACGAACGAGATGCTGAGGAACGCCAT 
TAGTTTGGCGMCTTGATCTCGACTGGCGAGGAGTCGGTGGCCGCCATTGTGACGCTAACGCTGTGCCTGTACTGCCTTGCCGGCTGTACACTGCGATGGCTGTTTCCGCTT 
CCGGTGCTCAGCTGCGTCTGGTAGAGGGTGGTGCCGCTGCTGCTCTCGGGATCGATGCTGATGGCACTGTTTTTGTCCCTGGAAACCAGGTCGTAGTGCATATGCCGCTTGG 
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CCGTGCGACTCCTTCCGATCACACAGAGCAAGGTGTGGGCCACGAAGAGGTTGCfiGGCCACGATTACAATGCflCSGCAGGGTACCTGGAATACftTCGATGGTCAMCCAATA 

TTTGAAATGTAACCAATTTGATAGGAACTTACCGAAGACCATAAAGAGCACCGCATACGTCTTGTTCCACACGCCCGGCGCATCGCGATATCGTATACACTTAAGCTGGTCC 

GGATTGGATTCATCGATGTATGCACCAAAACCAACGAATGGCAGGAACGTGATCACCACGGCGATCATCAGAATGCTGTTGBTGCTCTTGCGAATGAGCTCGTAGGTAATGT 

GCTATGGCGAGAGAGATTTGTGAATATTTAATGAATGAGTTGGCAATGTTTGTGATTTCCTGTGTGCACTTGTATCATTTCTTAAAATTAAATTACACTAAGAATAAAATGC 

ATAGAAAATGTTATTTATTATACATATATTGCAATCAAGTTCATACCTTGTGGTAGATAAAGGGTCTGGCTAGGGCCATCCATCTTTCTGCCGCCATAACTGCTGCAATGCA 

TCCAGAACTGAGCCCGAAAAATCGCCAAACCACGAGTCCCACGCAATCCACTCGGATGAACGACTGCAGGACTTCCTTTGAGAGATACATTTTCAGCAGCGTCGTGGTCAGC 

ATGCCCAGTAGAGCCACCAGATTGTTGGTCGCCAGGCATCTGAAATAGTTGAGAGCTCCGTAAATTTCGGATAGCTTAGCTTTATGGCATTTCGAAGCTCATAAATCGGAAA 

GTAAATGCTTTTCCTTGCCATTAGCGCCCAAATTGCCTACAGTTCGTTGGGTTCTTTAATAAGCTACGAATATAAGTTGGGTATAAGATCGATAAAATGTAATAACGAATTG 

CTTTTAATTAATTGCCACTTTTATAAACCGAAACTTTCAATCAAGATACCTCAAAGCTGCTCCATATGGTTGTACAACTATCGATTTTGGTTATCTAAGACGTGACTCGGTC 

CTTAAACGCTTTGATTCGAACGCCGACCAATAGCAATGAATAATTATTTTGGTTCGCAAAATGCTCAATTATAAAAAGGAAATATCTATTATATGGATATTTAAAAATTTAA 

TTCATTTTTTTGAATGTTTATATTATAAAGGATGCTCCTCAAATTTCTCCATTGAATATTGTGTATATTCACTTTATATTCGTTCATATATTTAACATTTGTATATTTATAC 

TAATATAAACTAAAGTACAGATCCCTGTATTTTTATCTACTTAGTGAAATATGTGACTGTCAATGGTGACTTAGATTAAGTGTTTCCTTTTATTGGTATCATAAAATATTGC 

ATTTGTGGGGCACGAATGTGTTTCGATTTGATTGTTTTTGGGGCGCCGTTGACACAGGAAACTCATTTGTCTCCGTGTGAAATGAAACTATATGAATGTAGGGCTTTTTTCC 

AATGGAAATATATGTATATATATATAAAACACACGCTTGGAATAGAACACAAAAATGGGATTGACAATTCCCGGGGATTGGCCTAACAGTCCGTGCATCGTTAGGCATCGCA 

ATGACTTTATTCATTTTTGTTTTCATTATTTGTTTGTGCACATTGCACCTTTATACCTGATTTCAGCGAACAAGCGATTAAAATGCATTTTATGGTGCATTGTAAAAGTTTC 

GCCAAAAGCTAAAATGTCGGAACTGTTCATTACAGGGCTAAAATGTCTAAATCTTCAATTCGACGATTTGCAGTGCGAAATGCAAAATCGTTTTGCTAACTTGTAAAGGCAA 

AATCATTGTTTACTCTAGCAACGGGAAAATACACACAGCTATTCCGCTTTTCGTTCATAGTCAAAAAGCATGAAGGGATACCCTGATTTATCGACACCGCTCAAGTGCCATT 

CACCAGCTTCATTCAATCAATCAAATTACAGATCAAAGGCTCTGCATGAAGGAGGCCTCATGCCTGAAGTGGAATTCGCGTTAATTTATGCTTCCTTGACGGCTCAAATTGC 

TGGCGACAGCAATTTACGCATATTACGCATACGCACTGTCGTCTCAGCAAAAAAAAAATGTTGGGAAAGGAGCAAATCGCAAACAAGCTCACTGAATTCGCCATGGGCCAAT 

GCTGATAAATGTAATATAAATTGTGACAAATATTTGGCTTATGAACTAAAAGAATTTCCGTGTGTGTCATAGATTTTATCACTTCTTTACAGTACACACCCTACCGAATGAT 

TAACACGATGTTTAAAATATCAACAAAGTCTATTATGAGTATTCGTATTGAGAAGCGAATATTTTACAAGAAATTATTATTATTATATTAATATTAAAAGGAATTTGGTGAA 

ATACACAAGTTTTATTTCACTCATATTTACAAAGGAATAGATTATCATTTTTGAGTATTATTATATCTACCAGTTTTTTGCACCTAAGCAAAGTTAATTTAAATACATATTC 

ATATAAATCTTTTCCCCAGTATGCACATTTGCCAATCAAGTCAACCACCGATCGCCGTCTTCAGTTACACCATAAAGCTTTTAATGCATTCGCAATTGGCTGCATTCCCACG 

TCGCCGATAAGATTGACAGAAAACAGACTGGAATATGGAATATGGACAAAATCTCAATGCCCAAAACACATGGCCAACGAACTAATTCCGTCATTTGATAAGATATGCAAAA 

ATAAAACAGCAAAAACCATGCCAAGCAAAACAATTCAAACTATGCAAATCAGACCCGCATAGACATATTATAGAGCACCAAAGTGGCCGCAGTTGAGAGCAGGTTGAATAAA 

CTTACATAGCCATGAACTGGACACAAACAGTGGCAAACATTAACTGCATGAAATGAGTAGCTACACGCTGAGAAAATATGCAAAATGTCAGCTCAATAATATTATTAACAAT 

AATACTATATGGTTTTGCATGCTTAATATTTATGCAAATCAAATAGAATTTGGTGAACTACACAACTATAAATTACTACTAAGATGGGTACATTTTGAAAGATTTTAGGGCA 

AGCAAAACCTACTTTTTGCATTAATATATTATTATTTCCACTATGCATAGCTGCACTTACCGCAGCATGAGCGTGTATTTGCTGTTTTTGTTGAGCTTCTTGCGCGCCAAAA 

TGAAAAGGGCCAGCGAATTGCCAAAAACGCCCAGAACCATGATAATAACACCAATAATCAGGCGATTTCGATTTGCATAGAGCGCTTTCGGAGGCGCCACAGTAGTTGAGTT 

GTGCGGCATCAACAGGTCGAGCACAGGTGTTGTGGTTTCCATTTTGCCGGTATGATAATTGCGTTCGATTGCAGGTGATTCAGGTGAAAATCTAACTGGCCGCAAACGATAT 

TTCTTTTTCATAAATGAGCCTGCCGCAAAGGCAAGGCAGCTCGCAAATGCACTGGTAGATGGAAACCCGTTTCGATAGCCAAATTGGGACTTTTTTCCTAAGACACTTAATT 

AATGATTAGCGCACGCGACACTTGATCAACCAACGACCGACCAATTCGAAGGCGATTCGATTCGATCCGAGGAGCGCGTCCGCAGTCAGCCAGGCGTCGATTTAGAATGACT 

GCCGACCGTGCCGAGCAGAGCACTCGTCGGCTGCTGCCCATAGCTCCAAGCTATCCAAGCCATCATATACACGTGGGGGTGGTGGTAGGGGCCAGTTCGGAGCGAGGCCAGT 

TCGGCTGGCGCCGCCGCCGGTCGGATCGCATCGGATGTAGCATACAGCACATAGACACCCCCAAGCGAAGCGGCCACCCACATCCACGCCCAGCGATGGCGACCACCCCATC 

GAAAATTCACAGCTCGTGCCAGCGCCTTTGCCTTTTGGACGTGTGCGAACGGTCGGAATAGCACGACGAATTTTATTGTTTTATACTCTCTCCACCTGCGGCTCACTTTAAT 

TTCACTTTCAACTGGATGTAATATTAATTATGCAGTTTTTAAGCTTCCATTTGATTTTATACAGTTGACTAAATTAGAATAAAATAATATACTCAAAAGTTTACTTCAGCAA 

TATATTTACTCATGTTTCTTTGTCATTGAAAAATTAACAAAACAAAAAAACAGCTATTTCATAGATAGATTAATTATTATTATTAAGAGTCTTCAGATTCGTCATCTTTAGT 

ATATGGTGTGTCATTTGAAAACCTATTTTTGAGGAATATATGAACTGAAATCTGCGTATACTTTAATTACTGTAATGTATTAAAGTATTGTTTCTTTTATACAAAAAGTCTA 

AAAATATAAGCAAACAAATTATTTTACTGAAAATTTGTTTTTTATGCCACTTTTAAGAAATTTTGCGAATTAAACATATATTTTTAAAGCTAGGAGCCTGGTAATTATAGTA 

GTTTAGCTTTAACTCTTTGCTGAAAATATTTCCAACTAATTTATTTGTTTTCAACAGAATGCATTACGAAAGATAAAAATTAAATTTGCAGTTACTAAGCGTTGCAGTTGCA 

CCTTTTCTTATTCTTATCTTCTTAGTATATAGTACTGTTTTGTTGGGCAAAAATGTTTATTGTTTAGTTTCAACTGCC 

(SEQ ID NO: 88) 

Exon: 5238.. 4765 
Exon: 2391.. 2175 
Exon: 2017.. 1825 
Exon: 1764.. 1409 
Exon: 1267.. 1001 

Start ATG : 4970 (Reverse strand: CAT) 
Transcript No. : CT23019 

actgcggacgcgctcctcggatcgaatcgaatcgccttcgaattggtcggtcgttggttgatcaagtgtcgcgtgcgctaatcattaattaagtgtcttaggaaaaaagtcc 
caatttggctatcgaaacgggtttccatctaccagtgcatttgcgagctgccttgcctttgcggcaggctcatttatgaaaaagaaatatcgtttgcggccagttagatttt 
cacctgaatcacctgcaatcgaacgcaattatcataccggcaaaatggaaaccacaacacctgtgctcgacctgttgatgccgcacaactcaactactgtggcgcctccgaa 
agcgctctatgcaaatcgaaatcgcctgattattggtgttattatcatggttctgggcgtttttggcaattcgctggcccttttcattttggcgcgcaagaagctcaacaaa 
aacagcaaatacacgctcatgctgcgatgcctggcgaccaacaatctggtggctctactgggcatgctgaccacgacgctgctgaaaatgtatctctcaaaggaagtcctgc 
agtcgttcatccgagtggattgcgtgggactcgtggtttggcgatttttcgggctcagttctggatgcattgcagcagttatggcggcagaaagatggatggccctagccag 
accctttatctaccacaagcacattacctacgagctcattcgcaagagcatcaacagcattctgatgatcgccgtggtgatcacgttcctgccattcgttggttttggtgca 
tacatcgatgaatccaatccggaccagcttaagtgtatacgatatcgcgatgcgccgggcgtgtggaacaagacgtatgcggtgctctttatggtcttcggtaccctgctgt 
gcattgtaatcgtggcctgcaacctcttcgtggcccacaccttgctctgtgtgatcggaaggagtcgcacggccaagcggcatatgcactacgacctggtttccagggacaa 
aaacagtgccatcagcatcgatcccgagagcagcagcggcaccaccctctaccagacgcagctgagcaccggaagcggaaacagccatcgcagtgtacagccggcaaggcag 
tacaggcacagcgttagcgtcacaatggcggccaccgactcctcgccagtcgagatcaagttcgccaaactaatggcgttcctcagcatctcgttcgtcatctgctggatgc 
cacagatgatcgccatcccgttggctatagctccaaatcgggtgcccgcatcgaacaagttcttcatcatcgccgatgttcttacggcgctgcacttcacctcggatccgta 
tgtctatgtgctgagtcgctccaagtccatcaactggtccttgctgggatgcattaagcgctggaggagcggatggcgtccgggcggacttcgtcgttcccagagcgaccag 
agccgcatgcgtacgacgatgacggaggcgaatactctggaattcaactga 

(SEQ ID NO: 89) 

Start ATG : 269 (Reverse strand: CAT) 

METTTPVLDLLMPHNSTTVAPPKALYANRNRLIIGVIIMVLGVFGNSLALFILARKKLNKNSKYTLMLRCLATNNLVALLGMLTTTLLKMYLSKEVLQSFIRVDCVGLVVHR 
FFGLSSGCIAAVMAAERWMALARPFIYHKHITYELIRKSINSILMIAVVITFLPFVGFGAYIDESNPDQLKCIRYRDAPGVWNKTYAVLFMVFGTLLCIVI VACNLFVAHTL 
LCVIGRSRTAKRHMHYDLVSRDKNSAISIDPESSSGTTLYQTQLSTGSGNSHRSVQPARQYRHSVSVTMAATDSSPVEIKFAK1MAFLSISFVICWMPQMIAIPLAIAPNRV 
PASNKFFIIADVLTALHFTSDPYVYVLSRSKSINWSLLGCIKRWRSGWRPGGLRRSQSDQSRMRTTMTEANTLEFN* 
(SEQ ID NO: 90) 

Name: Prostaglandin receptor-like 
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Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000012711419 

tacactuat gaatattcaaacaaaactcttcaacttgtctggcatatatccatagacaattacgcgggtatgtatgctgggttttgtgttcttaaaatcgtatatctgctgc 
aggattgacatcgaactgcttaaccccacgacggcaaccattcgtcgaaaaagtttgaacacagtttatatttttgcaatattctaagtactataacctaccaaaataactg 
taaaagaaagtttctatgaacgtttgaaatcaaaattaggtccgattttgcgtagttttcgacaaagtgtcaccgtcaaaagggcttcgactggcatatccctcaaagggtt 
aacatgcttttctttggtcttcttatggcctacccccttcatatgtctgcttttcattgatatattctgcggaacaacaaaagaagtattattctatattgggatatgcttt 
agagttttggtttgtagggttttctagtctcggtaaatgtgtatatcttatgcttaatttattttattagctatcatatcatatactttgcatggttatcatagattggtta 
ttaacattaaaatatttattgtctaaaacatttcgccccgcttagtgccactctagacctatgtatgcatcccaaatcggattgcttgttttcccccccaacgattctaact 
atagcacacacctttggctacttacagataactcttccctgtccaccgattaaattttgaatttaataagtgtttcattctaatactttgctgatggtcgaaatccagcaga 
tattccattaagcggagcactttcgttgggtatgtatagaaaacttatgggcctaaggaggtgtaaggggtttgggaatggaaaagtttgcaaggcgccagactctcggggc 
tctgaagtgtctggtcctttgcgcaagtgcagtgcaaaatggaaatctatcagggatttaactcaaccaccagcagcttaaccacttcagatccccaaaccatcttaggtca 
tgtgcagatcccccttgaggatccgcgtaaaggtcttcttgaactgctgattggccagagcgtagcaaaaaggattcatcgggctgttggcgtagcagaggaagtacgagaa 
catgtagaggtgttcattgatgcagggcggatgccggcagaatcccicgacgagggccaacacatggtagggtgtccagcaggccacgaagcagcccaaaatgaacgagatc 
gtgcgaaacgctttgcgagcacgattctccgacttggacttctgccgtccacccactccaaggatcaggggtagagccttcttgctcttgaagtgctttcctatcgaatgaa 
tgaatgcccttttgctggagtttcggctggccgccaaactcttctcctccctgttggcatcgtcatcccccgcgttctttttggtggcaccaacaccgtcttgttccccgcc 
tccacccacaccgcccgtactgctggtggtgcccgaggtggagggcatcttgttggtattactgtccgaggtttctcgctcctcaatcctttccagtggcaccggtcgtgtt 
acacccgccgccgactggggcgttcccgtttgtggcgttgaagtctttaccttgggtgagagtggcattggtggcatcggggaatagagtatggcatccttgctgctggtct 
ccataggactggaggcccggcgatggggacagtccgcagctagctttccgggcacatccaacaggcagatggcctcaatgcagctctcccgcctcaccttagcaaagtcctc 
gattcctgtgtacgagacgagaagacgtttttcaatggccaccgcctcctcgatactctgactctgactctggttgtgattcggattcaggttgaggctcaggctttggctg 
ggactctggttttggctgggagtttgactgaggctctggttcaggttcgattttggtggatttcggcgtgcgggcggtggtggaggcaggactggtggtttcccctggacgc 
tgtcattcggggaggtcgtcgagttggccgtcacactggccagcatggcggagctctcgtccatatagcgtagctccgagttcgtcatcaggtcataggtcagggccagttc 
actgttaccggaactattcccaaagggacgatccgagtagtcatcggagagcggggtggggcactgaaaggcgtcgggaggtggcaaaatggtctgcaccagaccctgatgc 
acttgatcgctagggacctcgatgggtgccaagtccacgggggacaggggatcactggtgttgagtaatgttggaggagcggacaggtcgaccaaacgatcatgctcggcta 
tctgggacagtgtggagcaggatgtgcgcttgggcctgtccagcggtgggttcggctggttttggtggtgctggtggtgcggttgttgctggttctggtggtggtgcaggga 
gcgcatctcttccttgctctgcgcccgctgtaacggactctggttgtgatggtgctgttggtggtggtggtactggtgcatggcctggatcgacttggagttagtggtgctc 
ttctgcaagttacctttgccacgagtggccagcagggcggcctgtgccccaaagaccagaccaatcgatgaccgcttcctcatgttgttcaatttctgctgggtgatcaact 
gctgggcctggttgacggatgattcctcatcggaatcgaaggcaggactgctggatctctcgctcttttccgcatccacggctgcctcacgttttttcgcctcctcctggat 
ttttgctgccgacgctctcctctgttccggtgtcatcttctccacttcctcggtggccacggccagaggatttgcctgaccggaaccacttaatctcttaatcactgtggtg 
caggcactatccaaatcggtttgatctcccgcaagttccactttggtctttaggattttctcctcaatcacaccaatgccagctgcatgacctgccattccggacatcgctc 
cagcactcagtgccaccatggattgcattttccgttgcttcgcctcactgcgcttttgcatatcataggctgttttgtagatcccggcatataagacaaaaagcacaattag 
tgtggtccaatagtagccaatgattagggccgtattaaagatcggatccttgagaaactgaacggcgcactggccaggaagtagatgccgcttgccggtgaagtgctcccat 
ccaaaaatcgagatgaagaagagcagagcgggaatgatccaagtgattgtgaccatatatatcactctggttctcgtcctccaactccggtacttggcggcaatcttcacag 
agcagtagcgatctatcgtgatcaacagcaccgtgtactgggacaccagacacacagtatagtcgacggaaagccagagatcgcagagcatcggacccaggtcccaataacc 
cttgagcacgtatatcgtgtagaagggcatcgaaacggttcctgaaatatgggtttatagtacaattaataattttttttttttgaatttaaggtaaaatagttactaccta 
ttagcatatccgtggctgctagggaggcaataaagtagttgctgggctgccgaatattcctgtccacgatgaaggccagtagcaccaggatattaccgcccaccgtaagaat 
gatacagatcgccaggcatatggctatcaggatcgtctgccacagtgcaaatggtggcaaaacaggaccaagaatctccgagtcggaactggatcccgacatggtggaaccg 
ccatcagaggattgatttctaacagccaacagaaggttcgtattgttcacatcgaagttcctgtaccattgcagctgttcattaaccggccaggaattgaagtcactgctct 
cgccatagatcttggccgcctccaacagacgtttgcactccgtgtgcgtcaggactcgcttgaaaatgaagctgcacagcttatcccgctcccttctaaggctcctggaagt 
gcgaaagtaaatatatgcttaattagagttgacataaaactatctctatctctatctctatctctatctctatctctatctctaaactaatgtccttgaaaacaaattctaa 
caatgtatctttttaagatctccattttctaaaagctttgccactgaaattactattctaataggcgcctttattctattaagacaactcaccagtcggtgatccacttgtt 
cttgatggtatagatggagtcgctggagttgccattcaggccagtcagctcctcctcgcccgccgacttgttaagcgaccgcatccttaaccgctccgccctcttcccgctc 
tggccacgtcccgctccctgctccccgcacttcgcctctgcgctccacaacctctgcggtttttatggtaatttcaagttgttcctccttaatgtgctgtcgacggcggttg 
tctcggaatgcttacattgcctccatggaatgccaagacttgtctccggtcaggtgcagcatctgcaagaggcaacaaaagcagccacagtcttggttagcacttgaaaaaa 
tatagaggttattaatattatatgatataatatatatatgcatataaaatatgatataatatatatatgcatataaaatatgatataatttatatatgtatacataatatga 
taaaaagtctgaatttcagaattctgccagaagtcgttttccatccaaaaattaattgttttttaatcgaatttttggtaataataatccgaataaggaatggaatatttcg 
ttgagatcgtaaataaattttaaatccgtgtgtgtttgatttggaaccagtcgttttccatccaaaaattaattgttttttaatcgaatttttggtaataataatccgaata 
aggaatggaatatctcgttgagatcgtaaataaattttaaaatatatgtgtgtttgatttggaaccatttatttttttcacgatgttgccactttccaaaaattcgaaaatt 
gacaaaaaaaatatttttccgttttagttgcgcgaccatttttagccaaataatagtaaaaaatctatttgaaaaatacattgtcttcattagatggtggaaaataatgaaa 
atatgaaaataatggaaattaaattaaatcaaaattagtaatgaaaatcaagtatttactaaataaagaaactttcttttcgtgtagtcttcgtggacgtcttcgcttccgc 
caagtttaaactaattttgcaaccgttaactggcaaccggcgactggcgatgcaatttgcttgtggcagttgcttc 

(SEQ ID NO: 91) 

Exon: 4340.. 4237 
Exon: 4024.. 3582 
Exon: 3513.. 1001 

Start ATG: 4340 (Reverse strand: CAT) 
Transcript No. : CT23924 

ATGCGGTCGCTTAACAAGTCGGCGGGCGAGGAGGAGCTGACTGGCCTGAATGGCAACTCCAGCGACTCCATCTATACCATCAAGAACAAGTGGATCACCGACTGGAGCCTTA 
GAAGGGAGCGGGATAAGCTGTGCAGCTTCATTTTCAAGCGAGTCCTGACGCACACGGAGTGCAAACGTCTGTTGGAGGCGGCCAAGATCTATGGCGAGAGCAGTGACTTCAA 
TTCCTGGCCGGTTAATGAACAGCTGCAATGGTACAGGAACTTCGATGTGAACAATACGAACCTTCTGTTGGCTGTTAGAAATCAATCCTCTGATGGCGGTTCCACCATGTCG 
GGATCCAGTTCCGACTCGGAGATTCTTGGTCCTGTTTTGCCACCATTTGCACTGTGGCAGACGATCCTGATAGCCATATGCCTGGCGATCTGTATCATTCTTACGGTGGGCG 
GTAATATCCTGGTGCTACTGGCCTTCATCGTGGACAGGAATATTCGGCAGCCCAGCAACTACTTTATTGCCTCCCTAGCAGCCACGGATATGCTAATAGGAACCGTTTCGAT 
GCCCTTCTACACGATATACCTGCTCAAGGGTTATTGGGACCTGGGTCCGATGCTCTGCGATCTCTGGCTTTCCGTCGACTATACTGTGTGTCTGGTGTCCCAGTACACGGTG 
CTGTTGATCACGATAGATCGCTACTGCTCTGTGAAGATTGCCGCCAAGTACCGGAGTTGGAGGACGAGAACCAGAGTGATATATATGGTCACAATCACTTGGATCATTCCCG 
CTCTGCTCTTCTTCATCTCGATTTTTGGATGGGAGCACTTCACCGGCAAGCGGGATCTACTTCCTGGCCAGTGCGCCGTTCAGTTTCTCAAGGATCCGATCTTTAATACGGC 
CCTAATCATTGGCTACTATTGGACCACACTAATTGTGCTTTTTGTCTTATATGCCGGGATCTACAAAACAGCCTATGATATGCAAAAGCGCAGTGAGGCGAAGCAACGGAAA 
ATGCAATCCATGGTGGCACTGAGTGCTGGAGCGATGTCCGGAATGGCAGGTCATGCAGCTGGCATTGGTGTGATTGAGGAGAAAATCCTAAAGACCAAAGTGGAACTTGCGG 
GAGATCAAACCGATTTGGATAGTGCCTGCACCACAGTGATTAAGAGATTAAGTGGTTCCGGTCAGGCAAATCCTCTGGCCGTGGCCACCGAGGAAGTGGAGAAGATGACACC 
GGAACAGAGGAGAGCGTCGGCAGCAAAAATCCAGGAGGAGGCGAAAAAACGTGAGGCAGCCGTGGATGCGGAAAAGAGCGAGAGATCCAGCAGTCCTGCCTTCGATTCCGAT 
GAGGAATCATCCGTCAACCAGGCCCAGCAGTTGATCACCCAGCAGAAATTGAACAACATGAGGAAGCGGTCATCGATTGGTCTGGTCTTTGGGGCACAGGCCGCCCTGCTGG 
CCACTCGTGGCAAAGGTAACTTGCAGAAGAGCACCACTAACTCCAAGTCGATCGAGCCCATGCACCAGTACCACCACCACCAACAGCACCATCACAACCAGAGTCCCTTACA 
GCGGGCGCAGAGCAAGGAAGAGATGCGCTCCCTGCACCACCACCAGAACCAGCAACAACCGCACCACCAGCACCACCAAAACCAGCCGAACCCACCGCTGGACAGGCCCAAG 
CGCACATCCTGCTCCACACTGTCCCAGATAGCCGAGCATGATCGTTTGGTCGACCTGTCCGCTCCTCCAACATTACTCAACACCAGTGATCCCCTGTCCCCCGTGGACTTGG 
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cacccatcgaggtccctagcgatcaagtgcatcagggtctggtgcagaccattttgccflcctcccgacgcctttcagtgccccaccccgctctccgatgactactcggatcg 
tccctttgggaatagttccggtaacagtgaactggccctgacctatgacctgatgacgaactcggagctacgctatatggacgagagctccgccatgctggccagtgtgacg 
gccaactcgacgacctccccgaatgacagcgtccaggggaaaccaccagtcctgcctccaccaccgcccgcacgccgaaatccaccaaaatcgaacctgaaccagagcctca 
gtcaaactcccagccaaaaccagagtcccagccaaagcctgagcctcaacctgaatccgaatcacaaccagagtcagagtcagagtatcgaggaggcggtggccattgaaaa 
acgtcttctcgtctcgtacacaggaatcgaggactttgctaaggtgaggcgggagagctgcattgaggccatctgcctgttggatgtgcccggaaagctagctgcggactgt 
ccccatcgccgggcctccagtcctatggagaccagcagcaaggatgccatactctattccccgatgccaccaatgccactctcacccaaggtaaagacttcaacgccacaaa 
cgggaacgccccagtcggcggcgggtgtaacaccaccggtgccactggaaaggattgaggagcgagaaacctcggacagtaataccaacaagatgccctccacctcgggcac 
caccagcagtacgggcggtgtgggtggaggcggggaacaagacggtgttggtgccaccaaaaagaacgcgggggatgacgatgccaacagggaggagaagagtttggcggcc 
agccgaaactccagcaaaagggcattcattcattcgataggaaagcacttcaagagcaagaaggctctacccctgatccttggagtgggtggacggcagaagtccaagtcgg 
agaatcgtgctcgcaaagcgtttcgcacgatctcgttcattttgggctgcttcgtggcctgctggacaccctaccatgtgttggccctcgtcgagggattctgccggcatcc 
gccctgcatcaatgaacacctctacatgttctcgtacttcctctgctacgccaacagcccgatgaatcctttttgctacgctctggccaatcagcagttcaagaagaccttt 
acgcggatcctcaagggggatctgcacatgacctaa 

(SEQ ID NO: 92) 

Start ATG : 1 (Reverse strand: CAT) 

MRSLNKSAGEEELTGLNGNSSDSI YTIKNKWITDWSLRRERDKLCSFIFKRVLTHTECKRLLEAAKI YGESSDFNSWPVNEQLQWYRNFDVNNTNLLLAVRNQSSDGGSTMS 

GSSSDSEILGPVLPPFALWQTILI AICLAICIILTVGGNILVLLAFI VDRNIRQPSNYFIA3LAATDMLIGTVSMPFYTIYVLKGYWDLGPMLCDLWLSVDYTVCLVSQYTV 

LLITIDRYCSVKIAAKYRSWRTRTRVIYMVTITWIIPALLFFISIFGWEHFTGKRDLLPGQCAVQFLKDPIFNTALIIGYYWTTLIVLFVLYAGIYKTAYDMQKRSEAKQRK 

MQSMVALSAGAMSGMAGHAAGIGVIEEKILKTKVELAGDQTDLDSACTTVI KRLSGSGQANPLAVATEEVEKMTPEQRRASAAKIQEEAKKREAAVDAEKSERSSSPAFDSD 

EESSVNQAQQLITQQKLNNMRKRSSIGLVFGAQAALLATRGKGNLQKSTTNSKSIEAMHQYHHHQQHHHNQSPLQRAQSKEEMRSLHHHQNQQQPHHQHHQNQPNPPLDRPK 

RTSCSTLSQIAEHDRLVDLSAPPTLLNTSDPLSPVDLAPIEVPSDQVHQGLVQTILPPPDAFQCPTPLSDDYSDRPFGNSSGNSELALTYDLMTNSELRYMDESSAMLASVT 

ANSTTSPNDSVQGKPPVLPPPPPARRNPPKSNLNQSLSQTPSQNQSPSQSLSLNLNPNHNQSQSQSIEEAVAIEKRLLVSYTGIEDFAKVRRESCIEAICLLDVPGKLAADC 

PHRRASSPMETSSKDA1LYSPMPPMPLSPKVKTSTPQTGTPQSAAGVTPPVPLERIEERETSDSNTNKMPSTSGTTSSTGGVGGGGEQDGVGATKKNAGDDDANREEKSLAA 

SRNSSKRAFIHSIGKHFKSKKALPLILGVGGRQKSKSENRARKAFRTISFILGCFVACWTPYHVLALVEGFCRHPPCINEHLYMFSYFLCYANSPMNPFCYALANQQFKKTF 

TRILKGDLHMT* 

(SEQ ID NO: 93) 

Name: Muscarinic Acetylcholine receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384504 

GACATACGCAGGCGGAGAATTGATTTGATTTCTGTTATTGTTGCTTATCACTATCGATTTCGATTATTCAAAGTCCCTTTTTGCCCTGCGGCGAACGGCGATTGCCCTCAGT 

TTCTGAGTICCTGCAAATCAAATGCAATTATGTCGTTCAATCGTCTGAATGCTGAACGGTAAATCGAGTAAACAAGAGTGGGCGGAGGTGGTCTGAATAATTTATGACCGCC 

CATTAATTATTACTCATAGAGCTGTCCTATGACTTATCGACTTTAAGCCAATTGGAGAGGGGGGAGTAAGGCTGGCAGGGCAACCCGATCAAAACGAAAAATCAAAGTGTGG 

GTATCATTGTGTATTTCGTAAATTAGAAGGCCAATTTGGTGGCAATTCCTGTTTGTGATACAATTCCTGGCCGCCATTGTACCCGTGGTACTTCTAGGAATTCATTAAATGT 

ATCCAATTTGCTGTCAAGGTAATTTACTTGGTGAACAATTAAGTTCAAGTAGCTATGTTTAGTCATTTCAAAGTTATGTCTTTACAATTTATTTTATATAAATACTAAGTGC 

TATTCTTAATTAAGCATCTGCACTTAAGGTTTTCTCTATAAGGAGCCTATTCAAATAGTTTTCATTAATGTTTTCATTCTTTCCCTTTCTCGATTGCAGCTCCAACTTCCGG 

CGCTTTGGTGGCGTAAAATGTTTGCCACCAATATCCGTTCGGTTGCCGTAGAAATCGTCCTGGTGGCGCCGTGAGCTCTCGTCCTTTTCCTTTCGCGAGTCCTGAATCCCCG 

TCCCTCGCTCATTTCCGGTGCGCCGTTTTTACATTTGCAATTCGAATCGGATTGAAATCGGAATCGGAATCAGAATCGGCGCGAGGACATTTCCATTCCTTCCCATTCTCTT 

TTTATTTTTTTTTTTTGCGATCTGCGTTGCATATGCGAGGGCTAATTAGCATGCGGCATTTCCAGCAATCAGAGTAGAGCGCACAATAGGAATAATAACGCGGAATGGAAGA 

GGATGTGTATGCCTCGCTAGGTGCCTACAACGACAGCGGTGGCGACGATTGGAGCAGCTCGGAGCACCTGGTCCTGTGGGAGGAGGATGAGACGCAGCGAACGACTGCTAAT 

GCCACCAGTCGGCATAATCAACTGCATGTGGCCAGGTGGAATGCCACCGGCAATGCGACCATCAGCGCGACCTTCGAGGACGTACCCTTCGATGCGAACAACTACTGGGCCC 

TATTGGCCCTCGTCCTCGTCCTGGGAACAGCGGCCGGGAACATCCTTGTCTGCCTGGCCATCGCCTGGGAGCGCCGGCTGCAGAATGTGACCAACTACTTCCTTATGTCGCT 

GGCCATCACCGATCTCATGGTCGCCGTGCTGGTCATGCCGCTGGGCATCCTTACGCTGGTGAAAGGTGAGTCCTTTGTCCCATTCAAAGTCCGGAACAGAGGTATTTTTTGC 

AGCCACCTTTTGTTGAGGGACTCAATGGACACTTGAGTAAGTCACTTGTGCGCCAGGATTGGCTTGTGCGGCTAATCTTAGTTGGAAAAGTTCCTTTTCCATAATTTCATAG 

TTTACTCTAACGAAAATGAGTTGGTGGAAAACTTGATGAAAATTAATATTAATTACTTGGGGCAAGGGAAAAAAGAGTGAGAAGTTTTGATAGTTGTTAAATTTTTAAGGCT 

TTTAAAAAGTGACGAAGAAGATGCAAGGAAACGCAAAGATGTAGGGAATATACAAAATTATTTTAAATCTTAAGATTTCTTATATTTGCAATTTAAATGCGGTAAATGCGAA 

TCAATTGAATAATGTACAGGGAATTGTGAAAAAATATGGATATTTACGGATTCCTTTTGCTGCAACTCAATCCGAATGATAATCTCTCCAAGTGGAAGAGTATAATAATCAA 

AGCCAGCGACAGAAACGTGGCAAAGACATTCAGGCCAGGACCTCCTGCGATGTCGCCAACACGAGGCACGAGTTGCATTTAATGAAGCTGCTGGTGGGAATGGCGTTGGCGG 

TGTGCAGATTGCATTATGTCTCAGATATTTCGATCAAAGCTGGCAAACGTAAGTAAGCCCGGATAGAAAAATGGAATCGCTGGTGTCTATGGGCCGGCAGGGCGGAGCGGAG 

CTCCATCGTTAGACAATTTAATAAGCGCGGCGCCTTTGTAGCCAGCCTGGCCAGGACGAACCGCTGGCGCAGACAGGGAAGCCCAGAAACAATGCCAGCAAGATGGCAACGG 

AAGCTTCCAAAGCCTTTGCAGGCGCCGGAGCAACTGCAGCATAAGAATTGCATTAGGGGCGGCAAACGAGCGCTGGTGAATTTCGCTTTGACTTAGTTCGGATCTGGAAACA 

TCTGCCGACACATTGCTGTCCTGCAATCTGCACTGAAAAAAAAAACAAAATATATTTATTCAATTATTTCTATACTTAAAGTAAAAGTCATCAAGTTTAAAGCTAATTGTAT 

AATTTTTTTCTTTCGATTTTTTCGATTGAGCAATAAAATATGGTTCATATATGTTAATACTTACATTTTTTTTAAGCTAAGTAACTTGAAGCTAGCTACACAAATATTATAA 

CATACCAGACAGGAAATGGAAAAGCTATTTCCGTGACTTTCAAAGGCGTTGCTTATTTCGTTGCTCCATTTATGTGAGTGCTTCTTCCGGTGGCAGTTAAACGGTATTTCTT 

CGATTTTCGAAAACGAATGGCTGATTTGAGGACTGCATTTATCTGAGTGCTGCTTCCGCCTCCTGCTGCCGTCCTTTTTGTGCCAACCCCTGCAGATGTTGCAACACTGATG 

ACAGTTTTGTTTGTGCGCTTTGGGAGCCATTACAAACACAAACACAAACACAAACACGAGCACGGGCACCAATACAAGCAGAAAGGGCGACGCACACACACGCAAACACACA 

CCTGGAGCAATCCTTGGCAAAGTGGGCAATGTAGCAAGAAGGAGGACCAACAAATGGCTGCTTGTTATATCAAGTGTGAAAAATATTGCAGTGGCAGTGACAGGAGCAGTTG 

TTGCTGTAGTTGCACGGGCACACGCCCGCCAACTGGAGGCAAGAAAGGACGAAATCGGGATTTGCCTGCTGTCAAGGGCTTTGCAGAGGGTCCTGCGACAGAAGCTGAAATC 

TCCAGCGAACAGCAGGCATTAGATCCCAAGGATTAAGCTCTTACAATTCAAAGGCACAATGGCGCAAGTAGCAACTGGATATGAGTAAGCCGTGCAGTACATTAATTGCATC 

GGTTACGTAAGTAACATGTGGCATACAATGTGGCATTGAAATGGCCTATATATTCTATTCTCTGTTGATAAGTATGCAAATTCAACTTTCAATGAAAGCTGAACTAATATGT 

TCATGACCAATAGGCGTATGAGTAATATTAGTAGACTAAATGTTTAAAAACGCTAAAGGTGTAATATCAAAATTAATCTAACAAAATCATTCATGGCTTTGGTTGTTGACTT 

TCTGGAGTGTAATAATAATACAAATTTAAATTTTCGTAAAAACAATTTGATCCGTACGTACACGCACAAATCTAACAAACTATTGAAATGGAATCAAATAGCTGAAATAATA 

TTTATAAATGAATATATATTTTTAGGACTTAAGTTCACTGAAGGCTTAGCGGGAAATAAGTTTTTAAGGATGAAATATTAATGCTATTATGCTAAACTCTGTTGAGTTTCCA 

GATGCATATATCCTTTGATTTTGAACTTTCAACGAATTTTCCACTTGATCAATCGACCTTTTGGAAACGAAACAAGACCCATGCAATTAAACGACATGCGTTGATGCCATTT 

TAAGCGCTTGCACTTGCAGTGAATTATGATTACGCGTGGAAAACATAATTAATGCATAATTAATTTCATTTTAAAATCAGCATTAAACGCCAGTTAATTAATAACAGTAAAC 

GCAATTACTGCATAGCATAGCCATCAAAAGCCATAAAAAAGACAGTAAATTGAATTACAACGTCCCGTCAATGGTCGTGAATATTTAAGTAAACGCGAAAAGCTCGCAAATG 

TTTGCAACTTAATAAATTATACTTGGGACGAATAATGATAAACAAGAGAACAGCCGCAAATATAAAAAGAAAAGCCAAGCGAAAAAAGGCGCGAGGCACGAGAAGCTGGAAA 

ATTAATTGAGCGCAAATTGGTGAAGCTG 

(SEQ ID NO: 94) 

Exon: 1001.. 1445 
Exon: 1885.. 2064 
Exon: 2784.. 3172 
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Start ATG: 1001 
Transcript No. : CT24036 

ATGGAAGAGGATGTGTATGCCTCGCTAGGTGCCTACAACGACAGCGGTGGCGACGATTGGAGCAGCTCGGAGCACCTGGTCCTGTGGGAGGAGGATGAGACGCAGCGAACGA 
CTGCTAATGCCACCAGTCGGCATAATCAACTGCATGTGGCCAGGTGGAATGCCACCGGCAATGCGACCATCAGCGCGACCTTCGAGGACGTACCCTTCGATGCGAACAACTA 
CTGGGCCCTATTGGCCCTCGTCCTCGTCCTGGGAACAGCGGCCGGGAACATCCTTGTCTGCCTGGCCATCGCCTGGGAGCGCCGGCTGCAGAATGTGACCAACTACTTCCTT 
ATGTCGCTGGCCATCACCGATCTCATGGTCGCCGTGCTGGTCATGCCGCTGGGCATCCTTACGCTGGTGAAAGGTGAGTCCTTTGTCCCATTCAAAGTCCGGAACAGAGTGG 
AAGAGTATAATAATCAAAGCCAGCGACAGAAACGTGGCAAAGACATTCAGGCCAGGACCTCCTGCGATGTCGCCAACACGAGGCACGAGTTGCATTTAATGAAGCTGCTGGT 
GGGAATGGCGTTGGCGGTGTGCAGATTGCATTATGTCTCAGATATTTCGATCAAAGCTGGCAAACATGTTGCAACACTGATGACAGTTTTGTTTGTGCGCTTTGGGAGCCAT 
TACAAACACAAACACAAACACAAACACGAGCACGGGCACCAATACAAGCAGAAAGGGCGACGCACACACACGCAAACACACACCTGGAGCAATCCTTGGCAAAGTGGGCAAT 
GTAGCAAGAAGGAGGACCAACAAATGGCTGCTTGTTATATCAAGTGTGAAAAATATTGCAGTGGCAGTGACAGGAGCAGTTGTTGCTGTAGTTGCACGGGCACACGCCCGCC 
AACTGGAGGCAAGAAAGGACGAAATCGGGATTTGCCTGCTGTCAAGGGCTTTGCAGAGGGTCCTGCGACAGAAGCTGAAATCTCCAGCGAACAGCAGGCATTAGATCCCAAG 
GATTAA 

(SEQ ID NO: 95) 



KEEDVYASLGAYNDSGGDDWSSSEHLVLWEEDETQRTTANATSRHNQLHVARWNATGNATISATFEDVPFDANNYWALLALVLVLGTAAGNILVCLAIAWERRLQNVTNYFL 
MSLAITDLMVAVLVMPLGILTLVKGESFVPFKVRNRVEEYNNQSQRQKRGKDIQARTSCDVANTRHELHLMKLLVGMALAVCRLHYVSDISIKAGKHVATLMTVLFVRFGSH 
YKHKHKHKHEHGHQYKQKGRRTHTQTHTWSNPWQSGQCSKKEDQQMAACYIKCEKYCSGSDRSSCCCSCTGTRPPTGGKKGRNRDLPAVKGFAEGPATEAEISSEQQALDPK 
D* 

(SEQ ID NO: 96) 

Name: 5-HT2 receptor-like 1 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384504 

TATAAATGAATTCTGAAAAATGTATAAAATGAAATGGACTCATCATTAGGTCAATTTGTTTGGAATCAAAAATTTCCCAAAGAAATGGTTTTACATTTCAGAAAGTGACCCT 
TAATTTTCGCCATCAATATTTATTAGAACATTGACCTTAAAGCAATTTGTATAGTCGTCACTTTAACAGATCAAAGTTATACTTAAAGTCGTGCAGTTGCAACTGAAACTGA 
TTCATTCTACGCCTTATCGATCCATCTTCCCCCGAACTTATCCCAATTTCTGTTGGCTATCTCGTTAGGAAATTCCTTTTCCCAGCGGCCATACGGATTCATGTTCGACCAG 
ATTCTGGATTCACAAATCCCGCGCCCAAATCTCAGGGCACACGCAACAACACAGTAAACACAAGTATCACTCACCGAATGTGTGGGAGATTCTCGGCCACACAAGCTACAGA 
GAAACAAACCCAATTCAAACTAAAAATTTCTAAATGGAAAATAAATTAATTTTATTAGTTCTTTATAAGATACTAAGACTTATACTAGTTGGAAATCTTCTAGGCATGGTTT 
GTAAATCATCCTTATATCTTTCGATATTATTCTAAGACTTTTTTCGTAGTGCACACACAATTGTATTTTTATGGGAATAGGGCGAGAGTGGTGCGCTGAGTTATGACGTCCA 
ACAGCTGTGCAAAACGATGGCCACAGCATCGGGAATAGCCGCTCATATGTGGGCCAGTTGGCCTGAAATATGTTGCATAAGCTCCCCCCTCTTGGCAATATCATTTGGCAGC 
TCACGAATGAACTGTGGCTCCCGGGAAAATATGGCCAAAAAAAATAAAAAAAAAAAGGCGGGGAGTATTAAATAAAAATGGCTAGAGAACATTGCATTGTTTGCCGCCGACC 
AAGCCAATCGTAAATCGTGTGTACACATAGGTAGAAAGTATTGCAGCCACTTAATACAGTCATAAGTCGGCCATGGAACATTTACAAGCTGACCAAGAAATACAATGAAGAA 
TCGGTTTTTTTTTTCTTTTACAGGCTACTTTCCTTTGGGCTCGGAGCACTGCCTCACCTGGATCTGCCTGGATGTACTCTTCTGCACGGCCAGCATCATGCACCTGTGCACC 
ATCTCCGTGGACAGATATTTATCGCTACGATACCCAATGCGATTTGGCCGGAATAAAACACGTCGGCGGGTCACTCTCAAAATTGTTTTCGTTTGGCTTCTGAGCATCGCCA 
TGAGCTTGCCCTTGAGCTTGATGTACTCCAAGGTGAGTAATAATGCTATTCTTTTGTTTATATTCTATATTCCAACAATGCGTCAAAATATAGTACCCTTTTGGCATTCTAT 
TTCCCTTAATCTAATAGTGAAAACCCATTAAATTCCCTAGTTAACCTAGTACTCAAAATCTAAAGTGAAAATTACAAACACTGTGCCAAGTCAATAATTAAAATCCAAAGCG 
AGACAAGTGTCTGAAATATTTTGCGCATACATATGTCCTTTACTTACCCAAATTAATTCCGTGCTCGCCCTGCTATCTTTCCTTTGATTCCAGAATCACGCCTCGGTGCTGG 
TGAATGGAACTTGCCAGATACCGGATCCGGTGTACAAGCTGGTCGGCTCCATTGTATGCTTCTACATTCCACTGGGCGTGATGCTGCTGACATATTGCCTGACCGTCCGACT 
TTTGGCCCGGCAGCGCCAGAATCTGGGTGGTGGACAGCAGACGGCGGCGGCCACTCCCGGATGGGCCAGTGGATGGCTCGGCCAGGCACCGGCCTTGGGTAAGGATGCTAGA 
CACCACACTTCTGACCCCCACTCTCACCCAGAAGTGTCCACGAAGAGCTGCTGCTTCTGCTGCAGCTTAATATGATTTTCAGTTTGCCTAAAATTTGCATTTTGTTCTCCGG 
ATTGCACTTTTTGCACTGAAAACTTTACACGACACTCGACAAAAGTAGAGAGTAGGAAATGGGCGGATCAGGATCAGCAAGGGGAAAGTGCTCAAAAGTGGGGAGGGGTGAC 
AACACTAGACAAACAATGCAGAGACATGTGCCGGATACTCCCCGAATCCGTGTCCCTGGGCTTTCTGGAGCCCAAAGTGCGACTAAGCCGCAGCAATGCACACAAAATCGAC 
ATTGTCGTCGACATTGAACCGGATGGAATGGGGCGGGGGGGGGGGGGGGCCAACACCCCAGGACAATCACCCATTCCACCCACTTGCACACACAACCTTACAAATAGGATGA 
AAATCCAGCCCATATGTGTAGATAGCTAATTTCCACGTTGCCATGAATCGAGGGCCAGCTACGTAAATGGCTTGTAATGCTCTAAGAGCTCTTCCATAGTGCTCGACTAGTT 
TTAAAACCGAAGTTTTAAGCACCAACTTTTCAACGAAGCCTCGTTACAGCCCCGTTGCGTAGGGTAGTTTTTAAAAACTCAAGAGCTGCGTTAACTTTTAGTTTTATTAACC 
ACTTTTTGAAATTTCCATAACACTTTTCAGCGCATCTACAAAAAAGAGGAGGAATCTTTAAACATGTAGCTAATAAATAAAAAGTGAACAGTCTGTTATTTTAGATTTAAAG 
AGTCTAAATTGCTTCCTGCATCCCTAATTACCCCTCTCGTTACTAGCCCAGTTAACTGGAGCTTTTGGCTAAAAACAATCCAAGATCCTTCGCTTCCATTATCCCTTCACAA 
TTCAATAGACCAGATTCAATTTATTTTAAGCATAACTAAGTGGCTCCTAATGCCATTGTCTGCCCGCAGAATTTATTCAGCGCTCCCATTTCTCATCCAATTGGATTGCCTT 
GAGCCCCAAAGGGGACAGAGAGTAATTTTCTGGGGGACAAGCTGTGGATCTCTGGATGTTGAAAGTCAATACAGTTGGGGAATATTTAATGGAATGCTTCTGGTGGTGGCGA 
ATGGTGAATGCTGAATGGTAAATGCTGATGGTGATGGTGACTGCAGTCGAATCGCGTGCACTGCAGGGCATTTTTGCGAAAATCGTAGCGTGCATTTAGGGGCGATTCGCAG 
GACGCGGTGTCCCCGATTCGTCTAAGTGGATTTGAAGCCTCAGCTGCAGGCATTAGGGTGTGTATGTATGTATGTGTGGGTGCGCATCTGTGAGCGGCCAAGTGCGACTGTG 
TTAGTGCCGTGGATTTGGGCAGTTATTTTAAGCCAAATCTCTGCTGTAAAGCGTCCGCATTTCGGTCAGGCACGTAGTCGGATTTGCCGTATTGTACTTTTTACACGCAGAG 
AAAAATCTTCGCATTTATTACACAAAGAATCAATGTGTCTGATTAATATAAGCAAGATATTTAATGCAAGTTAGAATAGCTTCAAGCACTATTTCATAAGGTCTGAACTGGA 
CTAGGAGAAGAATTTCTCTCAGTGCCTATGTTGAAGGGGCATTTAAGTGTGCTCGTGCGAGTTTATTAGCGAATTTGTGACTTTTCGTAGCGCACTATTGGACTGAATGCAA 
TTTAATGGAAACTGTATATCCGTTTAACCCCGTGTTCGTGCTCCACAGAACGACGATGCACCTGGCGGCGCCTTTTGAAACCGGGTCCAGGTAACGCCTCCTCGGTGCTGCA 
CGCACACTCGGCCAATTCAACGGATACGGATCTCAGCACTTTGGACAACCACGAGCTCTGGCTGCCCGATTCCAGGTGAGCTATCGAGTTTGCGAGCCAGTGTTGCAGTGCA 
AATTACCATATATATTATTCATGTTTATGTAGCATAAAGGAGCCAACGCCCACGACCATGACGGCCTTGCATCAGTTCGGTGCCGAGATGCTGAAACTATCCCGAGGCCTGG 
AGTCGGTGGCCTCGTCATCCACCACCGGCTCACCCACCAAATCCGAATTTTCCCTATCGAACCACCTGCAGCTGCAGTACCCCAGCAGCCCACAGCGATATGCCGCCACCCA 
CCAGCAGTCCCACCACCACAACCACCAGCAGCATCCGGCCGGAGTTTACCACCAGCAGGCATCGCCCAAAGGTCGCCACGGAACCACAGTTCTGGGTCTGTCCACCACCACG 
CTGGGCATCGAGCGGGAGAGCACCAGGAATTCCCTGGCCAGCAGTCGCATGGGCGGCGAACAGAGCGATGGGTGAGTTCATAAATAGTAATATAAATAAGTGAGTCCAAATA 
ATGGGGTCTAAAAATATACTATCTAAAAGTCTGGAAGCCAACCAAAATTATGGTTACAATTTTCTCAAGATGTAATATGATGCTCAAACATAATTAAATTGCTTCACCACTT 
AACCAGAATCTCTGTTGTCACAAATAATATTAACACAGACAATTAAATTCTTTTAACTTTATCCCTTTTGCGGGCCACAATTTTCACTAGAATTGTGTCAAATCCCTTGAAA 
AGCATAGACAATTCGGGCAGGTACCTTGGAAAAACGTAAAGTTTTCGTGGCTCTTCACCTGACAAAAGGACAAACAGCTGCGATGGGCAAACCGTTTTGCCAGAGTGATCCT 
TTCTAGTCGCTCGCAGGACCTTTTGTTGTCGCTGTCTCAGCCGGTTTTCGAGTGCCAGTACTGCGCTTGGCCAAGTGTGCTGACCACACCAAAAGCCAAGAAAATGCTCACC 
TCGAGTCCTTTTTGTGACTCGTTTACTTTCTCCTTTTCCATTTATTTAGTTTTCTTTTTTCGTTAGCTCATTTTTGTGGCATTGCCTTTTATTGTCCTTCCGCCTTCGACTT 
GTTGTGAAATTTTTCGTAATAATTAATTTTTTTTCGCTACGTTATCCTTTTTCTTTGTGTCGGTGCAAACTGTTTTATTTCTTTTTTTTGGCCAAAAGGCAGAGGCACTGCC 
TTTTTATTTGGCATTCGAGCCGCTCAAGTGATTTTGGCGTCGACTGAGGCCTTTTGCCAAAATGTGTTTTTCATTAATTTCCTAAATTTAGTCTACTGGCTTATGCTGGGAT 
AAGTGATGCGACGAGAAATTTAATTAAAAAATTGAGATAATAGAAGACGAATATATTTTGAAATTACCGAGAATTTAATAAGAACTGGAAACATGGCTAAACTGGCATATTC 
AAGTAAATAGGGCGCTTCATGTTTATGGGCATTCAGGTGGAACTAATATTAACTATTTAATTGCGTATGATATGAAATTCTATGGAAGAGCCTGGCAAATTCTAAAAATACT 
TTTATTTTAGGCACTCAACCCAATTCGATATTGTACACGCATTTTATTTTGATTTAATTGGTTATCATATTTAGTCTGCAAAATCCATCCAATAAAAGCATGAATTTTTCAC 
TGCAACTGCGAACATTGATATCTTGAATCCCCCGTTCGATTTTAAGCTTGAATAAAAGGCAGCTCAACTTGCGAAATGACTGCACATTTTATCTCGCTCATATTTTTCCCAT 
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TTAACCAATCCCTTATGCATGTTTATATCATATATCATTGTGftCTGTGCGCCTTTCAGCCAAATGCTTGGTATTGGGTATTGGGAGCftCTATGTGCCACATTGTTTGACAGT 

TGAACGAATTTGCACAATCAATAAAATGGATTTGGCTTTGGCTCCGGCTTCCTATTGTCCGCAATGGATGGGATGCCACTTGGCATAAAGGACGAAAAGAAACGGGACGAGG 

TCAAAGTCGAGTGGTTTTTAATTGTTGCACAATTTGCCGTCTCATGCGGCAAAGTTGACAAATGATGCTGGCGATATTCTAATTTTTGAACTTGGCTGGGAGACTTAAAATC 

AGCTCAACACTTCGCCACATAAGAGTATATATTATATGTATGTGTATGCGAAGGCGGTATCCTGCGATGTCCTGATGAGCTTGGCAGCTGTTTAATTCCTTTGTGAGGGGGA 

AACAATAACAAAGTTGGCATCAGCTCGGCTTTGTTTCACTTGCATAATGAGTAGCTGGCCCCAAATACCACCTCCCCACTCCCCCATTGCTTAGCCCTGTTCCAACGCATCT 

CAATTTATGCGCACGTCTCGTGCAAACAAATCAAAAACGCGAAGGATGCATAATGAGCTGACATCCTTATCCTGCGTCTTGGCAGCACCGAGCATCCCGATCCCTGTCCTCC 

CTGCAGCCACCACCACTTGGCCTTCTGTCTGCTAACGCCTTCATTAGCAGCTAACCACGCCCCCAACGCCACCGCCCCACACCGTCTTGGCCAAATAAAGTTGGCGCGCATT 

TGCATGCAGGGCAAAACGGGCGGCAGGACAGAAAACCGCAAAGGCAGCTCGCAAATTTATGCGCTTTATTAAAATGAATGCAAAATGAGGCGGTACAGAAAAAAAATCGATG 

TACAATATGTTTATATATAACTTTAAATTAAAGCTACACATATGAGCTAAGTATTTCTGCAATTAATAATATAAAATTAAAAAAAAAAAAAATCCCGCCAGTAAAGTACAGT 

AAGCCGAACTAACATACCAAATATAATATAGTAAATAATAATGCAAGAATTTTTCAAGTGTACCACAAACAACGGAGCAAGAAAAAATGTTAGAAGGCGGTAAACCACAAAG 

AGCAAACACAAAAAATGTTGCCTACTTCCCGGCAAAACAGGTGAAAGAAACGAGCTGACCACAAAGCGTGCTGGCTTTTGTTCAATTGCCGTTAGCGAGTGTTCACTGCATT 

GTTTTAGGCACCTCACCTTCGAATAGAGTTTTGCCAATTGGCTGCGCAATTTTAATTAAAAATGTACGACGCCGAAGGGGAGAAGGTGAAAGAAAATGGCTCGGCATTGGGT 

ACTAAATCAATAAATCGGTGGGGTATGACATAGACATATTTTATATATTTTCATTATTAAAACCACATGGTTCCTTTTGTTCTAGTCTGCTTTAATTTGATTCAGGAAAATC 

GTTTAATAGACCACTTTTTTAAAATGAAATGAAAATAATCATTTAAGGACTTTGACTGTAGTATGTCATTATATTTATTTAATCAGTAAAGTTCTTTTCCTTTCAGGACTCT 

TTCACAGCTTTCCCAACGATTGCGAGCGTATAAAAAGCGACGAAGAGCTTCGTCAGCTGTTCCTGGACGTGAGAAACGTGCCGGGCATGAGGACGCAGACGAGGATGTGGAA 

ACGCCCACGTCAACGCTGCGCCGCCACAAGCGCCATAACAGTCTGCCCAAGAACGCACTCTATCCGCGACACACCACTGTACAGGAGAGCCTCGATGACGACGACGACGAGG 

AGGAGGAGAGCGCGCAACAGAAGGATCAGCAGACGAAGTCGAAGATCTGCCACTCGGACACTGAGCTAGATCCTCCGCAGAAGGCGAAGCAGTGCCACATAGCCGGGCAGAA 

TCATCAGTCGGACTACCTGCAGCTGCCCTCGGTTTGCACTTGTCCGTATTTTGGGGACAGACCGCTCCAGAATTGTGTGAAATCGGCGGAGGTGAAGATTATTTCGTCCGCC 

TTCCGAGTGACCACCACCACGACGGCGGTGAGCAGTTCGCCCAGCGAGATGGAGCTACTCATGTGCAGTAGTGGCAATAAAAAGTCATTGACCTCCAGTGTGAGTGCCGGCA 

TAACAGGTGGACCGGCAGGAGCGGCTCCAGCCACAGTTGGCTCCACCCTCAGTCCACACTCTGCTCACAATCAGGGCAGCTCGCTGACAGTTCAGAGCGATGGTAGTGGGTA 

CTTGGCTGCTCCGGGAACTCCTTGTCCTGGCCGACGAAAGTTGAGCATCTCAAAGACCGCCTCGGTGGTCACGTGGGATTCAAGTCGTCATCGTCGGCGGGGCAGCAGTTTT 

GGCGGAGTGCGAACATCCCTGCTGTTGACGCCCACCAAAACAGCGACCACCTCCAACTCATCCACACCGTTGAGAAGATCGGCCACCTTGAGGAGTCACCAGAATATGAACT 

ACCAGGGAGCGGGCGAATGTGGTGGCAAGACCAGGACGACAACATCCrCACCCTGCATGTTGCAGCGTCAGCAGACGGTGCGATCTCATCACTCGCGGAACTCCAGCGTGAT 

CTCCAGAAACTCCTCGCGCCACGGCAGGATCATCAGGCTGGAGCAGAAGGCCACCAAGGTGCTGGGAGTGGTGTTCTTTACGTTCGTAATCCTCTGGTCGCCATTCTTCGTC 

CTCAATCTCCTGCCCACGGTGTGCGCGGAGTGCGAGGAACGAATTAGCCACTGGGTCTTCGACGTGGTCACCTGGCTGGGCTACGCTAGCTCCATGGTGAACCCCATCTTCT 

ACACCATCTTCAACAAGGTGTTCCGGCAGGCGTTCAAGAAGGTCCTGCTGTGCCGCTATTCGAGCACGAGTGCCTGGCGACCGAGCAGATAACAGACGCCCGTTAAACCGGG 

CAAACTGAAGACGAATACCCACAAGTGCGCCAGTGTGGATTTTCAGGATTCCACTGCTCCGGTGCGGGGCGTTCTGAAAAGAAGTGTGGCAGTGGCAAGTGACATATAAATA 

CGGCAAAGGCCGGTCATTTCGCCTGCGGTCCACCGCACAACGTGTCGTATACGCAATCTCGAAAGGACGATAATGCCCACCCGCACTCCACACCACGCACTCCGCACTGCTC 

ACACGTGTGTCCTTGGGGCTTAAATTCTTATGTGCCTCTAATTAATTTCAAATTCAATTTGCATGCCATTGAAAACACATGTCCCAGGCGTAAGAAGTGGGGCCAAGTGAAT 

TAGATATTGAAGTGGAGTCGGATCCCCGCAGGATTGCCAAGGGTCGGGGATTAAACTAGCCACAGACAACAGCATAACGAGCATGCTACTCACCCCTGGTGGTTCCTTCCAT 

TTCCCTCAGCTCTCAGCAAACGGTGAAGATGCACTTAATTGCGGAATGTTTGCTTTCGAGCAACGTAATACCGGCAAACACATTTACTTTATAGCAACGCTCTTCAGTTATC 

GACAAAGTTCTGGCTCACCGAAAGTTCATGTCAATCTGTTGATATTTGATAGCATTTTCCGCGAAATTCCTTTTAGATTCACTACGTACTCGAAAATAAACTTGAAAATTCA 

CGTAAATTTAAAGATTGTATATATAAAGTTCAACTTTTGAAATTCTTAAAAATAAAGCGCCAATGTCGCTTCGCAAACTTCGATCGATAAACGCTATTTGGTTTGGCATTTG 

GCAATTATGCATTAAATTTATGGCAAATTCACTGCCCGCCAATGCACAAAGAATTTCATCGGTATCCATTAAAGAAACCGACACGTACACAAATCCACGGCAATTTGTGACT 

TTTGrTGTTTGTTTGTGTGGTTGGACCGGCAGATTGCTATTTTTATTTGAAAAGTTCATTCATTTTGTTTTATTTCTGTGGCCA 

(SEQ ID NO: 97) 

Exon: 1001.. 1264 
Exon: 1550.. 1778 
Exon: 7511.. 8268 
Start ATG: 1001 

Transcript No. : CT24060 

ATGAAGAATCGGTTTTTTTTTTCTTTTACAGGCTACTTTCCTTTGGGCTCGGAGCACTGCCTCACCTGGATCTGCCTGGATGTACTCTTCTGCACGGCCAGCATCATGCACC 

TGTGCACCATCTCCGTGGACAGATATTTATCGCTACGATACCCAATGCGATTTGGCCGGAATAAAACACGTCGGCGGGTCACTCTCAAAATTGTTTTCGTTTGGCTTCTGAG 

CATCGCCATGAGCTTGCCCTTGAGCTTGATGTACTCCAAGAATCACGCCTCGGTGCTGGTGAATGGAACTTGCCAGATACCGGATCCGGTGTACAAGCTGGTCGGCTCCATT 

GTATGCTTCTACATTCCACTGGGCGTGATGCTGCTGACATATTGCCTGACCGTCCGACTTTTGGCCCGGCAGCGCCAGAATCTGGGTGGTGGACAGCAGACGGCGGCGGCCA 

CTCCCGGATGGGCCAGTGGATGGCTCGGCCAGGCACCGGCCTTGGGTGGACCGGCAGGAGCGGCTCCAGCCACAGTTGGCTCCACCCTCAGTCCACACTCTGCTCACAATCA 

GGGCAGCTCGCTGACAGTTCAGAGCGATGGTAGTGGGTACTTGGCTGCTCCGGGAACTCCTTGTCCTGGCCGACGAAAGTTGAGCATCTCAAAGACCGCCTCGGTGGTCACG 

TGGGATTCAAGTCGTCATCGTCGGCGGGGCAGCAGTTTTGGCGGAGTGCGAACATCCCTGCTGTTGACGCCCACCAAAACAGCGACCACCTCCAACTCATCCACACCGTTGA 

GAAGATCGGCCACCTTGAGGAGTCACCAGAATATGAACTACCAGGGAGCGGGCGAATGTGGTGGCAAGACCAGGACGACAACATCCTCACCCTGCATGTTGCAGCGTCAGCA 

GACGGTGCGATCTCATCACTCGCGGAACTCCAGCGTGATCTCCAGAAACTCCTCGCGCCACGGCAGGATCATCAGGCTGGAGCAGAAGGCCACCAAGGTGCTGGGAGTGGTG 

TTCTTTACGTTCGTAATCCTCTGGTCGCCATTCTTCGTCCTCAATCTCCTGCCCACGGTGTGCGCGGAGTGCGAGGAACGAATTAGCCACTGGGTCTTCGACGTGGTCACCT 

GGCTGGGCTACGCTAGCTCCATGGTGAACCCCATCTTCTACACCATCTTCAACAAGGTGTTCCGGCAGGCGTTCAAGAAGGTCCTGCTGTGCCGCTATTCGAGCACGAGTGC 

CTGGCGACCGAGCAGATAA 

(SEQ ID NO: 98) 



MKNRFFFSFTGVFPLGSEHCLTWICLDVLFCTASIMHLCTISVDRYLSLRYPMRFGRNKTRRRVTLKIVFVWLLSIAMSLPLSLMYSKNHASVLVNGTCQIPDPVYKLVGSI 

VCFYIPLGVMLLTYCLTVRLLARQRQNLGGGQQTAAATPGWASGWLGQAPALGGPAGAAPATVGSTLSPHSAHNQGSSLTVQSDGSGYLAAPGTPCPGRRKLSISKTASVVT 

WDSSRHRRRGSSFGGVRTSLLLTPTKTATTSNSSTPLRRSATLRSHQNMNYQGAGECGGKTRTTTSSPCMLQRQQTVRSHHSRNSSVISRNSSRHGRIIRLEQKATKVLGVV 

FFTFVILWSPFFVLNLLPTVCAECEERISHWVFDVVTWLGYASSMVNPIFYTIFNKVFRQAFKKVLLCRYSSTSAWRPSR* 

(SEQ ID NO: 99) 

Name: 5-HT2 receptor-like 2 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384673 

GACCTTTAAGCAGACTAAGGGGTTGAGGCAATCCTTTTTTCCCAACTCGTGTCAATTAACTCATTAAACTTCCCACCGTTTTGTGGAGCGTCGGACCCAATTAGCTTTCAAT 
TATACCGTCTATGGGCTGGTTTTTCATAATCAAGCAAAATTTCAATTAATTAAAATGTAACCGAGCTAAAATGGAGGTGTGAGAAAGAGCCTTTATTACGAAGGACTCTATT 
ACACATTAAAGTTTTATGGAAAGAATGAATATTCTCAGTGAGCAAGACCAGATAAGCTGCCAACATATTTTCTCTATCCTGTCAGCTTCATAAAATGTTGGTTTTGATGATG 
GAAACTTTTGCCCATGAAACTTTTTGTTCGCAGATTATCGTGAAAGAGTTTTGAGCTCCGTTTCGCTGTATTGTTGGCTTGCCAAACTGTTGCATACTTTTAGGAAGTTGCA 
AGGAGCGGAACATGTGTTTGCCATGTCAACTTTTTGCCATATTTGTAGCCATAACAGCTACACATCTCCGAGGAAGGTATTTTAGGAATTTTTTTTGAGTACATCAAAGCCG 
ATAAAGAAAATCCCCCAGACAATGGAAGCTTACCGAAAGAAAACCGCAATGATAATTCCCCAGAGAAAAACGCCGATCGAGTACACTGAAGTAAATAGACCATTTATTTAGA 
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TGCTAGAATTTCTTGTTTATACAGAGAATTAATCCATTTGCACATTGAGTTACATTCAACAGCTACAGTTGGAGTTCATAGCATGGAGTTGGTTGGATTGTTAATGGGGCTC 
TCAGCTGCTCACATATACGACATTTAACAGTTTAAAACGATTGACCTAACTTATGTTGCTTGTTTGCTCCGCTTGGTACTTGGTTTCGTCCTTTCATCGCACTCCAGCCTTT 
AATAATTAGTACTAGCTaACTAATCGCACTCGCAGATCTGCGACACGGAAGTGGAAAGTTCAATGAAAGTTCAGTCTCTTAAGATGCACTTGCTAGGGATTTGTCTACACCG 
AGTTCTCCTCGAGTCCTTGCTGGAATTCGAGAATGATTATTGGATGGTCTGTATGAGTGAGAAGAGCTGAATTCTGGTTCTCACCCTCATCAGGGGACTAATGGCCCGGGGC 
ATTACGCTCAGAGTGCTGCCGGAACTCATGGCATTTGCCGCCAAAGTGGGCATCGTGTGCAGGGCATTGTTGGAGCCCCGGTGGAGTGAAAGTGGGGCATGCTGACCCACCA 
GCGTGGTGGTGGTGGCCGAACTTACGCACGATTCCCGCTTGCCACGCCCATAGTAGCTGGTCAAAGGTCTGCAACGGAGTGGATGTCAAGATATCCCGTCACTTCTTAGCCA 
CGATTGCCACATAACTTACCGCATACTCTCGGCTCGGGGGGAACCACCGCCTTTCGAGAAGCTTTTCGTGGTATAACTGCCAAGTTTACGAAGAAATGCGCAGAAAAAGTTA 
GTGTACAATAGTGAAAATATTTTTAATAGCTGCCATTCCTAGTAGCAATCTAGCATTGCCACACTTTATGATAGGTTTAACATTAAAGGCATTTGAGGAGAAATATGTACAA 
CTTAGTTGTTTCCTATATAAGTTTGCAGTTTTCTCTTCTCGAGTAGTGTATTTATTCTTAAGTGTACTTACAAGCAGACTTAAAAGTTTGGACTACTTACCGTCGGTTCTGG 
TTAAGCTGAATGGTGCGCGTGTCCCGCCACGTGGAAATGTGGTGTCTCAGTGCATTTCGCACCTCCGAGTTGAGGAAACAGTAGAACAGCGACACCGAGAATCCCTGTGGCA 
CACACACAAACAGATCTTTAGAAAAAAAGTGGCTAAGAGGATTGACAAACGAAACCCACCTGAGTGCTGAGTAATACCGCTCGCAGGACGGCAAACATGTGTCCCATTAGGC 
CAGATTCCGAGGGACCAGCCAGCACCACCAGGTAGGTGATGCCAAAAAGTGGGATTAGCACCAGCAGTGCCTTAGCAGCTTTCCTATACTGACGAGTCTCCACTGTATTGGC 
CGAGCGCAGCTTTGTGATTAGAACCTAAAAGAGAAGCAACCCGATAATCTGGCTCAGTATTTTCGGCAATTTACAGTCGGAAAAATGTCGGATGATAAATTGTAAATTCATA 
CAATAGCTTTAAAGTGTTTCAAGAAACTAATTGGGTGATTCATAAATGTGAGTTTTTGCATCCATGTTAACTTGTTCAATCTTCTCTGATGTTCTTTTTTTCGCATATCAAT 
ATTTTTTCTGTGTAAATTCCATACGCACCCACATGATACGCAGCAGAAATGTGAGATTGATTATTAGCACCGCACAGACAGGTCCCTGGTATATCCAATCCACATGGGTCTC 
CTGCATCCAGGGGCAGTTGATTTCGTACTGTCGGATGGGAAACAAAAGGAAGACTACTCTCCAGGGGTCTCTTCAAATCCCAAAATAGACATACCTTTTCGGGGGTGCTGTA 
GGTGACCGTCAGACTCTTGGCCACAGCCCAGGTGACAACAAACAAGGCCGGGCCACCTGCCAGCAGGAGAAAGACTTAATTGGCTGGCACACACAATCGGGCATTCACACTT 
ACCCCAGCCTATGGAGGCGTAGATATTAAAGCGTAAATTGTCCCCGGAGAAAGTCTTAACCACCAGCATGTACAGGTACAAACCCTCGACCAGCATCCAAAAGAAGTTGGTC 
AGGGTGAAGAAGTGGAACAGGGTGATCAGAGCAATGCAGCTGCCCACTCCACTTCGGATGGAGATCTGTAGATTAAGTGGTAGCCATTAGTCTTAAGCTGTCAGCGTCTGTT 
TATAGTTTCCTTTCACTTTGGACTAGCACCCCTTTCAGGCACGAGTATGATTTAATAAGCCAGCAATTGCAATTATCTGGCAGTCTCATAATGCTCACCTGGACGGATAACA 
AGAGTATCCAGAACAAAGCCGACATGATGTACGTGAAGAACAAGTTGGCGTGGATGGTGTTGCGCAGGCAACGCAGCTCCCTGGTGAACAAAGACAAATACAAAGGACGGGG 
GGTAAGTCGGGGGTTAGGCAAAAATAAATGCCATACCCAGGGTATCGGGGTCACCAGGGAGGGGGCTGGGGTGGGTACACTCACTTGAAGTAGGCGAAAACAATCAGCGCCA 
GCGAGAGCGATACCAGGCTGAGGGTGTATCCAATGTAGTAGATAATGGTGGGCAGCTCCACGATGACCTCGAACTCCGGCACGGATTCTGGGGCGGGCAGGTGGGCGCAGGC 
ATCGTAGTTGGTGTACTTCTCCCAAGTTCCATTGGCGTGGCAGAACCTCGTTGCATTCTCTGGTCAAATACAAAAACGCACGCAGCGAAAAATGCAAACAGTTTTTAAATAG 
ACTAGTCCTATACACTTGGATCTATATTGGGAATATGCCTTCTCTGTTTGGACATCGACTTAAAAAATTCCAATTTAAATTCATAACTTAGTGCAAAGCTAACAGTTTTTTT 
CAAACGCTTAGTTTAAATATAAGTCTAATGTTAAAGTATAAGGTCAAAGGGGTACTCACTGCTGCTATCGTAATGAATTCCCTGCAACTCGTCCATGCACTGGAGCACCGCC 
AGGGTACCGCGAGCTGTCCTCGGCCAGCACAGGATCGAGTCGAACTGGGTGAGGCAGTGGCCACTATCGTTGCCATAGGTGGATGCCTCGATGTGCTCCTGCACCAGGCACT 
GCAACTCCACGCTCTCACCGATGCCATCTAGGTTGTGCAGATCCAGGAGCGGATCACTCCCGCTGGCGTTCACCGAATCGATGTGGTTGTGGTCACTCATTCTACAACATCC 
TGGCACTTGGCATCTCTGGTACTGAAATAAGTCAGTTTAAAGTTGAGGCAAAGATTGTAAAATACTTGAAGCAGTGGGAAAATTTAAAATATTTTAGAATCATTATAGAGTT 
TTAATACATTTCATTTAAGAACCGTCGGACAATAACGTTTTAGAAAAATGGTAAGCTTTAATCGGAAATAAAATATTATAGACTGATACGAAAATATTGCTGTCATTTCCTG 
ACCATTTCCCTACGAATTGACAACGTTCATAATACTCCCCAGCTGAATTCGTTCAATCAGAAAGTTATAATTTCCTCAATTGTTAGGGGATTCCCCGGCGGTTGGCTGAAAC 
TAAGCTGTCGGGCAAAGTTCAATTGCCAACCCTTCTGCTCGCTCCACCGCAGCCAGTGGCATTTCGCCTGTGTGCGTTAATTGCATTGGCCATGTTTCACCGACAACGGGCC 
ACGCCCACATCGAACTCGGCTTAAAGATCCATCAACTCGGCGGGAAGTATGCTTTAAGCCAAAGACCATTTCTAGGGAAAACAGGGTGGCTAGAGATAAGTAAATGCACAGG 
CCCAGTGCCATAAAAATTCATTAGTTTTGCCATCAAAGGCGGACAGAAGCGTCAGCAAATCAGAGAAGCGGCAGGCAACCTTTTATTCTGATATGTGCCCATAAAAAGCACG 
TCAAAGGACCCAAGTCTGGGGTGATGGTGATAGAAATTAGAAAAGCTAACTTCCGGCCAAAGATTGTGGCCTTTGTGGCCTGGTTCTGCCTGGTTCTGCCTCAGAGACGAAA 
TCTCGCATTTATGTTCTGATAATAAATGGCATAAATGATGCATCGTTACATTAATGCACGACATAAAACGGGGCGGCAAGTAAGACACACACACAAACAATACATATTAAAA 
ATGTAGATGGGCCTGTTTGTACAGTGAGCTAGAAAAGTAAAAGTGCGCTCTTTAAAATTTAAGCGGAAGTGGCTTTCACTTTCTACTTAACAGTGAAATAAATTAGGCAAAC 
AAGCCAGCTTGAATTCCAAGGAGAAGCCATAAGCCAAGGCAGC-AATTTAAATCCACTCGGAGTGAAAATGGTTATCTCTTAAGTGATGCCCAACCCGAACCAATTACTTTCT 
GTGGAATTTCCGATTTTCTCGGACGGCACATTCCCCAGGCGATTTTCTAATTGGCACAGGAATTTCTAACCACAAGCCATGAAATATTAAACCCGAAGGGAAATTGAAAATT 
CTGCCCCAAGGCTTGGGTGCTCCACTTCTGGCTGCCACTGTATGCTCGGCTTCGTATTTTTCTTTCCATTGTTTCTGCTGTTTGTTTTTCTCGTAAATTAACGCAATGCATG 
TACATAAAACAAAGGCTGGCTGATTATTACTGCACGGCACATGCCTGCACGTCCTTTCCGGAGTTTGCCAAGGTCCTGGGTCCTGGTTCATTAGCCGCTCGCTTACTTATAA 
CACATACGCCCCGTTGCCTTCATTTTCCAGCTTCGCACGCTCACAGCAACGCAGATGAGTTAAGACAGCAATATACGCGCCGCAATTTTAAGTTGAAAAGCATATTATTTGC 
AAAAAAGGGAAAGAACGGCGAAGGACCGCCTCTAATTACCTCATAAATAACGTCGCACGAAATTAGCTGGCCTAAATTATGGCCATAACTGCTAATTGCATATTTGATTTCA 
ATTAGTTCAGCTGACAGGGGCCCAAAAACCCAAGAAGGACACATTTGAAATCAGCGTGCGAATGCAAATACGTTCTGACAGCTCGCTTGGACAAATCTCATTTATGGCCAGC 
CAATTTGACGAATGTCCGCCCGCCGGGGAAACAGGCAAAACAAATTCAGCTTGGCCGAATCATCGCCGGCATAAAAAGGACATTACCAGAAATACATACATATACGGAGGCA 
TAAGAAATGGTTGGTCCTGGACGAAGACCAACGGACAAATCGCACATCGTCCTTTCTGCTCCTCTGCCTCTTTGCAGCATGCTATTATAGAAGATTGTATGTAATCTCGTAC 
GATTGCACACAATGGATTTGCCAAAGGCGACACAAAAAAAAAAAAAAAAAAAAAAATAGGGGATTGCGTGGGGGGAGTTTGCTTGTGGGTCCTTTACTTTTTTTCTCCCCCA 
TGTGAGAGGTTCTGCAATTTCCTGCTGCTATAATGCTTATAAATGTTCACCCGCAATCGCCGGGGATTTCTTTAGGGAATTGAAAACATAATAACAGTGCGAATAGTGGCAA 
GCATTCGCATTTTCCACTTATCCAACTAAATATGCAGACTAAAAATTAAATTCCAGTTTTTTCAAGCCCTCGAAGGGGATTTTTATTTTACGCCCTTATATGTGTACCTATG 
CAAAAATTCCGCAATTCCGCGGGACACCGGGGAAAAGTGCTATTTTCGGTTCTCATTAATGTAGCTCATTAAATTTTATATGCATAAGCTGCCAGCTGCAAACAAAAAATGT 
AAGCATTTAAAATTTATAAGCGTGCCACACAAGAGACCAACAGCACATTACGTATACGCAGCATGGACAAAGCCAGCTTGGAAAGCCAGCACGCAGGCATAAGCGTATAGAA 
AGTTTGTAATTAAATTGCTCATTTAAGCGTGGAAACAATGAATTCGATCGGGAATACAGGGAGTGGGGAGTGTTCTGAGGAAATATGTAATATTCAGCGTGTAATCCGAAGA 
AAATATATTACTAATTTTATTCATCCGCCTTGGGCGCTTAAATTTTATTGTTCGGACTTTTGTGGACTCTCCTCGGAGGCCGATAAGGAAAAGCCAGTCTAGACAAGCCAAC 
ATCATAATTTGCAGGCACGCACTGCCATTTCCATTTCCGCCCAAGCTTGAGCCTGATCTGCTGGCCACTTTCCATGCTTAAGTCAGCACATATCCTTTAGCAGGAAGATGGA 
ATGGGATGGGATGGGGCTGACACTTCAGCGATTGCCAACAAGTGTTGCGTTTTATTGTCTGGCCTTTAACAGCTCTCCTTTGCCATCCGCCCATTGTGTGTGCGGCACGTGC 
AGGCGCCTGCATTTGGAGCTACGTGAGGGATCTCCACTCCTGTGGATAGTCCGAGCATAAAACCATAAAAATCCGCCGCAAAAAAAGAAACGAACCTGGCCACCAAATCTGT 
TATTTATTGGCCCAATAAAAATTTGGTTTAATGCATTACAAGTTGCCGAAATGCCACGGAGATTGCCGGAGATTTACTTGGCTTGCAAAGGGAAGTCCTTAAGGCCATGGCA 
ACAAGTCCTTCCTGGTTAGCAGTTAGTTTATTAGAGTGGTCAAGTGCATGGAGTGGGGGGCATTTCGGATTTCGGGCTGAAATTTTCCCAATCGATACTGCAAACACGCAAA 
ACAATGCCTCTGCTGCCAACTAATATCCATACTCTCGGCTCATTCAGCCACAGAGTTTTCGTTTTAGTAACCGCTGGCATTGTTTTCAGCAGGTGAGCGTGTGTATGCACTG 
GCATTTTAATTAACCACAATTTGTTTCTGCATTCCAAACGCCCTGAAAACGCTTACCTTCTACTCCTAGCCAAGTTTCTATCTGCAGAATGAGATTCCTTTCGCCACTCACA 
CTCACAATTTGCGTTGGAATTCTTGGGATTACTTGCGTTTAGCACTCGGACTACGGACCGCAAAAGCGCCAGGAGGCGGGTCCTTTGTCCTTTCGACCACCTACTTGACCCG 
TCCTAAATGCGTTTAACGCCTTCCAAGCTGCGAAATGATTATGTTAAAGCGGTTTTTGCTGGAGCGGTTATTTTTGCTAGTTGCCTTGGTTATTGACACACGCCCATTTACG 
TGGGCAAAACTTTATTGAGTTCACATTTATATGCGGGATTATTTGTTATAGCGACCCTTGAACTTATTCGAAGCACTCAAAGCCAACGACTAGAAAAGTTATTCGACATTAA 
TGCGCGGCACTTATTTGTTATTTCTTTTGCGTTTGCTAATTAAAATTTTCATTTTTCTCTTCGCACATTATTTAAAAATTAGTTTCCACTTTAACGGCTCTGAGGAGGACAA 
TAGGTTTTCGCAAGTTTTGAAATTCCGAAACGAACCGCACTCGTACACATCCCGCTGATAACTGCGGGGATACCCGAAGGATGTGTCTTAACAAGGCTCGCCTTCTTACCAA 
AAGGCGCCAATTCTGGGTTCCACAAAGTGAAAAGCTCGCAAAGCTGCATTCCACAAGCTGTTGAAAAAGCTTTCCGCCGGGCAACAGGTGGAACATCCTGACGGGAGGTTGG 
GTAAAAGGACCCCAAACAGAAACCGATTCGATTGCACTTCTAATGCCTGAAAATCTGCCTATTTGATTTATGCAAATATGTGCCGGAGTCTCATGTTTGCCCAGCAATTCGC 
TGGCTGTTTGTTTAGGGCATGCGGCAAAAGGGATACCCCGGACTCAACAGGATTCGGCAGGATTCGCCAGGATATTGTTTGTGAAACTCCTTGCAGGCCGTCTTTGAAGTCG 
CAACCTGATTTAATTGTGATTTTCGGAGCCCGTTCAAAACACCTCTCACGTGGGCCTAATCAAATTAAACTGAAGTGGCAGCGCCTCGGGTCTGACAGTTATTGAAATATCA 
ATATGGCTAGGCTCCGGGAAAATGGTTTCCCGGAAAATGTGCGAAATTAAATTGTAGAGAAAACAAGGGCAACATTTTTACTCACTGGCATCTTTTCTCATGCCTGCCAAAT 
TTTATATCACAATGATAAACATTTAGACGGATGTCCTGGCTGCTTGGAATGCGTGCGACACATGAGGGTTGCCTGTGTGGATTTTTGGGCGCAAGGATATGGGCGGGGTGGT 
GGGGGCGATATCTCCAACCAAACTGTTTGATTTTTAAACAAATGCTAATGTCAACTGCCACTCAGAGGCATTTAAATTCCAATTTAAAAATGCTTTCTCTCGATGCCAAAAG 
CAAAGACATGTTCGTTGTCCTGTTTCGAGTCCATCCACTCGTTCTTTTTCTCTCAGATTCTCCGATTCTCTGACCCGTTTCTCCAGTTTTCTGCTCGCATTAAAGCGTTCGT 
CCTGTCTCATGCATATTTAATGTCACACG 
|SEQ ID NO: 100) 
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Exon: 7877.. 7337 
Exon: 3829.-3532 
Exon: 3307.. 3109 
Exon: 2992.-2899 
Exon: 2753.-2579 
Exon: 2520.. 2447 
Exon: 2379.-2269 
Exon: 2040.. 1852 
Exon: 1784.. 1669 
Exon: 1420.. 1364 
Exon: 1300.. 1093 
Exon: 1027.. 1001 

Start ATG: 3796 (Reverse strand: CAT) 
Transcript No. : CT24513 

CGGTTCGTTTCGGAATTTCAAAACTTGCGAAAACCTATTGTCCTCCTCAGAGC^ 

AGCAAACGCAAAAGAAATAACAAATAAGTGCCGCGCATTAATGTCGAATAACTTTTCTAGTCGTTGGCTTTGAGTGCTTCGAATAAGT^ 

CCCGCATATAAATGTGAACTCAATAAAGTTTTGCCCACGTAAATGGGCGTGTGTCAATAACCAAGGCAACTAGCAAAAATAACCGCTCCAGCAAAAACCGCTTTAACATAAT 

CATTTCGCAGCTTGGAAGGCGTTAAACGCATTTAGGACGGGTCAAGTAGGTGGTCGAAAGGACAAAGGACCCGCCTCCTGGCGCTTTTGCGGTCCGTAGTCCGAGTGCTAAA 

CGCAAGTAATCCCAAGAATTCCAACGCAAATTGTGAGTGTGAGTGGCGAAAGGAATCTCATTCTGCAGATAGAAACTTGGCTAGGAGTAGAAGTACCAGAGATGCCAAGTGC 

CAGGATGTTGTAGAATGAGTGACCACAACCACATCGATTCGGTGAACGCCAGCGGGAGTGATCCGCTCCTGGATCTGCACAACCTAGATGGCATCGGTGAGAGCGTGGAGTT 

GCAGTGCCTGGTGCAGGAGCACATCGAGGCATCCACCTATGGCAACGATAGTGGCCACTGCCTCACCCAGTTCGACTCGATCCTGTGCTGGCCGAGGACAGCTCGCGGTACC 

CTGGCGGTGCTCCAGTGCATGGACGAGTTGCAGGGAATTCATTACGATAG<^GCAAGAATGCAACGAGGTTCTGCCACGCCAATGGAACTTGGGAGAAGTACACCAACTACG 

ATGCCTGCGCCCACCTGCCCGCCCCAGAATCCGTGCCGGAGTTCGAGGTCATCGTGGAGCTGCCCACCATTATCTACTACATTGGATACACCCTCAGCCTGGTATCGCTCTC 

GCTGGCGCTGATTGTTTTCGCCTACTTCAAGGAGCTGCGTTGCCTGCGCAACACCATCCACGCCAACTTGTTCTTCACGTACATCATGTCGGCTTTGTTCTGGATACTCTTG 

TTATCCGTCCAGATCTCCATCCGAAGTGGAGTGGGCAGCTGCATTGCTCTGATCACCCTGTTCCACTTCTTCACCCTGACCAACTTCTTTTGGATGCTGGTCGAGGGTTTGT 

ACCTGTACATGCTGGTGGTTAAGACTTTCTCCGGGGACAATTTACGCTTTAATATCTACGCCTCCATAGGCTGGGGTGGCCCGGCCTTGTTTGTTGTCACCTGGGCTGTGGC 

CAAGAGTCTGACGGTCACCTACAGCACCCCCGAAAAGTACGAAATCAACTGCCCCTGGATGCAGGAGACCCATGTGGATTGGATATACCAGGGACCTGTCTGTGCGGTGCTA 

ATAATCAATCTCACATTTCTGCTGCGTATCATGTGGGTTCTAATCACAAAGCTGCGCTCGGCCAATACAGTGGAGACTCGTCAGTATAGGAAAGCTGCTAAGGCACTGCTGG 

TGCTAATCCCACTTTTTGGCATCACCTACCTGGTGGTGCTGGCTGGTCCCTCGGAATCTGGCCTAATGGGACACATGTTTGCCGTCCTGCGAGCGGTATTACTCAGCACTCA 

GGGATTCTCGGTGTCGCTGTTCTACTGTTTCCTCAACTCGGAGGTGCGAAATGCACTGAGACACCACATTTCCACGTGGCGGGACACGCGCACCATTCAGCTTAACCAGAAC 

CGACGTTATACCACGAAAAGCTTCTCGAAAGGCGGTGGTTCCCCCCGAGCCGAGAGTATGCGACCTTTGACCAGCTACTATGGGCGTGGCAAGCGGGAATCGTGCGTAAGTT 

CGGCCACCACCACCACGCTGGTGGGTCAGCATGCCCCACTTTCACTCCACCGGGGCTCCAACAATGCCCTGCACACGATGCCCACTTTGGCGGCAAATGCCATGAGTTCCGG 

CAGCACTCTGAGCGTAATGCCCCGGGCCATTAGTCCCCTGATGAGGCAAGGACTCGAGGAGAACTCGGTGTAG 

(SEQ ID NO: 101) 

Start ATG : 575 (Reverse strand: CAT) 

MSDHNHIDSVNASGSDPLLDLHNLDGIGESVELQCLVQEHIEASTYGNDSGHCLTQFDSILCWPRTARGTLAVLQCMDELQGIHYDSSKNATRFCHANGTWEKYTNYDACAH 

LPAPESVPEFEVIVELPTIIYYIGYTLSLVSLSLALIVFAYFKELRCLRNTIHANLFFTYIMSALFWILLLSVQISIRSGVGSCIALITLFHFFTLTNFFWMLVEGLYLYML 

VVKTFSGDNLRFNIYASIGWGGPALFVVTWAVAKSLTVTYSTPEKYEINCPWMQETHVDWIYQGPVCAVLIINLTFLLRIMWVLITKLRSANTVETRQYRKAAKALLVLIPL 

FGITYLVVLAGPSESGLMGHMFAVLRAVLLSTQGFSVSLFYCFLNSEVRNALRHHISTWRDTRTIQLNQNRRYTTKSFSKGGGSPRAESMRPLTSYYGRGKRESCVSSATTT 

TLVGQHAPLSLHRGSNNALHTMPTLAANAMSSGSTLSVMPRAISPLMRQGLEENSV* 

(SEQ ID NO: 102) 

Name: Diuretic hormone receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013385198 

CATCGATTAACCACTATCCCATTTATTGCCTGCCCGTCTACACAAATCGGCACGTGAACGAGGACTGGTATCGACTTTATAGGACCTACGACACCGAGGGATTTGTTTTCGG 
GCAGTTTTACGAGCGTCTCCAGCGGTACGAGTTGGACTGAATTAATGCCCTTTTTCAGAAAGAATAAGATTTAGACAAAATCTGTGATATCGATACTGTGGTAGCGTTGCAG 
AGATCCCAATCGCGTGTGTGGCGAAGGACGCAGACAGATAACACATTCCTCCGTGCTGAAAATCAATTATAATTAGTCGTGGGAAACAACTAAAAGAATATAAAAATGGCCC 
GTTGGTTTATAAAAAATATAATACTTTAATCTGATCTCTTATAATGTAAACTATCATGCAATACTTAGGGACGAACGGATGTGGTCTACATGTATGTCGTATACGTATATTT 
TTGTGTGGTGTTCATTCATAGTGTCCAGTGGGTATAGGACTTGTTGATGTCAGTCTACAATAAACGATAAACAGAAAAAGGAAAACCCCAATCAGAAACATAAGAAAATAAG 
TAAGAATAAAAAGGATATCTAAAGCCAACCAACCATGCAGAATAAACATAACATTAAGAACTGATGTAGGGAAATGCACGGTTTTAAGTAGATCACTTTAGTTGGCACGAGT 
GTGTTAGTATATTTAGAGAACGGGTGCAATTTAACTTTGGAATTATTTGGATGGGTGGATGTCAAAGAAGTAGAAAGCATATACATGAGCATTGCGTATACGCACCGTAGAC 
TTTCGGTTCGCGATCTCAGCGAGTAGTCTTTTGAGGCGCGATGCCTGAAATGGAAATTAATTGGTTAAATGACAACAACAACAACAACAACATCGACGACAATAGCGAACAA 
CTACAATATCAGCCATAGCCACATCAACTAGGTAGTGCAAATAGTGAAAAGCCAATATGTCGACAGCCAGGGTTATATAAATATAATCCAACAACAATAATGTGCTAGAATG 
CCAAATGAGTGCAGTCGCCCCTCAATGATTGGTTAGCCCACCTGCGATTCTTGATGGAACTGTTCCGATGGATATTCCTGCTCTCCCGCCACCGGGTGAATCCATGCCTCAG 
CGTCTGGCGCACCTCTGAGTTAAGGAAACAGTAGAACAGAGCCACAAAGAAGCCCTAAAGAATGGATTTTCGAAGATCAATGTTTATGGAAAGACTGTCACCTAATCTATAT 
ACCTGCGTGCTTATGAGAAAGGCTCTTATGGCCTCGAAGAGATTACGACTGATACCCTGTTCCGGGCCTGTTAGCACCAACAGATAGGTGATGCCAAAGAGAGGTATCAGCA 
CCAGCAGCGCCTTCGAGGCCTTGTAATACTGCCGCGTTTCCAGGGTATGAGCAGAACGTAATTTAGTGATGAGTACCTGTACGATATGATATAATAGCATGAAGCTAACTCA 
CTTAAGCCAGAAGTTAACCGGGGGACTCACCCACATGATGCGTATGAGGAATACTAGATTAACCAGCAAAGCCAGTGATGCAGGTACCTTGAATATCCAGTCAATGTGAGAT 
TCACGCATCCATGCACAATCAATTTCCAGCTTTTGAAAAATCAGAAGAGGTTAGCAAGCATATCTAGGTAGGTGGCAATTGGTATCATACCCCATTGAAGTGCTCGTTCTCG 
AGATGCGGAGCAAATGCCTTGGCAATGGACCACACCAAAATGCATACGGCTGGACAGCCCCAGCCGATGAGGGCGTAGATAATAAAGCTAATGTTATCACTGGAGAATGTTT 
GCACCACCACCGTGTACAGATAGAGGCCTATAAAAAGCAAATGATAAAACCGAGGAAATTCAATTGATACCGCCTTTGAACCCACCCTCCACAAACATCCAGAAAAAGTTGG 
TTAGGTAAAAGTACTGAAACATGATTACCAACGTTATGCAGCCAGCCTGACTAGACTCTGTGGTTATCTAAGTGAATAATTAATTAGTGGAGTGGGATATATATATTCCAGC 
TTATCTTACCACTTGCAGGAACAGTGTGAGTATCCAGAGGAGTGCGGATGTGATGTAGGTGAGGAACAAATTGGCATGAATGGTGTTTCGCAGGCAACGAAGATCTCTGAAG 
AGTTTTAAGCATGAAAATGATGCAACTCATGTGTGGCATTACAACTCACTTAAAGCTGAGGAATATGATGAGAGCCACCACCAAGGTGGCGAAGCTCAGGAAATAACCGCCG 
GCATATATGATGGCCGGCAGTTCGACGTTGGGTGAGAAGTCGGGCACCACCGGTATGGAGCCCGAGTTCTGGTGACAGCGGTCATAGTCCGAATAGTGATCCCACGTTCCGT 
TTGGAAAGCAAAAGCGGGTGGCATTGTCTGCAAATATATTGATTTTATTTAAGATCTAAACCCAATCTGGAATACCCACCTGTGGTGTCGTAGTGCACGCCCTTGAATTCCT 
CGAAACAGGGTAGTACAGCCAGACTGCCGGCGTTTGTGCGTGGCCAGCACAGGACCGAATCGAACGAGCTGGGACATTGTAAAATCTGTAAGAAATAACAATTGAGAATATA 
CAATGATTGGCAAGGTAAACTTCGTTGAGCATTTACTGCAGTATAATTCGTGTTTCCCCTCTGTAAACGCAACTGACCTAGCCCAAAACCCATTAAATCAGAGTACATAAGA 
TTCATGCCCCGTTTCTGACTCGGTATAATAATGATGCCTTCATTGCGGATCCTGGTCATTTATAGCTCTGACGCATGATTTTCTTTAATCAAATTTAGTTTCCAGACTTTAC 
ACACACACATACGCAGCGTGAGCCAAATATCGATGTAGTCCCAAAAAAAAAATAAAGAAAGAATCACCAGCTACCGCTTTATTTATATATTTGCAATGGCGTGCTAAACAAT 
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TTCGTGCATACGGATACACAAGTTATRTGTACATATATGTATATTAACGTTAAGTATACGACGACCCGGCGATTGTTTACCCAGTTCAGTTCGTAGCGCCAAATTAATCGCG 

ACGCGCTATGGTAGCGGTGTTAGAAACAGAGCTTCATTGATATTTTTTTTATTGACCTTATCAGGTGACGCCGAGAGCGCGGAGCGGAGACTCGTCGAAGAATTCACCAAAT 

TGGCAATTCTTTTATGGCCGGTCTTCCCATTACTGATAAGCCCGAGAACCTTTTTGCGACAGCTAATGCGATCGGACTTAGCCACGCGACAAGTGAGCTGTCAACGAGTAAG 

CCAAACACTGGAGACTGGAAACTCCGGCAATATTTGTTTCTTTGCTCTTCTGATTTTCACTCATTAGCCTCGTCATTTGGACACAGTCATAGTACGAATACATTTTATGACA 

TTTTCGAGGAACAACGCCCTTGATCGTGCATTCGGTATAAACAAAAATGGCTAAATGCAGCCTCAAGTGGATTTCCTTGGGTCTAGTTTTGTTTACAGGCGAGAGGAATACA 

TAAATAAATATGTGCTGCATTTTCCGGAAATAAAATGCCAAAATAATTTAATCCGAAAGCGATCGACAGTTTACTTGCCTGGTTTTTCATCGTATCGTACTTAAAGTGAAAG 

AGATTGACGGGCGGATGGATTGAAGCCAATTATTTCGGAACAGAAGATGGATCAATTAGCGGGTGCGATTAATTGGGAAATTCCTCAATCGGCAAAAGAGATGAATTCATTG 

GCCAAACAAAGGGTAATTAGGGAATATTTATTGGC 

(SEQ ID NO: 103) 

Exon: 2731. .2708 
Exon: 2549. .2432 
Exon: 2379.. 2178 
Exon: 2122. .2025 
Exon: 1971.. 1878 
Exon: 1819.. 1659 
Exon: 1597.. 1487 
Exon: 1420.. 1235 
Exon: 1174.. 1001 

Start ATG: 2731 (Reverse 3trand: CAT) 
Transcript No. : CT24959 

ATGAAGGCATCATTATTATACCGAATTTTACAATGTCCCAGCTCGTTCGATTCGGTCCTGTGCTGGCCACGCACAAACGCCGGCAGTCTGGCTGTACTACCCTGTTTCGAGG 
AATTCAAGGGCGTGCACTACGACACCACAGACAATGCCACCCGCTTTTGCTTTCCAAACGGAACGTGGGATCACTATTCGGACTATGACCGCTGTCACCAGAACTCGGGCTC 
CATACCGGTGGTGCCCGACTTCTCACCCAACGTCGAACTGCCGGCCATCATATATGCCGGCGGTTATTTCCTGAGCTTCGCCACCTTGGTGGTGGCTCTCATCATATTCCTC 
AGCTTTAAAGATCTTCGTTGCCTGCGAAACACCATTCATGCCAATTTGTTCCTCACCTACATCACATCCGCACTCCTCTGGATACTCACACTGTTCCTGCAAGTGATAACCA 
CAGAGTCTAGTCAGGCTGGCTGCATAACGTTGGTAATCATGTTTCAGTACTTTTACCTAACCAACTTTTTCTGGATGTTTGTGGAGGGCCTCTATCTGTACACGCTGGTGGT 
GCAAACATTCTCCAGTGATAACATTAGCTTTATTATCTACGCCCTCATCGGCTGGGGCTGTCCAGCCGTATGCATTTTGGTGTGGTCCATTGCCAAGGCATTTGCTCCGCAT 
CTCGAGAACGAGCACTTCAATGGGCTGGAAATTGATTGTGCATGGATGCGTGAATCTCACATTGACTGGATATTCAAGGTACCTGCATCACTGGCTTTGCTGGTTAATCTAG 
TATTCCTCATACGCATCATGTGGGTACTCATCACTAAATTACGTTCTGCTCATACCCTGGAAACGCGGCAGTATTACAAGGCCTCGAAGGCGCTGCTGGTGCTGATACCTCT 
CTTTGGCATCACCTATCTGTTGGTGCTAACAGGCCCGGAACAGGGTATCAGTCGTAATCTCTTCGAGGCCATAAGAGCCTTTCTCATAAGCACGCAGGGCTTCTTTGTGGCT 
CTGTTCTACTGTTTCCTTAACTCAGAGGTGCGCCAGACGCTGAGGCATGGATTCACCCGGTGGCGGGAGAGCAGGAATATCCATCGGAACAGTTCCATCAAGAATCGCAGGT 
GGGCTAACCAATCATTGAGGGGCGACTGCACTCATTTGGCATTCTAG 
(SEQ ID NO: 104) 

Start ATG: 1 (Reverse strand: CAT) 

MKASLLYRILQCPSSFDSVLCWPRTNAGSLAVLPCFEEFKGVHYDTTDNATRFCFPNGTWDHYSDYDRCHQNSGSIPWPDFSPNVELPAIIYAGGYFLSFATLWALIIFL 
SFKDLRCLRNTIHANLFLTYITSALLWILTLFLQVITTESSQAGCITLVIMFQYFYLTNFFWMFVEGLYLYTLVVQTFSSDNISFIIYALIGWGCPAVCILVWSIAKAFAPH 
LENEHFNGLEIDCAWMRESHIDWI FKVPASLALLVNLVFLIRIMWVLITKLRSAHTLETRQYYKASKALLVLIPLFGITYLLVLTGPEQGISRNLFEAIRAFLISTQGFFVA 
LFYCFLNSEVRQTLRHGFTRWRESRNIHRNSSIKNRRWANQSLRGDCTHLAF* 
(SEQ ID NO: 105) 

Name: Diuretic hormone receptor-lixe 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013385192 

TTTCCGCTGCCATCTGCAGCTGTAGCTGCCTAACAAATGCAATTAAAATCTTTGCGCAAGAAAATTGTTGCAGCAGGCACGCAGAGATACTCGTACTGTATGTGCTGCAATA 
GTTTGGTGGCAGCAACGGGGCGGCAGATAACATATTGCCCACGTCCGGCTGAAATTGTACGGACAAGGCGAACTCAAAAAAAAAAAAAAAAAAAAGGAATCCCTCAAAAGGA 
GAACCAAGAAAAATGGGATTTATCTCTGGCCAAGCTCCAGCCATACAAATCTAATTTAGACCACACGACGGCTTCAGATATTTTATATTCCCCACTCAGTTGCGGTCCATCC 
TGATCTCCTCCAGCTTGCTTGATGAGCCCAGCCGTTGGCCAAATGAAATTATTTATGGGCAGTCCTAGGTCAGTGAGAATTTGGTTTTTTAAGGGTTTGAACTTTGCGGCTC 
AACAATGCCTGTGATTTTTTTTTTTTTTATTATTTTTTTGCATTGCATCCATTAGAGAAAGCACTCCTTTGGCATTCTACTCAGCTACTTGCCACATAAATATCTACGGAGG 
GTAGCTAACTAGTCCTTATCTCAACTTCCCCATTTATCCTCGCCGCAAGTGCACTTTTCCTGCTGCTAATTAAGTTGCAAGGCAGCTTTCTCAGCAGGCGTTGCACGAAGTA 
GATAAAAGGAAAAGCCCCCGATTTGCAGACACTATGCTGGGAAAGATATATATATATATATATTTTGCCAGCATTCGCTAATCGGCTTAGGGCAGCCACCATGCGACCACAA 
CAACAACAACAACAGCAGCAGCAGTGTGCTCCATGCAATATTTAATCACATTAGTTGATGGGGTAACGGCCATGTACGCAACAATTTACAACTTGCATTAATTATACGTCTA 
GTTGGACAATGGCGCCTCTAACTATCCCACTCTAAATGTATCTCTATCTCTATCCTTATTTGCAGGGGACACCCGTTGATAATAAATTATTAATGCATCTGCTGGGAAGGCG 
GCACTGCACGGCTGTCCTTGGCAGCAAATCCAGTTGCAACATCGGTGAGTGACGGGCGGCAATCACAAAGGACTTGCAACAGTTGCCGTTGCTTCTGCTATTGCTGTTGCTG 
TTGCTGTTGCAACAATGGCCAGTGGCAACAATGAAACTGAGCCGCTCTACTGCGGCAGCGGCATGGATAATTTTCATACAAGGTAAGCGACTTTCACCAATGTCACGATCAA 
GGCCAAGGGGCTTTTCCACCAGACATGGCGACAACAATACCATAAGCCAACAGTTGGGGGCGTGGTCGGGGGCGGAGGCGATGGTGGCAACATCATAGCCATTAGCTGCTGG 
CGCAAGGGAACCGTTCAAAAATCGATTATCGCCCCATTTCGGGGGAGCTTCTATTTTGATTTGCCGTACAATTTTCTCGGGCGATTAAACGACGAAGCAGAACGAAAACAAA 
AAACAGATTTGTCAACAGCAAGGTCAACAATTGA'1'GGCTGAAATCAATTTAATTGACCATATCCTACGGGCCCTCCAAGTGGCCAfCTGCTGCACCTATAAAAAAGTGAATC 
CGGTCTGCGATTATTTATATATTCGTTGCATGGCAGGCGGTCGTAAAACCTCGAGATGATGATTAAAAGCGGCCCTAAAAACTTAATGGCGGTTTAGGAAATTCAATTCCTG 
TAATTTAAGCCGAGTCACCATTCTTCGAAGTTCTTACATGTAAGCGATAATAAATAGTTAAGTCAATTGGCCAATAAACCTATTAATATTGTGCATTTACCACGATTAGACT 
TTGATTAAAGTGACAATGCTGATTTCTGTAGAGGAAATCTAGTTCTAGTCTTCCCACAAAGCTATTTAGTTACTCTTGAATAAATATGTTACTTTTCTTTTGCCAAAACCAA 
CAGAATTTTAAATTTAATAATTTGGATTTTTTGCAATAAACTGTACTGATTAATGGGCCACACAAAAATGTCTAGTTTATTATGGAGCTCTTGGTTTCATAAATTAAGAACA 
TAATCCAATCGGCATATAAATCATTGATAGCAATTTATTTTCCGTGATGAAACTGTGCTCCGTGTGAACTGCGAATTACTCATTCTACGGTTGCAAAAAAAGCCACCAACGG 
TCAACATTTAGACCAGGACTTTTAGTTTTAATTAGAGCCAGCCTGGCCAACAGCAGTGTTAATGACCACAAAGTGGCTGGCCACAGGATCAGCATCCCAGAATGCGATGCCG 
CATTTGCTTTAATTAAAGGTAGTAGCTGGAGTTTGAAAGATGACTGTATGGCAATTAGATGTGTAGCCAGAACACTTGGCCATTTACTTTTGTGTCAAAGTCGTGCCAAATT 
GCCAGCGGAGGCGACACTTGACGCTGTCACGCCCCAGACAGACGCAGACCGGCCCAAAAGCACCCACTCAGCCGTCTCCAGGCGCCACTCAAGCGGCAAAGGAACGCCAAAA 
CACTAGGACACAGAACGCCAGAAGACTCGAAAAAAAAGTATAAAATTCAAACGCGTATACAATTCAATGTGTCACATCGAAGAAGAACCCTTTCTCTGGCGGCAACGTGACG 
TATACGCAATATTAATGGCAAAGCGAAAAGGGCCTCAAAAGCTAACGAAAAACCGCAGCATAACACGCACAAAACAAGCAATTAACATAAATAATAAAGAACGCACAAAGGC 
GTTCGGCTCTCCGTTGCCAAAGAGAAAAGCCAACGAAAGGAATAAAAAACGCCAAAAAGCCCGAAAAAAAAACCGAAGACCCAAAAAATGTGTAAAATTGATTTCGGAAAAT 
ATTTAATTTACCAAATGCATGCTCGCTTTTTGCGGCCGTGTACGCCCCGAACAAATAAAACAAATAGGCACACATCGTCTGGCATTTCATGTTAAAGGCTTCTCATGCTGAT 
GGACAGGATACAGAAAATATATATACGAAGTGAGGTGTGTGTCTACGTGCCACAAAAACAGCGAAATTGAAACGCTTTCCGAATGCCGGCGACACTTTTTGGCGGCACAACC 
CTCGTCTTGTCGACCAACTCCTGGGGAATTTACAACTCCATTCATCAAACTCACAAAAACTTTGCCTGGTGTACGGTCTTGGGTTCGTCGTCGCAGGATTTCACCTGGCTAA 
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AAATCCAGACTCCCAGCCATCCCTGAGTCTAATTTATGGCCTTTGATAATCCATGTTGATGAATGGCGGCAACGACAACAACGTCGACGACATGAATGAAGTCCTGGAATTG 

TTTTGCACCAGGATGGCATCGGGGCTCCAGGTGGGACGTACTGGCTCAAAGTTATTGGCCCAGAAATCAGGCATAGTTAGCTGCCGAAATGAAACCCAAATACCGAGAAAAC 

TAGGCAAAACAAACAGTAGTACACCGGAAATGCATATCATTGTAAAAACTACATCAGTTTACCTAAAAGGCTTGGCTTTTAAGCTTTCACATTTATAAAATATTGAAAATGC 

ATATAAAAGTATGAAATTAATTCCCTTTTGTCAATAAACTTTCTTTCTTTCTTTCTGTGTAATATGGGGGATACCGGTTTTTTTTTTTTTTCAATGAAATCCCTTCGAAAGG 

TATAAGTTCAGAATCGAGAGTTTTATGCCAAGTTGGGCACAGTTTTTTTTTTCCCCAGCTACCTAAAATAATAGAGACATTTTCCTCCCACTACAACTGATTGCATTGCCGG 

TGCAGAAAGTrTTTTCAGTTGGTTCGGAAAAATTTGGTTCGCAAACAAATTAATATGAACTGGCAAGCATTTTTCGGGCAAAAAGCTCTCATCTATGTAGATTGGAATGGAA 

ATTCCGGCTAGAATTGCATAAGACCACCTGCAGTGTGGGCTAACATGACTAAAAAGTTGTCCACAAATTTGGCTTAGATTCTCCAATAAAACTGTCGTTCGGCCAGGAATCC 

CCTTTTTTGTTTCGAGTGAATGGGGAATTTCGCACGACAGACAGCAATAAAGAATTTAACTAAAGTCCTGACACCGACAGCACCAGCAGGACGCACACGTGTCACTCCATTT 

GGAGAGCTTGGAGTATATTAAACATTTTTTCCCCACCAGTCAGCCGCAGGACTTGCATCGGTCTCGCCTCGCATrTTCCTATATAAATTTTATGCTAAGTCTAATTTGTTGG 

CTGCAACTTGCACAAAGGCAAAAAATAAACAAGGGCGAAATGCCGAAAGCCAAAACCCAACCGAAACCGTTGAGGGCTGCCTCGCTTTTTTCCTGTGCCGAATTCCCTAAAA 

CTTTGCACATAAATTTGAGTCCTGCGCCTGGGCTTTTCCTCTTCCACCTTTTTTTTTAGCCTCAAAGCGCTCGACGAAAACTAAACAAAAGCTGGTAAAATGTTTATCCCAG 

AAGGGGGCGGrGTGGGCGTGGCTTGCTTAAGCCCTAGACACCCGAGAGTTTATTGCCTCAGTGTTGCAAACGCTGCTGCTTGGCAAGTCAACGGAAATGGCTTTCGGATGGT 

TTTATTGCCTACTTTTGCTGCGAGAAAGCGGACTAGACATGGAAGTAACAAATTGGTTAAGGCTAACTAGAGCATAATAACAGAAACATCAGAAACATAATTTGGGAAATGC 

CTGCCACTCACCGACAACATTTCGATTTGTGTCTTTATTACACTTCTCTGCCGTGATGGGTTATAACTTTTGGATCCTTGCAGAGAGTTGGGAAATTGCTTAAAAATAAAAT 

TTGTTTAATTTAATTTGAGGTGGTTTTTCAAACAATTTTAATGGATTAATAAGTTTAGGAAGTTAAGTATATACATCTATCTATTATACCATCTTCAAATATTGTTGAATTG 

AAGAGATAACrCGGAGCTAATCCCATATTTTTCATTTTTTTCAGCTACAAGAACATGCATGGCTATGTTTCGCTGGTGGTCTGCATCCTGGGCACCATCGCGAATACCTTGA 

ATATCATTGIGCTAACCCGACGGGAGATGCGCTCCCCCACGAATGCCATACTCACGGGTCrGGCCGTGGCCGACCTGGCAGTTATGCTGGAGTATATACCCTACACCATACA 

CGACTACATCCTGACGGACAGTTTGCCGCGGGAGGAGAAGCTCAGCTACAGCTGGGCCTGCTTCATCAAGTTCCATTCGATTTTCGCCCAGGTTCTGCACACCATTTCCATT 

TGGCTGACGGTGACCCTGGCTGTTTGGCGTTATATAGCGGTGGGTTATCCGCAAAAGAATCGCGTATGGTGCGGTATGAGAACCACCATAATAACGATAACCACCGCTTATG 

TGGTGTGTGTTCTGGTGGTGTCGCCGTCGCTCTATTTGATCACGGCTATAACCGAATATGTCGATCAGTTGGATATGAATGGCAAAGTGATAAACTCCATTCCCATGACCCA 

GTACGrAATCGATTATCGTAATGAGTTACTGAGTGCCAGGACGGCTGCCCTGAATGCCACGCCCACCAGTGCACCACTGAACGAAACTGTGTGGTTAAATGCGAGCACCTTG 

CTGACATCGACAACCACCGCTGCACCACCCACGCCATCGCCAGTGGTGCGAAATGTTACTGTCTATAGGCTATACCACAGCGATTTGGCGTTGCACAATGCCTCGCTGCAAA 

ATGCCACATTTCTCATATACAGTGTAGTGATTAAGCTGATACCATGCATAGCACTCACCATTCTGTCGGTTCGATTGATCCTGGCCTTACTGGAGGCCAAGCGGCGGCGGAA 

GAAGCTCACCAGCAAGCCCGCCACTCCGGGTGCCAGTAATGGAACCAAATCACCGGCCAATGGTAAAGCAGCGGATAGGCCCCGGAAAAATAGCAAAACTCTGGAAAAGGAA 

AAGCAGACGGATCGCACCACGAGAATGCrGCTGGCGGTGCTACTCCTCTTTCTCATCACTGAATTTCCACAAGGGATTATGGGTCTGCTGAATGCTGTGCTCGGAGATGTCT 

TCTATCTGCAGTGCTACCTAAGACTGAGTAAGTAAATATGTACTATGCATAACGCCACAGTAAATTCCACAGATTCCCATTTCACTTTAAGATGTAATAACCCAATGCCTTT 

TGCCTTTTGCCACCAGGTGACCTGATGGATATCTTGGCCCTGATCAACTCCAGCATCAACTTCATTTTGTACTGCTCCATGAGCAAGCAATTCCGCACCACGTTCACGCTGC 

TCTTTCGTCCAAAATTCCTGGACAAGTGGCTGCCGGTGGCGCAGGACGAAATGGCAGCTGCTCGAGCTGAACGCTCTGCGGTGGCACCGGTCCTGGAAAAGGGACGACAGCA 

GCCGCAGGTGGTGATGGCCAGCACGACCACCAACATCACGCAGGTGACAAATCTGTAGCACAGGAGGAGTCGTGGTCGCCGAACTTTGCTCAGTCGCCTGCTGAGTGTCCTG 

AAACGCGGCAGAAGGCGCTCCTCCGGCGAAGGAGGAGGCGTGGGAGGAGGTGGTGCGCCGTTGGCCGGCAACGATGCGGTGGAACCAGCGTTCCAGGCCATCGTGGTGGTGG 

TGGACAAGGTGAGCGGTGCCACGGAGAATCAGCTGTACACCGCCGAGCAAGCTCGTATTGTGACGTAGTAACCCTGATAGTAGTACATATTATATGCCTAGTTATCCTTGTA 

AAAAGTAAACTAGTTGTGTAAGAATCGCAGATAACCACAAGTTATGTGCATACTTTGGTGGGAAATCGTAAAACCCGTACAGTCATAATAATAAAATGCTGGTTCATCACTC 

ATACGCACCGTTTGACTTCACTCTATTCCACATTCACAGTCAAGTAGGAGCTGTAAAAATGCTGTGCATAAACGAGCTCGTATACGCTACAGCTGGCGTCACAATAATAGTT 

ACCATTGGGTTGGCTTCACAACATAGATCAAAGTCAAAAAATATATAGTTTTGTAAAATGTTTCATAAATCTATCTTCATATTTAACTAACACAATTCAGAAACTTTCGCTT 

GCATGGAAAATGTTTTATTTTCCCCACTTGACCCAATTGTGCGCAACTGTTCAGCTTTTCGGCATGAATCAAATCAAATATAGTAAACTTTTGCATCTTTTCCACTTTGATC 

CCATGCGTTTTCATGTTGATTTCATTCGCATAAAGTATGCAGCACACGTACTCACTCAAAACTACGAATTTTCGGCAACGATTTGAGCAGAATATTTGCCATTTGTTGTACA 

GATTATATATACACACACACACACTTATGTCCGTATCTTTAATAATGGCAATTAAGTCCTAGAAATTTTGACCTGATATTGATGCAGGCATTGGAAAAATAAGTGGCAGTTG 

CAGTGCGGTGCGTTGATAAATGAAATACTTTGCGAATTCTATTATTGGAAATCGCAATTTGGGCTGTGAATGAAATTATAGTTTTCTTGAACGCAATTGAGAATTCAGTTCC 

GTTTCGTGGGTTACTGAATTTATGGGAAATAATTTAAGGCAGTCACTGCAGCTGACCGAATAATCCAATGAAACATTGCATAAACAGTTTGCAGTTTTTCCACACTTGCGAA 

AATTGTAATTGCAAATCGCTTAAGCCCCACGACGAATGACAAATGGCCAGCGAACGCATA 

(SEQ ID NO: 106) 

Exon: 1001.. 1202 
Exon: 4861.. 5963 
Exon: 6065.. 6282 
Exon: 6331.. 6564 
Start ATG: 1135 

Transcript No. : CT25824 

GGAAGGCGGCACTGCACGGCTGTCCTTGGCAGCAAATCCAGTTGCAACATCGGTGAGTGACGGGCGGCAATCACAAAGGACTTGCAACAGTTGCCGTTGCTTCTGCTATTGC 

TGTTGCTGTTGCTGTTGCAACAATGGCCAGTGGCAACAATGAAACTGAGCCGCTCTACTGCGGCAGCGGCATGGATAATTTTCATACAAGCTACAAGAACATGCATGGCTAT 

GTTTCGCTGGTGGTCTGCATCCTGGGCACCATCGCGAATACCTTGAATATCATTGTGCTAACCCGACGGGAGATGCGCTCCCCCACGAATGCCATACTCACGGGTCTGGCCG 

TGGCCGACCTGGCAGTTATGCTGGAGTATATACCCTACACCATACACGACTACATCCTGACGGACAGTTTGCCGCGGGAGGAGAAGCTCAGCTACAGCTGGGCCTGCTTCAT 

CAAGTTCCATTCGATTTTCGCCCAGGTTCTGCACACCATTTCCATTTGGCTGACGGTGACCCTGGCTGTTTGGCGTTATATAGCGGTGGGTTATCCGCAAAAGAATCGCGTA 

TGGTGCGGTATGAGAACCACCATAATAACGATAACCACCGCTTATGTGGTGTGTGTTCTGGTGGTGTCGCCGTCGCTCTATTTGATCACGGCTATAACCGAATATGTCGATC 

AGTTGGATATGAATGGCAAAGTGATAAACTCCATTCCCATGACCCAGTACGTAATCGATTATCGTAATGAGTTACTGAGTGCCAGGACGGCTGCCCTGAATGCCACGCCCAC 

CAGTGCACCACTGAACGAAACTGTGTGGTTAAATGCGAGCACCTTGCTGACATCGACAACCACCGCTGCACCACCCACGCCATCGCCAGTGGTGCGAAATGTTACTGTCTAT 

AGGCTATACCACAGCGATTTGGCGTTGCACAATGCCTCGCTGCAAAATGCCACATTTCTCATATACAGTGTAGTGATTAAGCTGATACCATGCATAGCACTCACCATTCTGT 

CGGTTCGATTGATCCTGGCCTTACTGGAGGCCAAGCGGCGGCGGAAGAAGCTCACCAGCAAGCCCGCCACTCCGGGTGCCAGTAATGGAACCAAATCACCGGCCAATGGTAA 

AGCAGCGGATAGGCCCCGGAAAAATAGCAAAACTCTGGAAAAGGAAAAGCAGACGGATCGCACCACGAGAATGCTGCTGGCGGTGCTACTCCTCTTTCTCATCACTGAATTT 

CCACAAGGGATTATGGGTCTGCTGAATGCTGTGCTCGGAGATGTCTTCTATCTGCAGTGCTACCTAAGACTGAGTGACCTGATGGATATCTTGGCCCTGATCAACTCCAGCA 

TCAACTTCATTTTGTACTGCTCCATGAGCAAGCAATTCCGCACCACGTTCACGCTGCTCTTTCGTCCAAAATTCCTGGACAAGTGGCTGCCGGTGGCGCAGGACGAAATGGC 

AGCTGCTCGAGCTGAACGCTCTGCGGTGGCACCGGTCCTGGAAAAGGGACGACAGCAGCCGCAGGTGCACAGGAGGAGTCGTGGTCGCCGAACTTTGCTCAGTCGCCTGCTG 

AGTGTCCTGAAACGCGGCAGAAGGCGCTCCTCCGGCGAAGGAGGAGGCGTGGGAGGAGGTGGTGCGCCGTTGGCCGGCAACGATGCGGTGGAACCAGCGTTCCAGGCCATCG 

TGGTGGTGGTGGACAAGGTGAGCGGTGCCACGGAGAATCAGCTGTACACCGCCGAGCAAGCTCGTATTGTGACGTAG 

(SEQ ID NO: 107) 



MASGNNETEPLYCGSGMDNFHTSYKNMHGYVSLWCILGTIANTLNITVLTRREMRSPTNAILTGLAVADLAVMLEYIPYTIHDYILTDSLPREEKLSYSWACFIKFHSIFA 

QVLHTISIWLTVTLAVWRYIAVGYPQKNRVWCGMRTTIITITTAYWCVLVVSPSLYLITAITEYVDQLDMNGKVINSIPMTQYVIDYRNELLSARTAALNATPTSAPLNET 

VWLNASTLLTSTTTAAPPTPSPVVRNVTVYRLYHSDLALHNASLQNATFLIYSVVIKLIPCIALTILSVRLILALLEAKRRRKKLTSKPATPGASNGTKSPANGKAADRPRK 

NSKTLEKEKQTDRTTRMLLAVLLLFLITEFPQGIMGLLNAVLGDVFYLQCYLRLSDLMDILALINSSINFILYCSMSKQFRTTFTLLFRPKFLDKWLPVAQDEMAAARAERS 

AVAPVLEKGRQQPQVHRRSRGRRTLLSRLLSVLKRGRRRSSGEGGGVGGGGAPLAGNDAVEPAFQAIVVVVDKVSGATENQLYTAEQARIVT* 

(SEQ ID NO: 108) 



Start ATG : 135 
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Name: G-protein coupled receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384565 

TAACCTTCTTTTGACCTTCGTAAGTCTTCCACTGGCRCTTGAAAAGCTTTGGGAAAACAGCTGTGAATTTGCCCAGCAGAGCTTCCGAflACGACGGCTTTTRGTACCTCCGC 
CGTAACACCACCCACAGTGGGTCGGTTCTGGTCCATGGTGGGGATGGTGTGGTGGTTCCGATGATACGGGAATCGCTGGAGAAGCGTTACTGATGCAATGCACTGGATACCG 
TTCCTGTTCAGATCATTGAGTACTCCAGGCGGCATTTCTATTGTGGTTACAATAGTAAGTAAGGAGTAGTAAAAGTTATTCTGAAATTATATAAACGGCTTGTTGGGGTTTT 
AAAAATGTTGCCGGTTTTAGATTTAGCAGTGACTCCAACCATCTGATGGGGAACTTTGAGCAAGTGAGCTCCAATAGTTTTGGCGTCGCTTCCAAAAAGCTCTGGAAAAGCT 
TTTATCAACAGGCTTGCCAAAATCCGCTATACTAGCATATGTAAAAGAAAAACTTGTATACCCTACCAATAGGCATACATCGCATCAGGACTTATTTAACCAACAAAAATAA 
AACACAACATGAATGTACATTCTGGTTTAATCGCTTTATTTAACTTCAACTTTGAAATTCATGTCGCATGGCTTAACTTGGTTTTCACAGACGTTCAGGTATACATAAGCTG 
ACATATACACAGTAAATACTCTAACAACATTAAACATATGTATGTACATAATTAACTAGTCCGTAGAAACGTATTATGACCATACACTCAAAATAAATGCGCATAGCTCCTA 
ATCTCTATATTTCACTACCGAACTAATTGATTTAGATCCGATATAGAATTTGACAACGTTTCCGAAAGAGAGCCTAAGAGAGCTGGAAAGTACTTAACGTAAAAGCTTTGAC 
TTGCGAGTATTTCCTCAGGGGTCACCCCTTACCCCTTTCGCATCCTTGTGATCCTCTTCCAGTGGACTTTCCTGCTGATCCCGCTTCTTTTCCGACCGCGAAGGTCAACTCT 
CCGCGCTGGTCGCCAGAAGGGACGGCGAGGAGGTGGTCAGACCCACTGGAACCGATTCCTCCTGAGCCTGGCCCATGGGCGGTGGTGATGGAGTGGGCGTGGAGGCCGTGGA 
CGACGCCGTCGATGAGGTGGGACGGGCGGCGTGAGGCTGGAGGAAGTGAAACAAGTTGGAATGCTGCATGAAACTGCCCACCGATCTCAGCGGACTAAAAAAGTTCTCTGTG 
GGATGGTTGGCTGGCTGCGGATGCTGTGGAAGTACAGAGGACTCCTGATCCCTGCAGAGATTACTGCTGGGCTCGTCGTGGACCCCTCGTCCCTTAGCGATGGAGCTACAGT 
TGCAGGATCGGCTCTTCrTCAGCGAACTCCGAGTGAATGCCGTGCCCTTTCGATACTTAGGCTTGTGCCCATTTGAGCCGGGACTCGCCTCACTATCGATCACAAAGATACC 
GGATTGCTCCTGGGGAGCAGGACTCGCCGTGGATCCTGGCAGGCAGTGAGCGCTATTGGAGTTGCCAGCAAGCAGTTTGGCCCTCAGAAGAGCCGGAAGTGGGCCATTTGTG 
GGAGATGTTTGAGTGGAGAGCGGTCTCTCCCTGGACTGTTCATCTATGGGAAGATCCTGATTGTTGCCCATTGGCCTGGAGTCCGTGTCATCCGATCGCAATTTCTCCCTCA 
TGACCAAAGCCAGGGAGCGACGCAGTTCCACGGGATCTATGCCAGGCTGAACGGGATTTGGAACAGAGCTGGCCGGCACGAATACGTCCTCAAACTCCAGATCATCATCATC 
TTCGTCCTCATCCTGGCTGAAACTGTAGTTGTGGTTGTCATAGGAGGCGGACGGAACTGAGGGCTTCGCAGGAGCAGGAGGAGCTGGTGGGACTGTGGAACTGCTATTTCGT 
TCACTGCTGCGCTGGGAACCCGGAGCGGGGGAAGGGAACTCGTAGCTGGTTTCACCCAGCTCGGCGCCACTGGAGGCCTCCACTATGTTCCACATTTCTCGCTCGAGATCTG 
TGGTCACCGAAGCCACGGCGGCCACCGTTTGGTTGATGTTGTTGAAACAGGGCGACACCACTCCGCACTTGGGCAGGAGCTGGGAGTTCTTCAGGGCCTCCTGCTGCTGCCT 
CAACCAAATGGGTGAATCTGCAGGGAATGTTACGAGAGTTATAGGGATTTTGCCACAGTTCCAAGGAGACAACGCGCACTCACACTCACCCACATAGAGGCGCGGCTGCGTG 
GAGCGCATATTGGAGTCCATGGACTGCAGGCGATGCTGACTGGCCACCGAGAAGCGGTGGATTGCCTCCGCCTGCTGATCAATGTCCACGCCCATCATCTTCAGGAGTAAGT 
TCTTCAGCGCGGCCCTGAAGTCCTTGAGGTGATAGGCATACAGGACCGGATTTACCGCCGAGTTCAGATGGGACAGAATGATGCAGAAGAGCGTCAGCTTGGGATGCACATA 
GCAGTCGGGACAGAAGGCCTTGATGCAGTTGATCGTATAGAGCGGTATCCAGCAGATCATAAAGAACAGCACGATAATGGACAGATTCTGGGTGGCCTTGACGTCCGGCTTC 
CTTGCGGCGCCCAAGACCCGCAGCATGGTTCCCGTGTGTCCACCTCTTCCGGGTGTCGTCACCTGCACCACGGCCGCCGAGGAGCGACGACTGAGATCGGAGGCGGGGTTCA 
TCGTAACAATCTGACGGACCTTTAAAGTTCAATTGAAAACGGATAGTTAATATTCCAAACTCTATGGAATATATAAAAAACTTTACTGGCGAAAATCTATCTACCACTCACC 
TGTTTGATGATGACCCGGTAGATGTGCGTGTAGAAGGCCAGCATCAGCAGAGCCGGAGTGATAATGGTGGCAAAATAGAGGAAGACGAGGTAGTTGTAGTCCATCACCTCCA 
CGAAGAGACACTCCTGGTTGTGATTGACATCGGCGTGCCAGCCGAAAAGCGGCAGGAAGCCCACTATCGTCCCGGCCACCCAGCACATGGAGATGATGACTGCAACGGAAGA 
GGAACATTGGTAATCAGAGTGCCCTGAACTGAAGCCCTTGAAGTGGAGCTCCCCCAGACGTCATTTGAGCAGATTGCAGGAATCCCTGCCGTCTTCGCCACCAAGCACCATA 
AAAACTTTTATCGAATTGCGGGAAACTAAACAAAACTGGCAGGCAGGGTGAGTGCACTTCCCAAATGACGCAAATAATTACCAATTTTCCGGACGTAGCATATTGGAAAGTA 
ATCTTGCCTTGAGCGGCCATTTATAAAGCACTATATAGTAGGTGATATATAGTGCTTAGAAAATTATATATTTATGAGATATTTTTAGCTAGAGGCCAATTATTCTGATTCC 
AAACAGTTGCTATAAACATCTAATATGAAAATTCTTAGAATCAATTTGAATCTAGTTAGGCTAGTAAGACTATTATGTTACGGATTTTAAGGATGTTTATTACTGTGTACTG 
TTCTCATTGCGTCCATTCGTGGTAGTTCAAATGAGGCGTCTTTTGTTTAACTCAAGTCGCAACTCAAACATGGTCGATGGCCGGAATGAAACACAATGTGCTGTATATTAGC 
CATCCAGAGTCATTAGTATCACGGCAACAAAGGCACAATACTAAACAATGGACAACTTAAAGTCGAAATAGTTTGACAAATATCCTGACAAGTAGGTGAAACCAGAAAACTG 
GTCCTTTAAAGTGGGTTTTAGCATACCATTTTGTACCTACTGATTTTATACAATACATTGTATATACCCCCATATATACTTACATATCGCCGTGCGGGTGCGGACATTCCTT 
GAGTAGGCCATCGGATATAGGATGGCCCAGTATCGATCCACGGACACGGCCACCAGACAGAAGATGGAGATGGTACACAGCACCACGAGCAGGGAGACTGTGAAGAGGCAGG 
CATGAAGGTTTCTGGGTAGTCCCATGGAGGCCAGGATGGCGAAGGGAATACCCAATGCGCCCACTAGCAAATCGGCCATGGCCAGGGATACTATGTAGTAGTTGGTGCGGCG 
TCGCAACTTTCTTTCCCGGCGGAAGACGATGATCACCAGCACGTTGCCGATGATGCTGACGATGGCCACCAGGACCTCGAAGACGGTGTAGGGTATGTTCAGCTCGGAACTG 
GGGCTGTCCGAGTCCTTGGCATCCTTGCTCGTGGTGGCCGCGTGCAGCGGTAGCAGAGGTCCCTCGAAGGAGAAATCGGTGATCGAGAAGTAGCGAAACGCGGACATGGTGG 
GCGGTGTTGCGGGTATTTTACGGGTTCCACGGGGTTCCCTGCCCCGATCGAGTATTTAAGGCTAGTGCCGAGCTAATGAAAGGGCGGAACAGGTGTGGGCTTCAATTACAGT 
GCCCGCTGGCCTGTGAACTCCCTAATTGACTTGAGGTGGCCGTGGCAGTAGCAGCGATAGTAATAGCACATGCTGCGCATTGCGTATACGCCATGCCCGCCGCGCTGCGGTT 
GCTCCTTTCCCGCTTTCTCGCTTTTCCGATTCGCTTTTTTCGCTCTCGCTCTTTATTTTCCTTTCCGTTGATGTATTTGTTTTTCTGCTGTCGACGTACCCTATTAAGCTGG 
AAAAAAAAGGGAAAACTTAATCATCGTTCGGCACGTTTGTTACACACATGCAGCAACTTAATTGTAGCAATTTTTTAGTGTAATAAATTGATAAATATTTGCCAAAAAATAC 
ATGTCCATATTGATAACGTGTGCGGCGAGCAAAACAACAAAGGCCGTATCACAACTATGCAATACCCTAGATAATCGGCCAGTACTTGAGTATATTATTTGAAAGGGGCCAT 
CAGTTGCAGGTCTTCCTTAAAGCCAGCAGCTCAAGGATCACAGTGAAATATGTTGGAACTTCGATGGCTTTTACGATGATGGTGTTTCTGGTTTGGCTTATGGGGTATATAT 
ACCCGCAACTCTGCGAGTACACACTGCCACACTATCTGATTCGCGTGCGGCAAATACAAAGCTCGAAAAGCAAACAACTCCCGACTAGCAACAAAATCAATTGCCAGACAAT 
CGAACCGGGCGCTGCAGAACATAATAAAAGTGTATAAAAAGGACGGAACAGCGAGAACAACGCAGAAACAGACGTCCCATAAATAAAAGAGCATCCTACACAGAGAAAAAAG 
AATTAGGCAGGGGTTTACGAAAATAATTCATATTCAATTCAAAAGTATTCGCTTTTTTTGTAATTATTAAGATAGATGATGTATTGCTCCTAAAAACTC 
(SEO ID NO: 109) 

Exon: 4251.. 3780 
Exon: 3011.. 2800 
Exon: 2707.. 2212 
Exon: 2145.. 1001 

Start ATG: 4251 (Reverse strand: CAT) 
Transcript No. : CT27563 

ATGTCCGCGTTTCGCTACTTCTCGATCACCGATTTCTCCTTCGAGGGACCTCTGCTACCGCTGCACGCGGCCACCACGAGCAAGGATGCCAAGGACTCGGACAGCCCCAGTT 
CCGAGCTGAACATACCCTACACCGTCTTCGAGGTCCTGGTGGCCATCGTCAGCATCATCGGCAACGTGCTGGTGATCATCGTCTTCCGCCGGGAAAGAAAGTTGCGACGCCG 
CACCAACTACTACATAGTATCCCTGGCCATGGCCGATTTGCTAGTGGGCGCATTGGGTATTCCCTTCGCCATCCTGGCCTCCATGGGACTACCCAGAAACCTTCATGCCTGC 
CTCTTCACAGTCTCCCTGCTCGTGGTGCTGTGTACCATCTCCATCTTCTGTCTGGTGGCCGTGTCCGTGGATCGATACTGGGCCATCCTATATCCGATGGCCTACTCAAGGA 
ATGTCCGCACCCGCACGGCGATATTCATCATCTCCATGTGCTGGGTGGCCGGGACGATAGTGGGCTTCCTGCCGCTTTTCGGCTGGCACGCCGATGTCAATCACAACCAGGA 
GTGTCTCTTCGTGGAGGTGATGGACTACAACTACCTCGTCTTCCTCTATTTTGCCACCATTATCACTCCGGCTCTGCTGATGCTGGCCTTCTACACGCACATCTACCGGGTC 
ATCATCAAACAGGTCCGTCAGATTGTTACGATGAACCCCGCCTCCGATCTCAGTCGTCGCTCCTCGGCGGCCGTGGTGCAGGTGACGACACCCGGAAGAGGTGGACACACGG 
GAACCATGCTGCGGGTCTTGGGCGCCGCAAGGAAGCGGGACGTCAAGGCCACCCAGAATCTGTCCATTATCGTGCTGTTCTTTATGATCTGCTGGATACCGCTCTATACGAT 
CAACTGCATCAAGGCCTTCTGTCCCGACTGCTATGTGCATCC.CAAGCTGACGCTCTTCTGCATCATTCTGTCCCATCTGAACTCGGCGGTAAATCCGGTCCTGTATGCCTAT 
CACCTCAAGGACTTCAGGGCCGCGCTGAAGAACTTACTCCTGAAGATGATGGGCGTGGACATTGATCAGCAGGCGGAGGCAATCCACCGCTTCTCGGTGGCCAGTCAGCATC 
GCCTGCAGTCCATGGACTCCAATATGCGCTCCACGCAGCCGCGCCTCTATGTGGGTGAGTATTCACCCATTTGGTTGAGGCAGCAGCAGGAGGCCCTGAAGAACTCCCAGCT 
CCTGCCCAAGTGCGGAGTGGTGTCGCCCTGTTTCAACAACATCAACCAAACGGTGGCCGCCGTGGCTTCGGTGACCACAGATCTCGAGCGAGAAATGTGGAACATAGTGGAG 
GCCTCCAGTGGCGCCGAGCTGGGTGAAACCAGCTACGAGTTCCCTTCCCCCGCTCCGGGTTCCCAGCGCAGCAGTGAACGAAATAGCAGTTCCACAGTCCCACCAGCTCCTC 
CTGCTCCTGCGAAGCCCTCAGTTCCGTCCGCCTCCTATGACAACCACAACTACAGTTTCAGCCAGGATGAGGACGAAGATGATGATGATCTGGAGTTTGAGGACGTATTCGT 
GCCGGCCAGCTCTGTTCCAAATCCCGTTCAGCCTGGCATAGATCCCGTGGAACTGCGTCGCTCCCTGGCTTTGGTCATGAGGGAGAAATTGCGATCGGATGACACGGACTCC 
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AGGCCAATGGGCAACAATCRGGATCTTCCCATAGftTGAACAGTCCRGGGAGAGACCGCTCTCCACTCAAACATCTCCCRCAAATGGCCCACTTCCGGCTCTTCTGRGGGCCA 

AACTGCTTGCTGGCAACTCCAATAGCGCTCACTGCCTGCCAGGATCCACGGCGAGTCCTGCTCCCCAGGAGCAATCCGGTATCTTTGTGATCGATAGTGAGGCGAGTCCCGG 

CTCAAATGGGCACAAGCCTAAGTATCGAAAGGGCACGGCATTCACTCGGAGTTCGCTGAAGAAGAGCCGATCCTGCAACTGTAGCTCCATCGCTAAGGGACGAGGGGTCCAC 

GACGAGCCCAGCAGTAATCTCTGCAGGGATCAGGAGTCCTCTGTACTTCCACAGCATCCGCAGCCAGCCAACCATCCCACAGAGAACTTTTTTAGTCCGCTGAGATCGGTGG 

GCAGTTTCATGCAGCATTCCAACTTGTTTCACTTCCTCCAGCCTCACGCCGCCCGTCCCACCTCATCGACGGCGTCGTCCACGGCCTCCACGCCCACTCCATCACCACCGCC 

CATGGGCCAGGCTCAGGAGGAATCGGTTCCAGTGGGTCTGACCACCTCCTCGCCGTCCCTTCTGGCGACCAGCGCGGAGAGTTGA 

(SEQ ID NO: 110) 

Start ATG: 1 (Reverse strand: CAT) 

msafryfsitdfsfegpllplhaattskdakdsdspsselnipytvfevlvaivsiignvlviivfrrerklrrrtnyyivslamadllvgalgipfailasmglprnlhac 
lftvsllvvlctisi fclvavsvdrywailypmaysrnvrtrtaifiismcwvagti vgflplfgwhadvnhnqeclfvevmdynylvflyfatiitpallmlafythiyrv 
i1kqvrqivtmnpasdlsrrssaavvqvttpgrgghtgtmlrvlgaarkrdvkatqnlsiivlffmicwiplytincikafcpdcyvhpkltlfciilshlnsavnpvlyay 
hlkdfraalknlllkmmgvdidqqaeaihrfsvasqhrlqsmdsnmrstqprlyvgeyspiwlrqqqealknsqllpkcgwspcfnninqtvaavasvttdleremwn1ve 
assgaelgetsyefpspapgsqrssernssstvppappapakpsvpsasydnhnysfsqdededdddlefedvfvpassvpnpvqpgidpvelrrslalvmreklrsddtds 
rpmgnnqdlpideqsrerplstqtsptngplpallrakllagnsnsahclpgstaspapqeqsgi fvidseaspgsnghkpkyrkgtaftrsslkksrscncssiakgrgvh 
depssnlcrdqessvlpqhpqpanhptenffsplrsvgsfmqhsnlfhflqphaarptsstasstastptpspppmgqaqeesvpvglttsspsijLatsaes* 

(SEQ ID NO: 111) 

Name: Adenosine receptor like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384520 

GAGCCAATTCTTTTTTGGCTTTACGTGACGTTGGCAGAACAAATCGTTTTTGTTT^ 

TTGCGTCGCTTTAATGCGATTGCTGGTTGCTCATTACGGCCAATCCTTACTAGTACTTGCACTTTCTGTGAATTCGTCTGGGGTTTCCATTTGGCTGATTGCATTTTCATAA 
CAATTTGGAGGATTACTAATGCGCCACGTTGTCCCTGWTTTGCATGACCTTGAACCACTAGCCACCTCATTAGAATTTCATTTGTAGTCAACAAGGCGCAGACGACAGGGC 
CACAATCGATAAAAACCCAAATGTATTTTATGGCCAAATTACATTTTTACTAGAATGAATCCATAGAAAGATGCGGAAAATAAGCGGGCGCGGAATATGCTAAGCTTTATTT 
GCAGCCATGGGAATAATCTTAATACAGCAAATTCATCCTTTAAGGTATCTCTAG^AGTAAGTTTAGTAAGCGAACTTTGTCCGTCATTTCCATTGAGTTGAATGCTACATTT 
TCGAAATCTTATTTTAAATGCGATTCAATACAATATACAGAATCTACTATGTTTAAGATCGCAACATTTTTTTTTACACTTTCTTAACTAAACACTTTAAAAGCACATCTGT 
GTTACACTTTGGTATGCATGTGAAACATTTGGTTAACTAAAACATATATCTACGCAATTTAAAAGGCGATCTATTAAATTTCAAGGGGCAAGCTTTTCGTTTTTAAATTTAA 
TTCCTATATTTAAATGGTGGGCGTTCACTAGTGATTGATTTATGCTTACCATTATCGATAATATTATTAAGTTATATCCGCCTCAGGCTTGTTTAAGAGTAAATCCCCCGAG 
CTCTGACCAAAGTTTAATTTTAAGAGCGCCAGTATGCTGAATAGTATGAGTCAAAAGATTGAATATTCTTAGACTGAATAATTCTGAATACTTGAAAGACTAGTTTAGGAGT 
TTTTCCAGCCGTTGTCCGCAGCTGAATGAATGAAAAACTTAATGCTTAAGGGAGAGTGTGCGCTCGACTGTGATTAACTTTTAATTGCCAAATGCAAATGAATGCAAAAGCA 
GTTTAAACAAAACGCTTTGGTAGCAGCAGCAATAGCAATAGACGACAAATAACTGGCTTAAACAGAATTCAAATAAATTAGCATCAGGAAGCCGCACGAGAGAGTGAGTTCG 
AGTTTATCAGTTAGTTGACTTGGACACCGATCATGGCGTACTGAGCGGCGACTGGCGGCTTGTTCGTCGGATTCTGGATGTGGAGTACACATAGAGAGAGAGAGAGAGAGAG 
AGAGAGCCAGTTAATATCGAACCTATCCATTGATTATAGGGGGCACTTGCCTGCATTATCGACGGTTTCGGCTGCTCCGCCGACTCAATGGAGAAGCGCTGCGTTTGACTGG 
AATTGGTTAGTGCCCTCCTCAGGCGGCGCGATTCGATGTTGTTTCGCGAATTCAGCGAACCCTTCGATACCTTCTTGCCAAACAGAACGGCCTAAAGGAGTGGGAAATCGGT 
AAGGTAATTAGTTAAATAGAAAAATTCAGACATTCGAATTATTAATAAATATATTAATAAATATATATAAATATTTATAATACAGATAAATTTGAAATATAGTATTGCGCAT 
ACTATAACCCCCACTTCCGTTTTACTCACCTTGAATGCCTCTCGGAACTTGTGGCTCATAATGTTATATAACAGCGGGTTGATGCAGGTGGACAAGTAGTAGAGGACACCGG 
AGACATAGGTCATCACCGTGTAGACAAACTCGTGCTGATCCCGCAGTTTGGCCCCCCGTGCAGGGGCGTAGATGGCAATCAGTCGCTGGGCGTGGAAGGGGGCCCAGCAGAG 
GAAGAAGCACACCACCACGGCCACTAAAGGGGAGGGGGGGGCGGTAAAAGTCAGTGACGTGGGGATGAAAATGGAATGGCAATGGAAATGGGGAAGACACAGCGAGAGTCAC 
AGATAAAGAGTCAACGATTATATTTTTTGGTAATCGCAATTGTCATCATAAATGGAGAGTGCCAAAATGACAAACGCAGTAAATTGGCTGCAGACGGGTTGATTAGCCCAGC 
TTAGGGCATATGGTCAAACAGTTGGCATACACTGAGCAAAAGTTGAAGTAAAGCTTCTAAAATGGTTCTTCATTGCATATAAAGAAAGATTTAGATATCATATTTCAGAATT 
TTTAATTTGCAAGACTTTTGTGAGAAGACTAAACCTTATTAAAAACGGAGTCTTCTACTTTTGTTTTTGTCAAAATTTCTTACTAATTTAATCATCTGAATACTATTAAGTC 
ATGTTTACTTAATGGTTAGCGGTTTAGAGATCTGTCATTTAATTTGTGATACCCCCGTACTATTGCTGGTATTTTTCGCCAGTGTATATCGTGCTATAATTACCTAGCATCC 
TGAGTACTCGCCGGGTGCCATAGTGATTGAGCCGACCTCTCACCGAGCTAAGTTGAGCCCCCGAGCCGCCCATCAAGCCCGCTGTCCCTGCTCCACTTCCTCCGCCGTTGAA 
ACTCATAGCGGTACCGGATCCACCATAGCGATATAGGATCGTATCACTGGGCACACTCTTCAGCTGCTGCCGTCTGGCAACCGAGGCAGGACCCTCCACCAAAGTGGATCGA 
TACAGGTGCACACCGATAAGTAGGTACAACACTAGGATAATAGACATCGGAGCCAGGAAGAATATGAACGTGGACAGCTGGAATGAGTGCTTCACTATGACCCGCACTATGC 
CGCATTGCTCCACTCCCGAATAGTGCTCGATTCCAAATTGGGCAGCCTGCGGAATGGCCGTAACTATGGCCATAATCCAAACCAGGACGATGATGCGAATGGCGCGACTGAG 
TTTACTCATGGCCTGGCCCAGAAACGGATGGCAAATGGCAATATACCGCTCCACCGTGAAGGCCGTAATCGTTAGCACCGTGGCATTCGCCGATGTCTCCGCCAACAGACCA 
CGTCCGATGCAGATGTACTCCCCAAACACGTACGGGTACTTGGACCAGATGTAGGACACCTCCTGCGGAACGCCCGACAACAGGAGCAGGAAATCCGAGATGGCCAGCGAAA 
AGAGGTAGTAATTCGTGGCCGTGTGCATTGAACGGTTCTTCTTAATCACAATGCAGGTACTTATGTTGCCCACTACACCGGTTATGAAAATGAGGGAGTAGACCACCGTTAC 
GGGTATCACGATGGCCAGCGGATCGCGTGGCGGTCCAAGATCATGGCTCATATTGCCAGCGGACATGGCGCCGGCGGCCTGGAATTTTTCCCCTTTTGTTTTCACAAAATTT 
CCACTAACAGCGCTAAATTCCACTGCCGTTTAACTGCTGCGGCTCCGGCTGCGACTCGATGTCCCCTCAACTATGCTAATTATAATAATTTATTGGAGGCGACCGGGCGGAG 
GAGGGACAGGTCTGGACTCGCGCTTCGCAATGCTAGATACTCGATTCTAAGATTCAGATACTCGACTGGCACGCAAACGCACCAAATTACCGACGAGGTCGAGATCCGTATA 
CCGTATTCAGTATTCAGTATTCCGTATTCCCGGCTGTTTTTCTAGGGGGCCCGACTTCTCTTCAATAAGTTATCGAGTTAATTTGATTTCGATTCGAATTCTGATTCCCGAG 
AGCAGCCTGCAACCGCACTCCAGCGCGCTCCAAACTTATCTGATAAGCCGCGGCGACATTTCATTCAGATTTGTATCTGTATCTGAAAGCCAGCGACGAAGCGTCGCTGTTT 
TTCCCTCACATGAACGCGGAAGGTGAGCGACATGGTTGTAATGACCGAGCGTCCAGCGAACGTGCCGGCTCAATTAAATCTTGAAAAGCGGCGATATCTTGATGGGACTAGC 
CGATGTTAGAGACTAAGCACAATGGCTCCGAATACAACGCAATTTTTGGGTAATTAAAAAGGCTAATTAACAAAAACTTGTAATAATGTTTGCTTTGGCCCATTTATAAAAC 
ACTTAGTTGGGAATTTTGACTTTGTGATGCACATAATCCTAAATAAACCACGATAATGTAAACAACACATCTTGGAATTAAAAGCCAAAATAAGATGCAATAATATTAGATG 
ATGCAATAATACCAAATGACCGAAGGCCACCGTGCGTATGAGTAATGTCCAGCCAGCTTACTTGTTAGCTTATAAGTAAGAATATTGTTCTTTGTTTATCTTACTAAAGTAT 

ATTGAAATGTAAACAAATAATAATAATTACAATTCAAATCTAATTTAGATTAATTAAA 

(SEQ ID NO: 112) 

Exon: 3314 . .2456 
Exon: 1927.. 1710 
Exon: 1547.. 1367 
Exon: 1029.. 1001 

Start ATG: 3314 (Reverse strand: CAT) 
Transcript No. : CT27924 

ATGTCCGCTGGCAATATGAGCCATGATCTTGGACCGCCACGCGATCCGCTGGCCATCGTGATACCCGTAACGGTGGTCTACTCCCTCATTTTCATAACCGGTGTAGTGGGCA 
ACATAAGTACCTGCATTGTGATTAAGAAGAACCGTTCAATGCACACGGCCACGAATTACTACCTCTTTTCGCTGGCCATCTCGGATTTCCTGCTCCTGTTGTCGGGCGTTCC 
GCAGGAGGTGTCCTACATCTGGTCCAAGTACCCGTACGTGTTTGGGGAGTACATCTGCATCGGACGTGGTCTGTTGGCGGAGACATCGGCGAATGCCACGGTGCTAACGATT 
ACGGCCTTCACGGTGGAGCGGTATATTGCCATTTGCCATCCGTTTCTGGGCCAGGCCATGAGTAAACTCAGTCGCGCCATTCGCATCATCGTCCTGGTTTGGATTATGGCCA 
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TAGTTACGGCCATTCCGCftGGCTGCCCAATTTGGAATCGAGCACTATTCGGGAGTGGAGCAATGCGGCATAGTGCGGGTCATAGTGMGCACTCATTCCAGCTGTCCftCGTT 
CATATTCTTCCTGGCTCCGATGTCTATTATCCTAGTGTTGTACCTACTTATCGGTGTGCACCTGTATCGATCCACTTTGGTGGAGGGTCCTGCCTCGGTTGCCAGACGGCAG 
CAGCTGAAGAGTGTGCCCAGTGATACGATCCTATATCGCTATGGTGGATCCGGTACCGCTATGAGTTTCAACGGCGGAGGAAGTGGAGCAGGGACAGCGGGCTTGATGGGCG 
GCTCGGGGGCTCAACTTAGCTCGGTGAGAGGTCGGCTCAATCACTATGGCACCCGGCGAGTACTCAGGATGCTAGTGGCCGTGGTGGTGTGCTTCTTCCTCTGCTGGGCCCC 
CTTCCACGCCCAGCGACTGATTGCCATCTACGCCCCTGCACGGGGGGCCAAACTGCGGGATCAGCACGAGTTTGTCTACACGGTGATGACCTATGTCTCCGGTGTCCTCTAC 
TACTTGTCCACCTGCATCAACCCGCTGTTATATAACATTATGAGCCACAAGTTCCGAGAGGCATTCAAGGCCGTTCTGTTTGGCAAGAAGGTATCGAAGGGTTCGCTGAATT 
CGCGAAACAACATCGAATCGCGCCGCCTGAGGAGGGCACTAACCAATTCCAGTCAAACGCAGCGCTTCTCCATTGAGTCGGCGGAGCAGCCGAAACCGTCGATAATGCAGGC 
AAGTGCCCCCTATAATCAATGGATAGCTGCGGACAACGGCTGGAAAAACTCCTAA 
(SEQ ID NO: 113) 

Start ATG: 1 (Reverse strand: CAT) 

MSAGNMSHDLGPPRDPLAI VIPVTVVYSLIFITGVVGNISTCIVIKKNRSMHTATNYYLFSLAISDFLLLLSGVPQEVSYIWSKYPYVFGEYICIGRGL1AETSANATVLTI 

TAFTVERYIAICHPFLGQAMSKLSRAIRIIVLVWIMAIVTAIPQAAQFGIEHYSGVEQCGIVRVIVKHSFQLSTFIFFLAPMSIILVLYLLIGVHLYRSTLVEGPASVARRQ 

QLKSVPSDTILYRYGGSGTAMSFNGGGSGAGTAGLMGGSGAQLSSVRGRLNHYGTRRVLRMLVAVVVCFFLCWAPFHAQRLIAIYAPARGAKLRDQHEFVYTVMTYVSGV1.Y 

YLSTCINPLLYNIMSHKFREAFKAVLFGKKVSKGSLNSRNNIESRRLRRALTNSSQTQRFSIESAEQPKPSIMQASAPYNQWIAADNGWKNS* 

(SEQ ID NO: 114) 

Name: Growth hormone secretagogue /neurotensin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384297 

TTAAGCCACAAAATGATTGAAAGGAAATGTTGTGGGGTAAAAGTAAAATTCTCAGATCGATGAGTACAATGTACTATGTACTTCCCTCGGCAGCTCCTGCTGTGTTCCTTCA 
GCCGTTTGTGTCTGTTTGTCCCTAAAGTAATTAAAAGTAAGTAAAGCAAAGTCACCCCCTCGTCGGTGGGCGTGGCTGTCCAGTGATTTGCCACCCGCAACGATTTTTTTTT 
CCAGTACTTTCCTTTTTTTCTTTTTTTTGCGTTTATGCAAATGGCCGCTGACCAAAAGAATTAATGCGTCCCGAAAGTGAGTTGAATTGGAATGAGCCGCAATTCTCGGAGT 
CCAAAAACTGCGGGCAGCCTGTCAGCCGTCTGTTTTCACTTCTTTTGCGGGCGGGACGAAACATCAACGAATGTCAATGAATTTACAGGGGAATGCTATAAAAAGCCCGACC 
AAACAAAAAATGTAAAGCCCGTAATTCCCCATTGTCTCAGTGCGTTGCTAATTTGGCCATATCTTTAAATAGCCAGCCGAACGGAAAGGAATCATTTGTAATTTAAAGTCCA 
ATTATTGTGCCATTTGAGGGGAAAACTGTGGCATGCCAGGGAATGGAATGCCTCGTAAGCCACATCGCTGCACTTCATTAAATTGACGAGACAAAAGATCTCTACACATACA 
ATAATGGAAAATATTTGTTCCACAATAAAAGGGGTTGAGGAGGCACTTTGTGTTAATGCCCAACAGCTGCAAATCTCGGTCTGCGGCGGCCTTAAAAATAGAGAAATGAAGA 
TGGACGGTCCGATGATGATCCGCTTGGCTGAAATTATTCTGTAAATTCTACCTATTTATATCGTAAATATAGATTTTGATGCTGATGCCGATGCTGCGCGAGTAAAACCAAA 
TTCGGCCCACGCGGCGTATGCGCGTTTTTCGCACTTAACTCATTAAAACGTTTGCCGCTGGCCCATTGAAATTTAATGGGAAAACACGCGCTGTTTTTTTTTTTTTAGAAAT 
TCGCCTCAATAGATTAAGCTCGTAAATTATTTAATTGCATTCTTGATTTTTGATTTGAAATCAAAACATTCCCTCATTTAAAAACTCATTAGGAATAGCAATGGAAACGTGG 
TTGAGCGGATGTTTATATTGTAGAAAAAAAAAATATATATATATCAAATATATGAAAAAGGTTTTGAAGTTTGTCAGTCAATAAATCAATATAAAGCCCCACTTAAAATTAC 
GACACAAAAGTTTCCATCAAACGAAATGCAAACGAAAACAATGCTCTGAAAATATTTCATCGATATATTGACCGAGCACTCAGATGTCGTCACTTGAAGGACCCTTCTCGTC 
CAGCCCCCGAGTGCACCAACCCCACTCACCTTGTAGAAGGCCTTCCGGAAATTCTCGGAGAGGAAGGCGTAGAGCAGCGGATTGATACACGAGCTGCTGTAGGCCAGAGTCT 
GGGCGGTGACCTGGATGACTAGCTTGGTGAGGGTGTTCGTCTCGATGACATCCAGTGACTTGAGCAGCAGGATGAGCTGTTCAATCAAACGAAAGAATATGTTTGATTATTT 
AAAATATATGTGGGTTTCATATGTATTATTGCGTCTCACCTGGACAGGCAGCCAGAGCGAGGCGAAGGCGATGACCACCACGACGACGAGTCGGGTGACCCGCTTGCGACCG 
CGCTGCGACTCCTTGGACATGCGGACGCCGGTTCCCTGGCGCCAGAGCCGCATGATCATGCGCATGTAGAGACCGCTGATGATCATCAGGGGCAGCAGGTAGGAGCTGATGA 
AGAAGGTGACCTGGTAGGTGCGCGGACCAAGGAAGTCGTTCGTCGTGAAGGTGCACATGCCGTAGGTGATGTTCTTCTTTGCATCGTAATCCACCTGCGGATATCGATGAGA 
ATGTGGTTGTGGGAACTATGAAGAAACGTGTGCGGTGTGGGAATCGAATATTGCGCATACGTCATGTTGGCGGTGACAGTTGCATGGCAGCCAAGCAACGTCAAACGGCCAA 
CAGCCAGACATCGTAAATAAAACTAAACTGTCATACGAATAGTTCGGACGTCGACAACTAACTCGCAATCTCACTTTGTTTCTCTGTCGCTGACATTTGGCTGACAAATTGT 
TCAGCCGGAGTTTTGGATTTGGATAAAAGCGCAATGGGTATGGGCAATCGGGCTTTCGAGGCTTGTTGTTTGGGGGATGGAGAATCTATTGGTCTGTTGGCTAGACGAGACT 
GGTTTGCTGACGCTTAGGAGGGTCGTAAACTGCCCGGTGAATCGTAAAAATATTCGCAGCCTTTTGTTTACATTTTTGTTTACAATCTCCGCCCCTTCACCGCGCTTTCAAT 
CGATGAATGCGTGTGGAAAATTGCATTTAAAAAATCTTCCCATTAATGCCGGCATGAATTGCTTACAAAAATGTTTGTTTATGGCCCGCCCTTGTTTTCGTTACACTGAGCG 
AGCGATGTTGGCCCACAAATTGAGTTCGAACTCTGAATCTGCGCAGCGTCAAGTGGCCAAGGCCACAGAAGTAGACAAATCATGGCCAGACCCTCCGCTGATTTATGGGTTC 
ATTTCGCTGGCATGTTCTCCGCAAAACCCAACGACTTCATAGACTTGACAAGCCATTAATGCCCTTTCCTCGTTCGCGAGAAATCAGCCAACAGCTGTCAATCAATGCCATG 
ACCTGGAGATCGCATATCTAGTTACGCAGTCGGACATATGTGCATTTATGATTATTATTTGATATCGATTTCTAATAAAAAGATAATGGCACACGCTCTCACAGGCTCATTT 
GATATGCATATACCAAGCATGTAGCTGGGCAATGTAATATGGAAATAATTAATAGCAAACTGTTAATTAACATGTTGATCAAACATGACTTAATTATGCCTTTTTATCGCCT 
GTTCCTTTGTATTTGCACACGCAGACACCAGATAACAAAGCAAAGCACACATATGAGCATTCAATGTGGGCTCTATTTTTCCAGCTTAATCCTTAGACAAAGCAAAGCAAAC 
ACATAAACAAACAATAATGCCACCCATTGGAGCACTCTCCCCAATGCGAACAAAAACCGAACGAGACAAAAATAACAATATAACGAAGCGTGAAATGCCATTTAATAAAATA 
TTAGGTTACAATAACAACACCGGGAGCAACAGTGGGCCATTGCCATCTCACAGATAAAAAAAGTTCATCATCCGTGTGAAGATGAGATTCGTTTAATTAACCGCGACTGTCG 
CTTTTGGCCAAGGAAAAGGGGAAACACAACGGCACAAACAAAAAGTTGGTCATGTGTGATGTGGGAATCGCTTTATTGGCACTGGGAAATCAGTGGAAAAAGCCCGCGGGCC 
ATTTATGAGTATCAACAGAAATTGGAACCAATGTCCGACCATAATTACGTGTAGTTGATACTCGCAATGAAAAAATATGATAGATATTCAGTATTAAATAGCACTCAACAAT 
TGTTTGAACTGTACCGGAAAATTGAATATTAGTTAATAATCCCAGTGTTTTCTTGCGAATAAAAACTTAAGGTTAAAGTCTGCAAATACAAATACATTGTATATCCTCAAAA 
TCGCCGGTAAATTCCCCTTTTCATTTTTGATATCACCCGCTGATCTCGAGCAAAACAATTCCGATCTTCGTCGATTATTCCGCAATTAAATTGTTTCTCGCACTTTTCGCAT 
CGATCATAAAAGCCTAATGTCCAAGATGAGCATTTCAAGGTCGCCACCCGCGAAGACATTTATCATACGCAGTGTGCGCCCAAAGTTCGCAAGTGCAAAGTTTCCATAATGG 
CCGCTTGTTACGACTTACGACTGCCCATGATTTACGCGCTTTTGGATCAACAGTTGAATTTGAAATTTTGCAAGCCACACGGAGAGGAGGTGGGCAGATATGGGCAGATACC 
GCCCAGTTGGCAAGTGTTTCTCGCTGGCTATCTGGTCTATAATTAAAAATTTATGAATTCAAAATAGCAAATTAGTAACAAAAACAAATCGGGCGCCGCGCATCGCGAAATT 
TTCAATATAAACACAAAACAGAAAATAAGATAAAATGGGGATACTTTAATAGGGGCATATAGTAAAAACAAATGAGTTGTATCAGGATCAGTCGATTGGACGATCGCTGGTT 
TGTGCGGGGTCTTCGCTTTCTGGATTACGATCTGGCAGAAGGCAGACCATCTATCTAGGTATCTATAGCTACATTAGAACCAGGCAGTCGGCACCATTGACGTGCAGCGAGA 
GGAATCTAACGAGGTGACGATGATGGAAAGCAATAAATTTGCCTGGCCCGTGTTCTTCACAGCAAATTCAGGCACAGCAGGCCCACTTATAGCATTCTACCCACTCCTCCCA 
CTCTTCCACTCCCACTTACCACCACGTCGTGGGTGAAGGCCACTGGCACCGAAACGACCAGCACCACGATCCACAGAGTCACGATGGCAATCAGGGTAATGTTCTCCGTCCT 
CATCATCCGCGAGCGAATGGGATGAACCACCGCCAGGAACCGATCGATGGACATCAGCACCAGCGTGTAAATGGAGGCGAAGGCGGTCACCACAATCAGGTACTGGACACTG 
CGGCACCAGAACCTTCCATATGGCCAGTAGTACACCATGTAATCGGTGGCCGTGAAGGGAATGCAGAGGATTACGAACATCAGATCAGCGGCAGCCAGATTGACAATCATCA 
GGTTGGTGGTGGAGCGCATGTTGTTGTTGAAGACCACCACCAGGATGACCAGCAGGTTGCCGAAGAATCCCGTGATGGCGATGACGCCGAAGAAGAATCCCACTATCCACGG 
CAAAGTGCCATTGATGGCCAGCCACTCTTCGTCGGTGAAGAATGAGGTGATATTCTCCTCATTTCTGGTTGCATTTAGGCTAATATTAGCCAGCATTGTGGTGTTCTCCATG 
TTTTCTCAGCTTTAGTGCACAACTTTCCGGTTCACTTTGTACTGGCTATTGGCGATTTCGCTTGATTGTCACGGGACATTTTCAATTTTGGGCTCTTTTTCGCTCACAGTTC 
ATCGAACGCAAATAACAAATTTCCAATTAGGAGAGTTTCACAAAGCAGCCGCCACAAAAATTCAATTATCATGCGGAATACTACGACCGTTGAGCTTCGGAGACCTCAAAAA 
CGTTTCGCATACGCCGTGTGGTATCGAACTCGACCGGAATTGTTCGCAACGTGGCGGGGGCAAAAGCAATTTGGTCCTTCCACGATAACGGTAATTGTAGGTGGGTAGCACA 
GGTAAGACAACAATAAAGGACAGCACACAAGGATCACGGTGAGGCGAAAGGCAAGAGAGTGAGTTTAAAGGTGAGGGCACGTGCAAACGACACCGACACAGCCAAACTGTGC 
CGAGTCACACCCTGACGACAAGGACACCGGGGTGTCCTTAGGTTCGAGGACCACGTTGCTGCACCTACACCATAAATTCAATATGATGTCTTCCTTTCGCTGCGTTTCTGTT 
TACATTTCTCGTTTGCTTTTTTATTGACAAACGAACGGCATAAATTTTGGGGCTATAAGGCTATGGCCAAAAAACCACGTTCCTATTGTTGCTACCGACTGCTGCGCCGCGG 
TACGCTATATATTTTCGATTTTATACTGCTGCTTTTGGCCAATTCGATAGTCCACACTGCTCGCGTCCGTATCCAGCGCCCAACTGATTGCTGGCTGAGCTTGTTGAGGAAT 
ACTTCGATGCCTCGGCGGCGGCAAAACGCCGGCAGCGACGCCAGCCATCCCAGCAAGAATAATGGCAACAATAGCAACAACCAGCGCAGCTGGCATCTGTGCGTGTGATGGT 
TTTGGGTGGCCGCCGGAGCCACCTCTTCCTCCTCATCCTCCTCATCCGATCCCAATCCCCATCAGCTGGGCTGGAATGTTTGCTCTTGTTGCTCCCACCGCTA 
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(SEQ ID NO: 115) 

Exon: 4927.. 4388 
Exon: 1886.. 1608 
Exon: 1S32..1374 
Exon: 1012.. 1001 

Start ATG: 4927 (Reverse strand: CAT) 
Transcript No. : CT28187 

ATGGAGAACACCACAATGCTGGCTAATATTAGCCTAAATGCAACCAGAAATGAGGAGAATATCACCTCATTCTTCACCGACGAAGAGTGGCTGGCCATCAATGGCACTTTGC 
CGTGGATAGTGGGATTCTTCTTCGGCGTCATCGCCATCACGGGATTCTTCGGCAACCTGCTGGTCATCCTGGTGGTGGTCTTCAACAACAACATGCGCTCCACCACCAACCT 
GATGATTGTCAATCTGGCTGCCGCTGATCTGATGTTCGTAATCCTCTGCATTCCCTTCACGGCCACCGATTACATGGTGTACTACTGGCCATATGGAAGGTTCTGGTGCCGC 
AGTGTCCAGTACCTGATTGTGGTGACCGCCTTCGCCTCCATTTACACGCTGGTGCTGATGTCCATCGATCGGTTCCTGGCGGTGGTTCATCCCATTCGCTCGCGGATGATGA 
GGACGGAGAACATTACCCTGATTGCCATCGTGACTCTGTGGATCGTGGTGCTGGTCGTTTCGGTGCCAGTGGCCTTCACCCACGACGTGGTGGTGGATTACGATGCAAAGAA 
GAACATCACCTACGGCATGTGCACCTTCACGACGAACGACTTCCTTGGTCCGCGCACCTACCAGGTCACCTTCTTCATCAGCTCCTACCTGCTGCCCCTGATGATCATCAGC 
GGTCTCTACATGCGCATGATCATGCGGCTCTGGCGCCAGGGAACCGGCGTCCGCATGTCCAAGGAGTCGCAGCGCGGTCGCAAGCGGGTCACCCGACTCGTCGTCGTGGTGG 
TCATCGCCTTCGCCTCGCTCTGGCTGCCTGTCCAGCTCATCCTGCTGCTCAAGTCACTGGATGTCATCGAGACGAACACCCTCACCAAGCTAGTCATCCAGGTCACCGCCCA 
GACTCTGGCCTACAGCAGCTCGTGTATCAATCCGCTGCTCTACGCCTTCCTCTCCGAGAATTTCCGGAAGGCCTTCTACAAGGCGAATTTCTAA 
(SEQ ID NO: 116) 

Start ATG: 1 (Reverse strand: CAT) 

MENTTMLANISLNATRNEENITSFFTDEEWLAINGTLPWIVGFFFGVI AITGFFGNLLVILVVVFNNNMRSTTNLMI VNLAAADLMFVILCIPFTATDYMVYYWPYGRFWCR 
SVQYLIVVTAEASIYTLVLMSIDRFLAVVHPIRSRMMRTENITLIAIVTLWIWLVVSVPVAFTHDVWDYDAKKNITYGMCTFTTNDFLGPRTYQVTFFISSYLLPLMIIS 
GLYMRMIMRLWRQGTGVRMSKESQRGRKRVTRLVVVVVIAFASLWLPVQLILLLKSLDVIETNTLTKLVIQVTAQTLAYSSSCINPLLYAFLSENFRKAFYKANF* 
(SEQ ID NO: 117) 

Name: Allatostatin receptor type 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000012806866 

ATCTTTTAAGTAAAAAGCGCTCAAGTCAAAACTTTTACTTTCATTTAAAAGTAAATTTTCTTGATGTTTCAAATTTATAGCAAAATGCCATAATTTCAGTTCGTTTGAAACA 
AACGAAATGAAGCGGAAATTGCGAACCGAATTTATTCGTACAATTTATGGGCCACACATGAACAATAATAATTCATTCGGCTGCATGTAATTAAGAATAATAAAAATACTGG 
TATATTTTGTTTGTTTGCTTAATGACTCGAAATGTTTATTCGTTTTCGTTTGGTAATTAAATACAATAATAATTACATTAATAAGCCTTTCTGTTTTTTAATGACAAATATT 
GTACATTACATTTGCTTACATCCAGCTGTGTGCTTGTGTAGGATTTAATACATAATCGGATTGTTGTTAAAATACTATTTAAATAATTTAATTTAATGCTACTTAATTGATT 
TCAGGTGCTAATTTAATTTGATTCATTTTTTGTTCAGCTAATAATACATTTGATTTGGTGAGTTTTGGTTTGAGTTTGATAAGGTCATTAACGAATCGATTATTTACAACGT 
TTGGACTTCGTGCTTTAAGGATCGCAATTTGAAGATACCATTATTATAATACTGTACACGGCTCACGACCAGCATTCACACTCATTCAATATATATCCTTCTGCGAAACAAT 
TATCAGCAATGGGACTTGTAATCACAATTCATTAAGACGAAATCACAGAGGCAAATGGTCATGTTCCAATGACACATTTAGTTGCGGGATATCTAAACACTGTAAAGGGGTT 
TTACTAAAGCCTTACATACAATTACAATAAAATTTAAACATATTAGTGTAGTAGCTATACCCTAATAATAGTCATTCACTAAAACATTAACACACATACGTTAAATGAATGA 
GCTATGACAGCCAATTTACTGGAATACACATCATATGTGAATGCCGACTGGCTGCGGAATGTGTCATTGGAACACGGCTATAACTTAGATCGCTCTAGTTTAAATTAAAGTG 
GTTGCCACAAGGACTTGCCCGCATCCTTGTCCTTCGCCTGCGAGCCGGGCAGGTGGACTCGCGACACCTCGCTGAGTATGCAGGTGCCATCGCATTCGGTTTGGCTGCCAAA 
TTCCTGTTCGCAAATGCAGCTGGAGCTCTCCCTGCTTAGCAGGGCCAACTGGACTCCGGCCCCGGATGCCAACTCCTCGCTGGGGGGCGTGGTTGATGGCAGCTCCATGAGT 
GCCACGTCGTCCTCGCATTCGATTACGTTCGGATGCAGAGGTTTGAAGTTGTTTCGCCGCCACGGCGGCATTGCAACGCCAACACCTGGTGCCGAAACACCCGGAGTGGCCG 
CAAAGGTAACCACGCTTTGATGCTGATGATGTTGGCCATTGACATTGTTGTTCTGCCCACTACTTCCGTTGTAAATGCCGCTGTTAGCACCGGATCCATGCACCCGCGGCAT 
ATGCAAGCCCGGCTTCCCATTGTTGACACCACCACGCGCCGGACCAAAGAGATTACTCCGGATTCGCATCGAGGAGTTGGCGTAGGTTGACCTTATGGAGCTGGCGCGGGTG 
TGCATCGAAAAGGTCCTTTCGTGGGCGTCCATGCTCGTCTTGAACTTGCAGAAACAGGCCGCAAATCGCTTGTTGAATTCCCGCTTAAATTTTTCCTGTGCAGAGAAAAATA 
GACGACGGTTTAGGTTAATCCATCCCAATAACATATCACTAAAGTAAGCGAAAGTACTTTTATTCACTTAAATACAAATTAAAGTGCATTGCATAAATCATAAACCTCTTAG 
CAAAGTTTATCATTTGCAAAAATACATTTTTTCGCTATCAAACTTTCTGGTTAATTATTATGTGAATTGGCAGAAAATTTTTCGTGAAAAAGCGAAAACTGCCCCGGCTACG 
GCATCTTTCTGGCTAATGAAAGTGCTCTAGGCAAACCTCCATCTTCGCTGACAAATCCCATTTGTCTTCGCTGACAGTAAAATAGAATATCTGTTGACAAAGTTTTTCACGC 
ACAGCCAGCAGGGGAAAAAGTTTACGAGTTTATTCGGAGATCTGGCTCTCAATCTATCTTTTACAGACAATTTGAGTGTCAAAGCAAACAGCCGACGGGTTTTTTCCCGCTT 
TTATCGGTTTGCCACTTTGGCATTCTGAGGATTTTTGTGGATTTCGCTTCAGTTTTGCGGTTTCACAATTTCTGCGGCCCAGACGGTAGGTGTTTGCTTATTGGCTACAGTG 
GAGCGCTGCTTTAATTAACTTTGACAAAGTATTTTTTTGTGTCAGCTATTTTAGGTGCGAGGCGAAAAGAAATTGGAAATTGTTTCAAAGTAAGGCAGAAGTTATTTTTGAT 
TAAAAAAAAAGTGACGCGTATGCTTATAAATCAGCGAGAAGGAAAATGGGAAAACATCAACAAAGCAATCGATTCGCTTACAATGAAATTAAGTTAGACATATTTCATTCTT 
TTTTAAGCTTCTTTGATTTTTGAGACCATGAAAACTTCCGTAAAAGTTTCATTTGCTTTGCGTCTATGTTTGTGTTTCATTTGATGAGATAATAGAAAAAGCTAATTTTGGA 
AGCACGTGCAAAGGCTTTTTTTCTTTTAGGTCTAAAAGTACGAAAATAATATAGAAAAAATTACTTTTTTTACTTTAACTTCATTATTAAAAGCAAGACGTTAAGTATGTAG 
ATATTTTGACGAGACATCCAATTGTACGCAAGGCAAGTTTTCCCAAGATCCCCTGTCAATAACTGACTCACCCAAACTTAAGTTCCCCAAAACTAACCCAAGGCGACCTTGA 
GTCCCTCATCAATCCCCACATTTCCCCGGCAAATTTACCACACATTTATAGACTTATGTCTTTTTAGTGCGTGTGCAAAGAACGTGTCCCTGAGCACGAACAAACATCCAAA 
GGATGTCAATCAAATCCCAGCCAGACAAATGCAAATTTAACACTCCGTAAACAAATCATCATCATCATTGGAGCCCACACTTACATTGTAGATGCCATAAATAAAGGGATTG 
TAGCAGCTATTGCTCATGGCCAGCCAATCGCAGCAAAACCAGACGATGCTAATGAAGTGGTAGTCGTTGATTTCCGGTATCGTGACATACAGAATATTATAGAGCTGCAGTG 
GCAGCCAGCAGAGTCCAAAGATAATGACCACGATAATCAGCATTTTGATGACCTGCAAGCGTCGCAGAAAAAGGAAATTAAACCCAGTCCCGATTTGCATACCACATCAACG 
TGCAAAACTCACCTTCTTCTTGTTTTTCAACAGCGTTATGTCCCGTGAATCCTGTGCGTTACCAGGAGCACGTGTGCCCCACAATCGTACCGCCATCTGGATGTAGACAAAG 
CTGATGACACAGAATGGAACCAGATACTGCACAAAAACCAGGGTGTAGCGAAAGGATTGCAATTGATCATCGGATAGGTTCTTGTTCATGCAGAATGGCCGCGTCACATTGT 
AGGTCTCATTGTTCTCTGTTCAAAGGATGAAATTCAGTTAAGGTCGCTGGCACTTAATCCATGCAAGTTGACGTATTGACTTACCGCGAAATCTTTCGGTCAACTCCTCCAC 
ACGAAAGGCAATGGCAAAGGGCACCGCAAATAGCAGGGCCAGCATCCAAATTCCACCAATTATGAACTTCGATACGAACTTGGTGGGACGTGCCCTGCATGCGTAAGCCGAT 
TTTTAATGATTATGATTGACGAATCGAATCGTTTCGAATGTGGGAGCCCACTCTCCCCCCCAGCGACTCACCTAAGTGGATTAATGATGGCCCTATGCCGATCGATTGCAAT 
GGCGGTCAGCGTGAATACCGAGACATTTACACTCAGGGCCTGGACGAAGGGGCAGAAGCTGCACATGAACCACGGCAGGTTCCAACTCTGCAGCAGGGCAGCCTGGAACTAC 
ACGGAATGAAAATATTGCAATTATTAGTTTAAAGTATTGCAGATTATCAGAAACAACTTGACTATACAACTATGCCTAACCTTTAGCCTGAGCATGATGTTTACAAATCACA 
AATAATCCCTTTAAAATGTTCACTATATATATATTCATTTTAATTAAAGTGTAAGGAAATAGGATAGCTACAAGTGAGTCGTCCGCCTGAGGTTTGATATGATTTAATCAAC 
CCAAAATTGCGACTCCATTCAAGTCACACGGCAAACACCATTAATGAACAGTGGAACATGTCGGGGGTGTTGTATTTTCGGCCAGTTTCAACATTTCGTCGACAAATAAACG 
CTCGAAAGCCCAGCGGTGTTGTTTTGGACACTGATTTGCGCAGGAGTGAGTGAAAAAGTGTGTGTGTGATTACTGCGGATATTTGAAGACCGCGCTTTCTCCGTTCTCCACA 
CAAAGTCGGCCGCAAATTGCAATTTCTCGCCGGCTTATTGATGACACTTACTGGCGGCTGTGACAGGACCAAAAATCTATTTGATTGCGGCATTAATATAAATTAAGTGCCC 
AAAATGTGGCTGGCTTTGGGTCAAGTTCAAGGTTCAACGAAGTGTACAGATTGAGCTTTATTAGTATGAATTCTGAGGTAAAAATCAATTTACTATTTTTATTTGAAACATA 
GAAAGATGTAAACAATTTGTACTACCAATTAATAATTATTTCGACTATTCAGTTCGATGGTTACATTCAATATTTTACGTCTAAAACAAGCTAAAATGCAATTGCGGCATTC 
TGCGAAACGAGCTGAATACAAAGTAACGATTCAATTACTTTATTCAAATGTCCATTGCTCATTGATTGACCATTTTCCGACTAAACAGTATTTTCAAAGCAAACTGCGTTCG 
CGTTTTGCATTCGACGACACCAGGGAATCAATTGAAGATAATGGCAATTCAATTACTCGGTATCACATACAAATTCCAAAGAAAGCCCAGAAAACTGTTCACATAGCTTTTA 
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GCftCTCTATTTTGGGTACTTTATAGAATACAAATCCCATTAACACATARACCGfiAAATCACATTTAAATTTGflTGTAGGTAATATTCCATTACCACAACTTTTTTCGACAAG 
ATGCGCGTAGGCCATGGCAATTTTCTAATGCCAAACATGAGCGCTCGTTGCCCGGAAATTAAATTGATGCCGGATTATGCTGAAATTGGTGTCCCTGCGCACTGTGGGTTTC 
AAAAACTGGGCTGGCTACCTTTCTATGGTGAAAATAAGCGAAAACTAATGCTAATGTTGTTGGCTTGTGTGATTTATGCGTTAAAGTCGATATCCATTTACTAGCGCAACAG 
TGCGTATGGGCAATTTTGCATGTATGCATTTCATATCCGCGAAAAAAAAAATCTCGGTAATTACCCGCATTATAATTAAGCTTAACTGTTGCAACTCACCTGAAATGGTATG 
CAGAAGAGGCCAATAATCACATCGGCAAAAGCCAAATTAGCGATATACATATTTGTCACGGTCCGCATTTGCCTGGTCGTGGCCACCACCCAGATGACCAAAGTGTTGCCAA 
TGACGGCCACGATACTGATTCCCCCATAGAAAATGGACAACAGGGCCACAATCTCAGCGGGAGCCGCGTATAGACGCTCTGTGAATGAATTGAAAAATAGTTGTGCATTAGA 
TTTATATCACTCGGAAAAAGTGCTTGCAACATTATTATAAAACTAAATGAGATATAACAATCTGTTTTACACAAAAATTTAAGAATGTATCCAATACATAATGCGCTCCTAC 
TAAGTTACTGAAATTTTAGTTATTTATTACAAGACCATGCTTTTATGAGTGAATAAATTTTGTGTGTGGTGCTTAGCTTCTTCCAATCCATTAGTTTCAGTGTTGAACTTGT 
GAAAAATCGCGCTTGCTGACAGAAGCAGATTAATTCAAGTGTCAGTTGTCAGTCAACTGATGGCAGGACAGACGACATTCACGTTCCTGCTGACTTGTAAACGTCAAGGAAA 
TTGAACCGTATTTTATAGCCAAAGAGGGAATTTTATAAATGCCACTTTGGACAGTATATACCTGTATATATATATACCTATATCCAATTTAATATAAAATATTGATAGTTAA 
ACCCAACTTTGATATGAATTTCATTAGCATATCCGAACATCACCTGTTTCCTCATATCAAAGTCAATGAGATGGTTGAGCCCAAGTTTGGTCTTTAAATTCGGCACCCCCCA 
CACTCATGAATGATTCAAGATCAAAGAACTAACGGCGAACTGTCAGAAGAAAAAAACACTTGTCAATGTTTTTGATTGATGATACGTCAAGTCAATTTAATCCAGTTGGATG 
AGTCGCGTCCCCAGTTGCTTGAAGGGATTAGAGAGTATACTTACCAAATTCTGCTTCCTCCTCGGCTCCGGGCAGGAATTCCAGGCGGGACTCCTGCTCGATTAAGTCCATT 
GCCATAACACTTTTTCGTTTTTTAGTTTTAGTTGTGTATCCTAATGAGGATATGATATCCTTCAAACCTTGTATTTTAGTTCGAATGCTGCTGCTCTATGTAGTGCAATACA 
TCTTCAGGGGCTTCTGAGCTTCATTGCAATCTAAACTGGAAAGATATAAATTACAGCATTTGTTCATTTGTTTTCTGAAAATTCGCAAAAAAGTTATCTGCCACAAGTCCAG 
TGGTCAGGGAACAACAAAAGTGCAGACAACAAGAGAAACAAATGATCTGAAAAAAAATCATTGAAAAGCTATACAAGCAATATCAGTCTTGCCCTCAAGGAACTGACCAAAT 
TCAGTGGGCGGACATGAGTTATGGCTCAGGTGGAATCACCTGTCAGCTTTCACAAGTGAAAAGTTTGTTCTGTATAATTGAAAGTGCACCCAAATTCAGTTTGGCGACTGAG 
GGCACTGCACTAGAATTTCGCTTGTATTTCGTTTCTTATGAAAACAGTTCATGTTTTCATGTTTTTATTGAAGAGTCATGCCAACGCTCAAATGTTGCCTTAGCCTTATTGA 
GGTGGCTCCTAAAAAATAAACAAGAAAGTCAGCAAGTTCATTGAATGTTTGAGAATAGAAGCCTCACTCAGGTTAATTGGCCATTTATCGAACTGTTAACATTGTTGGGACT 
TGTTAATTTATCTTCTTCAGTGTAAACACTGTCAAGACTTGTTAATTTATTGGGTAACTAAATGTTTATTCTATAGGCAGACGGAATTGTTATAGATTGACTTAGGGATGCT 
TGTAGGAAATAGGCTGGGAAATTGGCGAATGTCGTGAAAAAAAGATGTATTTAGTTTTGAATGACTGTACTATTTCCCCCAGAAAGAAACAGGTTCACAGGCTTTAAGTTTA 
TATGGAAATGCAAAAATTTGTTTCGAGCATTAAATTACTCATTTTATTATGCTGGTCACAAGCTAAAAGAACTAAAAGGAGCCACCCTAACGCATGACCCCAA 
(SEQ ID NO: 118) 

Exon: 6383.. 6317 
Exon: 5566.-5364 
Exon: 3916.. 3768 
Exon: 3678.. 3557 
Exon: 3487.. 3261 
Exon: 3188.. 2997 
Exon: 1663.. 1001 

Start ATG: 6383 (Reverse strand: CAT) 
Transcript No. : CT29768 

ATGGACTTAATCGAGCAGGAGTCCCGCCTGGAATTCCTGCCCGGAGCCGAGGAGGAAGCAGAATTTGAGCGTCTATACGCGGCTCCCGCTGAGATTGTGGCCCTGTTGTCCA 

TTTTCTATGGGGGAATCAGTATCGTGGCCGTCATTGGCAACACTTTGGTCATCTGGGTGGTGGCCACGACCAGGCAAATGCGGACCGTGACAAATATGTATATCGCTAATTT 

GGCTTTTGCCGATGTGATTATTGGCCTCTTCTGCATACCATTTCAGTTCCAGGCTGCCCTGCTGCAGAGTTGGAACCTGCCGTGGTTCATGTGCAGCTTCTGCCCCTTCGTC 

CAGGCCCTGAGTGTAAATGTCTCGGTATTCACGCTGACCGCCATTGCAATCGATCGGCATAGGGCCATCATTAATCCACTTAGGGCACGTCCCACCAAGTTCGTATCGAAGT 

TCATAATTGGTGGAATTTGGATGCTGGCCCTGCTATTTGCGGTGCCCTTTGCCATTGCCTTTCGTGTGGAGGAGTTGACCGAAAGATTTCGCGAGAACAATGAGACCTACAA 

TGTGACGCGGCCATTCTGCATGAACAAGAACCTATCCGATGATCAATTGCAATCCTTTCGCTACACCCTGGTTTTTGTGCAGTATCTGGTTCCATTCTGTGTCATCAGCTTT 

GTCTACATCCAGATGGCGGTACGATTGTGGGGCACACGTGCTCCTGGTAACGCACAGGATTCACGGGACATAACGCTGTTGAAAAACAAGAAGAAGGTCATCAAAATGCTGA 

TTATCGTGGTCATTATCTTTGGACTCTGCTGGCTGCCACTGCAGCTCTATAATATTCTGTATGTCACGATACCGGAAATCAACGACTACCACTTCATTAGCATCGTCTGGTT 

TTGCTGCGATTGGCTGGCCATGAGCAATAGCTGCTACAATCCCTTTATTTATGGCATCTACAATGAAAAATTTAAGCGGGAATTCAACAAGCGATTTGCGGCCTGTTTCTGC 

AAGTTCAAGACGAGCATGGACGCCCACGAAAGGACCTTTTCGATGCACACCCGCGCCAGCTCCATAAGGTCAACCTACGCCAACTCCTCGATGCGAATCCGGAGTAATCTCT 

TTGGTCCGGCGCGTGGTGGTGTCAACAATGGGAAGCCGGGCTTGCATATGCCGCGGGTGCATGGATCCGGTGCTAACAGCGGCATTTACAACGGAAGTAGTGGGCAGAACAA 

CAATGTCAATGGCCAACATCATCAGCATCAAAGCGTGGTTACCTTTGCGGCCACTCCGGGTGTTTCGGCACCAGGTGTTGGCGTTGCAATGCCGCCGTGGCGGCGAAACAAC 

TTCAAACCTCTGCATCCGAACGTAATCGAATGCGAGGACGACGTGGCACTCATGGAGCTGCCATCAACCACGCCCCCCAGCGAGGAGTTGGCATCCGGGGCCGGAGTCCAGT 

TGGCCCTGCTAAGCAGGGAGAGCTCCAGCTGCATTTGCGAACAGGAATTTGGCAGCCAAACCGAATGCGATGGCACCTGCATACTCAGCGAGGTGTCGCGAGTCCACCTGCC 

CGGCTCGCAGGCGAAGGACAAGGATGCGGGCAAGTCCTTGTGGCAACCACTTTAA 

(SEQ ID NO: 119) 

Start ATG: 1 (Reverse strand: CAT) 

HDLIEQESRLEFLPGAEEEAEFERLYAAPAEIVALLSIFYGGISIVAVIGNTLVIWVVATTRQMRTVTNMYIANLAFADVIIGLFCIPFQFQAALLQSWNLPHFMCSFCPFV 

QALSVNVSVFTLTAIAIDRHRAIINPLRARPTKFVSKFIIGGIWMLALLFAVPFAIAFRVEELTERFRENNETYNVTRPFCMNKNLSDDQLQSFRYTLVFVQYLVPFCVISF 

VYIQMAVRLWGTRAPGNAQDSRDITLLKNKKKVIKMLIIVVIIFGLCWLPLQLYNILYVTIPEINDYHFISIVWFCCDWLAMSNSCYNPFIYGIYNEKFKREFNKRFAACFC 

KFKTSMDAHERTFSMHTRASSIRSTYANSSMRIRSNLFGPARGGVNNGKPGLHNPRVHGSGANSGIYNGSSGQNNNVNGQHHQHQSVVTFAATPGVSAPGVGVAMPPKRRNN 

FKPLHPNVIECEDDVALMELPSTTPPSEELASGAGVQLALLSRESSSCICEQEFGSQTECDGTCILSEVSRVHLPGSQAKDKDAGKSLWQPL* 

(SEQ ID NO: 120) 

Name: Tachykinin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384568 

AGTGGAAATTTCGAAATTGAAATGGACCCTTTCCGCCGAAAAATCGATTGATTTTCAATTTCCCCTCTGCGCTTTCTTGTTTTGCTAAGTTTGAAAGTTATTTGCTTTAACT 
CATGCTGAAAGAAAATATCAGTTGACTAGCCGCTTTTTAACCTTTTATTCGAATTTTATCCTTAGTTGTTGCACAAAAGAGTCTATGTGCTTAGTTGATGATATCAGTCGAT 
ATCTCTTTAATTGGCCAATAACTGCTGAATATGTGGCAGCACAATATAGTTACGAAAGTAATCGGCTTAAGAGCCCACACTGCGATGTGCATTTTAGTGCATTAAAAGGACT 
TGCAATGCACTACTGTCTAATCCAGATTACGACGACAGCCAATGGCTTTAAGCGCAAGTATTTTAAAACTGAATTGTGAGCCGTGATTCCAGCTGGAGTTAAATAGGAAAAC 
TGTTGGGCTTAAAGCAGCTGTAGTGGCAAATACCAATCGAGCATGGCTTCAAATCACGTTCATTTCGTTCCACATGACATGACATCATGGCAATGGTATCAAAGTGCACACT 
TGTAGACACAGAAGCCTCGGGTTTGCTTAAATTTTTATGGCCTCAACACTTACCAGGTGCACAGCTTTTCTTGCCAGCTACTGGATGTTCAGTAGCAGGGAAATCGACAGCT 
GGAAGCTCGAGTATCAAGAAAATTGAGAATGAATGGAAAATGGGTTACCACTCTCTTTGTTGCAGACAAATTTCTCCTTTTAATTGAAGGCAAAAAGGGATATATATTTTTA 
AATAGTTGGCTTCTGACAGAGACTTACATGCGAAAATAAAAAATTTAGTATTAATTAAATCGACTCGCTGGGAAGTATTGTCTTATTTTATATTAGAATTTTTAAATTTTAA 
TTTAGGGACCTATACATAATGGCAATCAATTAGCATAAAATATAACGTTATTAATACATTGGTCCTGGACCTTTGTTCGTCGGGGAATACGTATTTTAGTATATCTACTGAT 
GTGATCTGCTGTTGTAGTTGGTGGATGAGGTGGTCATCGACTGGAGCTCAGTGGTGCGCTTTTGGACATCCAGCTCCATCTTGGTGGGGCAAATCAGACCCACCCACAGCCG 
ACGGTTCGTGGAGTCCAAGAGGACAAAGTAAATGAACATCACAAATCCCTGCATAGTGGCCGTGATGCAGAAGAGGTATGAAAAAGCCACACCTGCCTGCATGAAGGCGAAT 
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ATTCCAAAGATCCAGGTAAGACCCAGCAAARAGAAGAGCATTATGGAGflGTCGGATCTGCTTAACCRCCATTTTCTTTTCGTTTTTATGGATGCTCTGGCTCAAAGAGTGCG 
AGATGCTGTAGAAAACATACACGAATATCACCAAGTTGCACACAGTAATCAAGGTCACAGGGAGCACCACCCCAAAAATGAGACCATAGCCCGATGGATAGCAGATGCCAGT 
GTCCGTGGAAAGTTGGGCAGCCGAGGGGACGTACGAATCGGGATCTATAAGGGCCACCAGCAGGGTGGGGACCAAGGGCAGTAGCCAGGCCACAATGGCGGCCTTCAGGATA 
TAGCGAGGTGGTCTCTCGATTCCGATAACTGTGACGTAACGCTGGAACTGCAGGAAGGCGATGATTAGCATCCAACTGAAGAGAACCAGAATGGAGTACTGCATGGCAGCGC 
CCAGAGCCACACAGCGAACTGTGTTTCCATTCACAACAAGCGCTTCGGATACATCATCTGTATTGAGAAACACGAAGAGCATCATCTGGAGACACATAGCCAGGCAAAGGTG 
CAGTAAGACCTTCGTGGAGGCCTGACTGCGCCAGCTTTTGAAGAGTGCGGCAGTGAGGAATATTCCCAGTATACCCAAAAGGGAAAGACTGCAGCCCACTATCGAGATAATG 
TCCAGCACTTTCTCATTAATCGGAGTAATTAGAATCTCCTCGCCAAGGTCGTTGGCTCGATAGCTTCCACCCACCAGGAATGCAAACTGTGTGAGGTGGTTAGTGTGGCACT 
CGATGATGGCATCCTTCAAGAGGTCCGAACTGCTTTCAGTGCTTACTCCCTCAGTTGACCAGGTTTCATAGTTCCAATATCCGCATCCACTCCCAATGCTAAAGGCCTTTGA 
ATCCGGATTGCGAAGATTTTCATTACGCAGAAGAAAGGGCAGTGGCAGAGAGAGATAATTGCCTATTTAATGGAAGGGTATTCAAATTAGTTCGAGTATTCGAAAACAAATT 
TTCATGCGTTTCTTACTTCTCAGTCCTGGTATTGAAATGGAGATTACTTTGCTGCGCGGCTTACGAGTGCGTTGCAATGAGGTTTCCACGAACAGAGCATCGTGGGCATAGA 
CTTTAAAGATGAGATAGGTTGCTCCCCTGCTTTTCACTTGCCGCCAGAGATTCTCCGGCAAAAATGCAGCTGTCTCCAGATTAGATTCCTCCTTGACCTTAGCCAAGTCTTC 
GGAGAATCGAATGAAGCGGTACCAGAAGCCACTGGCGGAAGTCTTTCGTTGATCCCCTGGCGCAGAGAATATAGCTAATCCAGTTATGCGATCGCATTCGGGGTTGGCGAAG 
AACACGCTGATATTGCCGGTAATCAGAGCTTGGACACCGATATCTGAGAAGTTGTACGTTTCCACTCCAGTAGTGGCCGTTTCTGCTTCATCACTGGTCGGCTTGGCCACCA 
CCTTGCCGCATCTGTCCTTGGGAACCAGTTGCTCCGGAAGAGCATCCATGTAGCTTTCGAACTGGCTGAGCAGGGAATTGGTAGCGTTGAGCTGAGCTGATAATCTAAGAAC 
CTTACTGTCGCAGGACATTATCTCCTTGCAGACACTGACCAAGTCCACACTGACCGTGGCATCCTTGTCCTGCTGCATCAGGGCGCCGAACATTTGTCCGGTCATGTGAACA 
TCGGCGGGTAGAAGATTCCCACCTCTCTGTCGGAACATGTTCCTCATTATGCCAGTCATCTCACGACGCCCCTGGGTGTCATTGGTGCGCCTCCTGCCCTCGAGAACGTCGT 
CATGCAGGCTATTCAAGTCACCGCTGATCGTGTGCTCCCTGAATCGCCTCAAACACACCACGGGATCCCAGTGTTCGGTGGACTCCCAGTTGGCTCGGTTGTCCTTCACTCT 
GCACTCCCTGGTGACCGGCATTCCGTTTCGCAGAAGGCAAACATCGTGCAGGGTGGCACGTTCGCCGATTCTAGCCCGTTTCCACTTGTTCTGGTGAGGTGTTTGATCGTGG 
TTACGCGTTATGTAGTCGTGCTGAAACTCCTCCGGCTCGCAGAACATGGACGTACAGTAAACCAGATGTGTTTGCATAAGTTCGTCCGATGAATTACGTTCCTCTCCATCAC 
AGTACTGAAACGTTCCGTTTCTCAAGCACATTTGATTTTCCTTTGGGCAGGCGGAATAAATCGGCGAAACCAAGGCCAAGACCACAAAGAGCCCAGCACACCAGTTGAAATT 
CATGGCGATGAATCAGGTGCCACAGCTGTTTGCAGCTCTCCGAAAATGTTTACAAATTCTTTGGCAGATAGTACTTTTCACGTCCGACGTTCTTGAAATATCGCGGCACACT 
GAGCTGTCTGTCGATCGATTAGATTTCTGTAGACCCCTTCTTCATGGTGTTTGCTATCAGTTGCCACCACGAGCGCTAATCGACCCCCGTTTTTTGTATATCTACACACTAC 
TCACAAAAAGACCTCAAGTCTGCGGCATTTGATACAATTTACATTGTGATTAACACGACCACGGCAATTGAAGGCGAATCCGACGGAACCCTTTGAGGCAGGGAAGTAACAC 
CAAACTTGTGACGTCGTCTTTATTTGTGCAAATAGCGATTTCACGCCTTTTTACCGGTCTTGGTACTTAAAATGCTCTGGCTGATTCATTTTTTACGCGCTTGAATTGCCAG 
ATAAGAAGTTCTTACCAGGAAACACGGCTGTACATGGGCTATTTATACATCAATGATACAAAAATAATGTTAAAAAAAATTTAAAAACTTTAAAAACTAGTTTTATTTGCAT 
CCGTTCAAGAAAGAGATTTTTTTTAGATTTGTTTGATCAATTCATTGCTATGGCTTCTAATAAATGTGAAATGTGAAACACCTTCACATAAAAAAGTATTTACGCAATTCAC 
TCAAGCAAAACAGATCCTTTGTCCTTGAAAAAAGTTATGAAATGCAAAGCACTTCACCTTGGTTGCATAAAGTGCGATTTTCATTTTGACAGGCTACGCGTTTTCAAAGACT 
CAGCCTTCGATTATAATCCACCTGTACTCACTTCATTGTAAGCCAGTTTTTGGGCCAAAAAGTAATTTTTACAGCCGACTTCGCAGCACGTAGACTGCGTGTGCTAGCGGAA 
ATTGTTAGCTTGTTGACACGTTT 
(SEQ ID NO: 121) 

Exon: 3391.. 2251 
Exon: 2190.. 1001 

Start ATG: 3391 (Reverse strand: CAT) 
Transcript No. : CT31591 

ATGTGCTTGAGAAACGGAACGTTTCAGTACTGTGATGGAGAGGAACGTAATTCATCGGACGAACTTATGCAAACACATCTGGTTTACTGTACGTCCATGTTCTGCGAGCCGG 
AGGAGTTTCAGCACGACTACATAACGCGTAACCACGATCAAACACCTCACCAGAACAAGTGGAAACGGGCTAGAATCGGCGAACGTGCCACCCTGCACGATGTTTGCCTTCT 
GCGAAACGGAATGCCGGTCACCAGGGAGTGCAGAGTGAAGGACAACCGAGCCAACTGGGAGTCCACCGAACACTGGGATCCCGTGGTGTGTTTGAGGCGATTCAGGGAGCAC 
ACGATCAGCGGTGACTTGAATAGCCTGCATGACGACGTTCTCGAGGGCAGGAGGCGCACCAATGACACCCAGGGGCGTCGTGAGATGACTGGCATAATGAGGAACATGTTCC 
GACAGAGAGGTGGGAATCTTCTACCCGCCGATGTTCACATGACCGGACAAATGTTCGGCGCCCTGATGCAGCAGGACAAGGATGCCACGGTCAGTGTGGACTTGGTCAGTGT 
CTGCAAGGAGATAATGTCCTGCGACAGTAAGGTTCTTAGATTATCAGCTCAGCTCAACGCTACCAATTCCCTGCTCAGCCAGTTCGAAAGCTACATGGATGCTCTTCCGGAG 
CAACTGGTTCCCAAGGACAGATGCGGCAAGGTGGTGGCCAAGCCGACCAGTGATGAAGCAGAAACGGCCACTACTGGAGTGGAAACGTACAACTTCTCAGATATCGGTGTCC 
AAGCTCTGATTACCGGCAATATCAGCGTGTTCTTCGCCAACCCCGAATGCGATCGCATAACTGGATTAGCTATATTCTCTGCGCCAGGGGATCAACGAAAGACTTCCGCCAG 
TGGCTTCTGGTACCGCTTCATTCGATTCTCCGAAGACTTGGCTAAGGTCAAGGAGGAATCTAATCTGGAGACAGCTGCATTTTTGCCGGAGAATCTCTGGCGGCAAGTGAAA 
AGCAGGGGAGCAACCTATCTCATCTTTAAAGTCTATGCCCACGATGCTCTGTTCGTGGAAACCTCATTGCAACGCACTCGTAAGCCGCGCAGCAAAGTAATCTCCATTTCAA 
TACCAGGACTGAGAAGCAATTATCTCTCTCTGCCACTGCCCTTTCTTCTGCGTAATGAAAATCTTCGCAATCCGGATTCAAAGGCCTTTAGCATTGGGAGTGGATGCGGATA 
TTGGAACTATGAAACCTGGTCAACTGAGGGAGTAAGCACTGAAAGCAGTTCGGACCTCTTGAAGGATGCCATCATCGAGTGCCACACTAACCACCTCACACAGTTTGCATTC 
CTGGTGGGTGGAAGCTATCGAGCCAACGACCTTGGCGAGGAGATTCTAATTACTCCGATTAATGAGAAAGTGCTGGACATTATCTCGATAGTGGGCTGCAGTCTTTCCCTTT 
TGGGTATACTGGGAATATTCCTCACTGCCGCACTCTTCAAAAGCTGGCGCAGTCAGGCCTCCACGAAGGTCTTACTGCACCTTTGCCTGGCTATGTGTCTCCAGATGATGCT 
CTTCGTGTTTCTCAATACAGATGATGTATCCGAAGCGCTTGTTGTGAATGGAAACACAGTTCGCTGTGTGGCTCTGGGCGCTGCCATGCAGTACTCCATTCTGGTTCTCTTC 
AGTTGGATGCTAATCATCGCCTTCCTGCAGTTCCAGCGTTACGTCACAGTTATCGGAATCGAGAGACCACCTCGCTATATCCTGAAGGCCGCCATTGTGGCCTGGCTACTGC 
CCTTGGTCCCCACCCTGCTGGTGGCCCTTATAGATCCCGATTCGTACGTCCCCTCGGCTGCCCAACTTTCCACGGACACTGGCATCTGCTATCCATCGGGCTATGGTCTCAT 
TTTTGGGGTGGTGCTCCCTGTGACCTTGATTACTGTGTGCAACTTGGTGATATTCGTGTATGTTTTCTACAGCATCTCGCACTCTTTGAGCCAGAGCATCCATAAAAACGAA 
AAGAAAATGGTGGTTAAGCAGATCCGACTCTCCATAATGCTCTTCTTTTTGCTGGGTCTTACCTGGATCTTTGGAATATTCGCCTTCATGCAGGCAGGTGTGGCTTTTTCAT 
. ACCTCTTCTGCATCACGGCCACTATGCAGGGATTTGTGATGTTCATTTACTTTGTCCTCTTGGACTCCACGAACCGTCGGCTGTGGGTGGGTCTGATTTGCCCCACCAAGAT 
GGAGCTGGATGTCCAAAAGCGCACCACTGAGCTCCAGTCGATGACCACCTCATCCACCAACTACAACAGCAGATCACATCAGTAG 
(SEQ ID NO: 122) 

Start ATG : 1 (Reverse strand: CAT) 

MCLRNGTFQYCDGEERNSSDELMQTHLVYCTSMFCEPEEFQHDYITRNHDQTPHQNKWKRARIGERATLHDVCLLRNGMPVTRECRVKDNRANWESTEHWDPVVCLRRFREH 

TISGDLNSLHDDVLEGRRRTNDTQGRREMTGIMRNMFRQRGGNLLPADVHMTGQMFGALMQQDKDATVSVDLVSVCKEIMSCDSKVLRLSAQLNATNSLLSQFESYMDALPE 

QLVPKDRCGKVVAKPTSDEAETATTGVETYNFSDIGVQALITGNISVFFANPECDRITGLAIFSAPGDQRKTSASGFWYRFIRFSEDLAKVKEESNLETAAFLPENLWRQVK 

SRGATYLIFKVYAHDALFVETSLQRTRKPRSKVISISIPGLRSNYLSLPLPFLLRNENLRNPDSKAFSIGSGCGYWNYETWSTEGVSTESSSDLLKDAIIECHTNHLTQFAF 

LVGGSYRANDLGEEIL1TPINEKVLDIISIVGCSLSLLGILGI FLTAALFKSWRSQASTKVLLHLCLAMCLQMMLFVFLNTDDVSEALWNGNTVRCVALGAAMQYSILVLF 

SWMLIIAFLQFQRYVTVIGIERPPRYILKAAIVAWLLPLVPTLLVALIDPDSYVPSAAQLSTDTGICYPSGYGLIFGVVLPVTLITVCNLVIFVYVFYSISHSLSQSIHKNE 

KKMWKQIRLSIMLFFLLGLTWIFGIFAFMQAGVAFSYLFCITATMQGFVMFIYFVLLDSTNRRLWVGLICPTKMELDVQKRTTELQSMTTSSTNYNSRSHQ* 

(SEQ ID NO: 123) 

Name: HE6 Receptor-like 

Classification : G_protein_linked_receptor 
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GTGTTAflTCCCCTTGGCTTATCGGTTTflATCTATTTATJ^AAflTCAATTCAAACATAAAGCGGCGCATATCTCAGCGATGTTGGGTAGGTAACCAAAAGTAAATACACTCTAA 
TTCCTATTGATCTTGTATAAATACGAGTGGATAGTTAGTTAGATATCACCATTTTGCACCGTGCATAAACATAAAGTGAGATACTCGTTTTCGAGAGATGCACTTCTGAGTA 
CCCGTATTCAAATCGTACTCTCAGTAGCCAAGTACTCAGTACTCAAGCTGTTTGTCTTACGGGTGTGGACGTGCGAGCAGACACTTTAGGAGTGGGATAGCCGGAGCAGACC 
GGCTGAAGTGTCATCCGATCGAAGGTCGCCTCAACAGCfiCAGCGATTTCGATTCCGATTCCAAATCCGATTCTTTGGCCATAGGAATTCCTCTGGGAGGTGGAGCATGTGGC 
TGATTTATTTATTTCGAGCGATATAAACAGAGCAATACGAGAATGGGCGAATATAAGTTTGCTCGAACGGCTGGTCCGTTCCGAGCTGAAATTGAATTGTAAATGGCCATAT 
TAATTGGGAATCATTTAAAATTTCCAATGATTTTTTCTGTGTTTTGGTGTCGGCAGTCTATGCATGTTCGGTGGCATAATTTATGGGCGCTAATTACTATCGCCCCATTTTG 
TGTATTTCGCGTGCTGTCCAAGGCCAATTACGCTTACTTATTGCAATTTATGTGGCCACAGTCCGGCCGTAATTGGAAATTGTGTAAGGTCGCAGGCAATTTCATTATTTGT 
ATGCACTTCTGGACGCTCCGGTTGCCGGGTCTAGCATTTTTCTAGCATTTTTCTAATGACGCCAGCGCGTTTGGGGACGACGTTACCTAGCCCAGTTTTCCCTTTGCAGCCA 
TTTGATTAAGTGACCTACGTTTCGCCCACACAGAATCATCAATTTGTTCATGTCGCCGACGGTGCGAATAAGGTCCAACCTAAGTGAGTGAGCTATCCGCCGGCATGGCGGC 
GTTGGAACATCCGGATGCCATGCTGCCATACTCCAAATTGTTGTCAACAGCCCTCACTGTGGCCAGTGGTAACTCGAACTCCTCCGATGGTACTTTGCTGCTCATCCAGACC 
AGTGCGGCACCAGGTGGCGATGTGATGGCCACCGGAGCCGGAGCCGGAAAGTTGGGTGTGCCCTCGATGAACTCGTACCTGGTGAGCATGGCGTGGCCCAAGTCGCTGGCAG 
TGGCCGTCTTCCTGGTCATCATCCTGGTCACCGTGGTGGGCAACACACTCGTCATCCTCGCCGTTCTGACCACACGCCGCCTGCGCACCGTAACCAACTGTTTCGTGATGAA 
CTTGGCCATCACCGACTGGCTGGTCGGCACCTGCGTGATGCCTCCATCGGTGGTGCTCTACATAACAGGTGAGAATGGCAGCGGAGGGCCTGTCCATGCCCATCGATGGCCA 
AAGATTTATGGCTCTGTGGGGGCTTTACACTTCCGGGGAAGGTGGAATCGGGGCAAGTGTTTACTATTCGAAGCCCTCCGGTGGAAGTTCGGGTAGTAAACATTTGCCCAAA 
TCGATGGGGGAGCACATTTGGTCGATTTCGATTTAAAATAATCCCACAGCGAAAATTATAAAATTTCTCCGTTGGCAAAATATACATATTTTTGATTGCAAAATATCTTGTG 
AGTGGTGTGCGAGTGTTTTATGGGTTTCGCTGGAAAACCTTTGATAATTTTTTCTGGAATCTATATTTTTAGCAGAATTTTGAATAAAGCATAACAACATTAGTGGTTAAAA 
ATTTTGTATAATTTAGTAATTTTAAATAAATTATGTATCTTTAATAATTAACATCTCTTATTTAACGCATTTCATAAATGTATAAAAAGCAATCAAACATGTGATCTTTTCG 
AAGCCAAATGAATATACAAAGTTTAATATTCCTTGTCATAACTTGAACAATACAAAGAAGAATTTATAGTGGTTTAAATTGAAGGATCTGCCAGTGTAATGCCTCCTTAATC 
CCGGCCAGGTGCACAAACCGATTGAAGCACGGCGACAAATATTTGTTCTTTGCAGGCACTTGGCGCTTTGGCTGGATACTGTGCGACATTTGGATCTCGCTGGACATCCTGC 
TCTGCAGCGGCTCCATCCTCAGCCTGTGCGCCATCAGTCTGGACAGGTGAGTTGGCCAACTAATTTCGTAATTTGCCGTTGATGTCATAGCAATGACCACAGTCCATAGCCC 
GTTGTCCACGGTCCTCGGTCCTTAGATAAAGTCATGGCAAGGTGCAAAAACAGTGCAAATAATTTATTTGTTGACGTTTTTATTGCCGCATGACCGTTGTCCTGGCCATTCC 
CAGTGCATATCCTTTGAGTGTGCGACGCCACAATGGCAAGGATTGGGTTGGGCTTGGATTCCCATTCACCCATTCCCTATTTCCCCATTTCCTCCATTTAGCCATCCCACCG 
TTTGCTTTTGTTGGAAGTTTCAATTACATTTTAATGGCATTTCCAAGTTTTGTGTTGTGGCTCCCGAGTTGAGCTGATCCAGCTGAGCCATGGAGCCATGGACCTGAACCTC 
AACCAGGACAAACACCGACGCCCACTGGCGTTGTGTGTGCCTTTTGGCCTGGCCAGAAAGCCGTCTGCGGCTTTGCGAACAATATGTCCGCTTGGCTTTTGTTTTTTTCCAG 
CATTTCTCTGGCCATTTCCGTTGCGTTTATTGGCCCGGCTTTTATAATTTCCTCTTTTTCACCTTCGTTTTTGTTTTATATTTCGCAGGTATCTCGCCGTCACACAACCACT 
GACGTACTCCAAGAAGCGCCGATCCAAGCGACTGGCCCTCCTCATGATTCTCGTCGTGTGGATAACTGCCCTGTCAATCACATGTCCACCCTATTTGGGCTGGTAAGTTAGC 
GCCAGCTGCACGGAGAACAAAATCAACCTTAACCCATGATGGAICGCCTATTCATCTGTTAAGAAAACCATTCATGTTATAATCATTTAATTGAACGCATTAAATTGGGTTG 
GGAGTACAGGCAACATGTGATAATGATCAGTATCCAAAGGAAGAAAAATAATATATTTAATATCATATAATATTATTAATAATATTATATTATAAATATGGCCAAGATAAAT 
TTATTTGTAGTGCTCTATTAAGTTTATAGATATATATTTTTTTGAATTTTGTAATATTATTTTCACTCTTTCCCGTGTAATTCGAAATATTATATGTAATATTTTTATTGAG 
CACCCATTTTTTTTTTACCAAATTTGTCCTTCCGGAGGCGTGACAATACTTTGTGGCTTTTTGTGGCTGAAAGTCCTGCTCGCTTTTTTTTTTCTTTTTTGAGCTAATGGCT 
GTGCGGTGGCGCAGCATAGATAATGTGCTACAAAAGGGTCTCGAAATACACACGTTTATTTATATTATATTTGCGCCTTTGGCTGCGCCTAAAAAACATTACGTAATTGACG 
ACGGGTTGTTGCTGTTGCTGCTGGTCGTGCAGCCATTACATTGAAGTTTTAATGAGTTGTTAATGACTTTGTTTGCCGCGGCGCATTTGCTTTGTTCTCACCTACGCACACA 
CACTCAGCGCGAAAGCCGCACACTTGTGTGTTGCATACTTCCAGGCGCCCAAATTAACCAGCACTCACTCACACACACTCGCACTCACTCACACACACCACAAAGCCAGCAT 
GTCCTTAATGCCATTGGCCATGGCTTAGTTTTGGCCCCCTTTGGCTGAAAGCGAAGAGCATGGCTTAAAGGGCACATTAAAACGGTCTGAGGCCGAATTACAGCCACAGCTG 
CAGGACATTGGAAAATTTGGAAAATTTGGCGCTGGGGGAAAATCGCAATTATAAGCGCCAGCAGATGGCGCCACTTACATATTTTCATTCTTTAAGCGGACTAGTTTTATGA 
AAAAAGTTTGCAAATGCAGCGGTTGCAACAACTGTTTGTTTTACTCACAATCTGTTATTTGTTATTGCCTAATGAATACTAAACTGATTGTGTTTGCTTGCTATTTAAATTT 
AGATTTAACTTTTGGCTAATTGATTTAAGCATTGTTTATTTAGTTCATTTTTTAATTTAACTAATGCATAGCTAATGTATTAAATTGCCCTCAATTTCGGATTGAATAATTA 
ATGTTAAACAAGAAGTTTAGTAGGAAAATCGGCTATTTCCATTCGGGTCTTGTAAATACCAACCTTAACAAAGTTGCATTTGTACAATTATACTGTCAGGTTATTTTGGTAT 
CGGGTAGCAGTAATTACCGTGAGTTTTTGGATAACACATACTTGGGGTAACAAGCAGCTAGTTTAGTTGAATTACAACACCGGAAAATTCCATTGACAGTAGTGACAATGAG 
GCAATAAACTTTTTGAAAGAGAAACATTATAGGAAACTGTGTATCCAGTGGGCGTGCCCCAAACTTTGGTTGGCATATTTGTAAAAATTGGCCATCCAAAAAATAAATCAAA 
ACAACTGGCAATCATTTTATAATCGTGTGGGCGTGAGTCTTTTAGGCGATTGGGGGGCTTTAGAGTGGGCGTGGTAATGTTCTGAAATAAGCTTGTTCCCGAGATTTGGACT 
CCGGGGGACAAATGGACATAACCGAATCAAGGATATATATACTTCATGCTTCTGTCTGCTACATGATATTCAGTGAACCTAGTATTCTAAGAATAGTGGGTACAATAACCCA 
GTATGAACTGGAAGAAGCTGAATCTTGCCACCTACAGTGGATGTATGTGACAATTTGCCCATTTTTTTTACTCAATTCCCCACAGCACACGGGTGTACCTTTAGTTGACACA 
TTTGGTGTACGCCACCACATTACTCTGTTGTCTCGGAATCGTTTTCTTCCGCATATTGATTTAAAGTCAGCTCCGATTTGTACTGCCTGAATGACCAACACACTGCGATGTG 
CATAAGTAAAAGGCTTACAGGTGGGACTACTCTGACTGGCTTAGGTTTAGGATTTGAATTCGAATTTCGCCCGGCTTAACTCAGCCAGTTGTCATTCCCCAAGTCCCAGTTC 
CCCAGCCGCAATCACAACCGCAAGAGCCACCAAACCAGAGCAGGTACCAGGTACCTCCCACATCCACCACATCTCACATCCTTCCAACCCGCAAACCGCATGCAATTCGAAC 
TGGAATTCAATTTGCCATTCACATCCACATTCGCAGGTACGAGGTCGGTCGGCACCAGGCGGAGTTCGTGGACTGCCGCTACAACCAAAACAAGGGCTACGTGGTCTTCTCG 
GCGATGGGATCGTTCTTCATTCCCCTCACCGTGATGCTCTACGTTTACGTGAAGATTGGTTATGTCCTCACGTCACGGCGACAGCGCATTGTGCGCGATGCGGTAAGCTACA 
TCTCCAGCTCCATCACTATATAAGCATATATGTATGTGTCCACCACTTTGGTGTCCCCTGAAATGTGTCCGGAAATTTATCATGTCACACCTAACACATCTAACCAGCTAAC 
CAGGGGACTAAGTTGTCAGGCACTGGGGAAAAAGATCCAGGAGTGCGCTCAACACTTCAAGTGCAGATCTTCATTAAGTTGGGATACCCACTGGGTATCCCACTTGAGTACC 
AGCCACTTATCGCACAGTACTGCAGGTGATATTTTAAGTGCTGAGCTTTGATAACAGAAGTCCAACTAATATATCTAATCACAAATTAAATTTGCTGTGAATTCTTGGATGT 
AAACTGAATGAATTATCTTTTGTTTGTTAGCATGAAAAATGAACTTCAGTGTCAATAAATGAAATGAAAATGCAATAAGAACACTTTTTGAATATTTAAAAAGAGTGCATAG 
TAGTTGATAAATATATTCTTTTAGTTTAGTACTAATTGAACGTTTGCTGCCAAGAGCAAGTGAAGTAAATTGTAGAAAGTAGGTGACGTGAAATGGAACGAAGTAATTACAG 
CAATCCAAATCCCGAATCTCTATGCCCTGACATGCAATCTAAATTATTCCTCACGTTTTTGTCAGTGCTCTGCTTACATATTCCTCTTCTATTATGTCCTTCCAACAGTACT 
CGGAGCGCACGGCAGACTACGATGTGGATGGGGACAACTTCATCTCGGAGTCGGAGCACTATCATTGCACGCCAACCAAGTGGCTGCCGAACAGGAAGTCCCGCTGGAGGTT 
CAACTCCCTGCACGACCCACCACCCAGCGCCGCCACGAATGCAGCAGCGAAGGGACACCAGTCGTCCGTCAAGTGCTCCAAGTGCTCCAAGTGCTCCTCTGCCACTGGAACT 
GCTGGAGCTGGAGCTGGAGCTGGTGTGGTAGACAAAACGCTGACGTTCAAACATCAGCCCACCTTTTACGAGCTGGTCGAGGTGTCGCGTCTCTCGTCGCTCATTCACTGCT 
CGGCAATTAGCTGCAAGTATGGAGGACCCTGCATGCACTCCACGCCCTCGGGCATGCACTACGTTGGCACGGAGGCCTCCTTCTCGGACACCTGCCTGGGTGGCAATGTGAG 
CAACAACACCACCACCACCACCACCAACACCGTCGCACTGGAAACCAACGAGAAACTGGAGGCGGAGGCCAAGCAAAAGCAACAGCAGCAACGATCCTCCTTCGACCACCAT 
CACCCGCATGCACACCATCATGGCTCCCATGGCCAGCAGAGCCATCACAATCATCACCATCGCATGCCGATGCGAGTTTCGACCACGAAACGTGATAGCAAGACCGCCAAGA 
CCCTGACTATTGTGATGGGCGGCCTAATTGCTTGCTGGCTGCCCTTCTTCGTCTATTATCTGCTGATACCCTTCCTGCCGCGACCCGCCGTCCTCGAGGACCTCATGTTCGG 
CTTCACCTGGATTGGCTGGGTCAACTGCGCCATCAACCCGTTCATCTACGCCTTCTACAATCCGGACTTTCGCACTGCCTTCTGGCGGCTCACCTGTCGCCCCATTTGCAAG 
CAGAAGCGTCCGCCCAACCACCTGGCCATGTTCCGTGGCTAGTTTCCGGATGCAGATGCAGTTGCGGATGCGGATGCGGATCCGGAACCGGTTTCCGAATCACTGAGCACTT 
GAGTGTGTGTCATAGTGCCGCCATCTCCAGTCGCACAATCGACGCTCACTTGGGTGGTGGGGGAAATATCTCAGGCATATCCTTAACTTTTCGCCGTGCTGGTTCACCAACT 
TCTATGTAATTAGCATAAAAAATAAGTGGAGAAACTACCAACTCGCATTTCGATCTTTTTGAAGTACAAGTTATGCCAATGCAAGGACATCAAATTGACTCTAGACTACAGT 
ACAGATAGTAAATAAAAAAGAATTTaAGGAGTATTaACTACAATAGATAAGCTGTTTGAGAAATGCACTATCTCTTATATTAAATCAATTGAAGATAATGAAAATCAAATAT 
tagcttaacaaaactgctgaatttaattcaagtttatttcatgaaaacaaatatatagaaactagtttaaaatttatcacacatgtccctccaaaaattgtaactaatattt 
tggcaaacgaagccaatggagggaccttttggccaactcgtcgccactagctcaccaactaagtattcatttgccaggcactataagctgtttgccgcactctatctgcatc 
gtttatgccaaacgcagctagtaaaactaagatacttatggtcaatattgattggagcaaagattgccttgttaaacccaaaaaatcgactgccttttgaagcttcaaaagc 
cgcatttcgagcagcatcttaaaggtcactttacggtcgcaattttttttttttaagactcactggacttggccggaagttgagattcttgcctagaccagatccaaatcct 
atatatagtattttcctgtgtgtgagcgttgattgtgcgaggcgagcggcacgaagcgaaacgtaacgaaacaaaagacaaattatagtatttgcgagtctttgttgcctga 
cggcaactgttggccatcgggatgagtgggatga 

(SEQ ID NO: 124) 

Exon: 1001.. 1412 
Exon: 2072.. 2174 
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Exon: 2777.. 2902 
Exon: 5189.. 5366 
Exon: 6045.. 6491 
Exon: 6549.-6986 
Start ATG: 1001 

Transcript No. : CT37292 

ATGGCGGCGTTGGMCATCCGGATGCCflTGCTGCCftTACTCCAAATTGTTGTCAflCAGCCCTCflCTGTGGCCAGTGGTAACTCGAACTCCTCCGATGGTflCTTTGCTGCTCR 
TCCAGACCAGTGCGGCACCAGGTGGCGATGTGATGGCCACCGGAGCCGGAGCCGGAAAGTTGGGTGTGCCCTCGATGAACTCGTACCTGGTGAGCATGGCGTGGCCCAAGTC 
GCTGGCAGTGGCCGTCTTCCTGGTCATCATCCTGGTCACCGTGGTGGGCAACACACTCGTCATCCTCGCCGTTCTGACCACACGCCGCCTGCGCACCGTAACCAACTGTTTC 
GTGATGAACT1GGCCATCACCGACTGGCTGGTCGGCACCTGCGTGATGCCTCCATCGGTGGTGCTCTACATAACAGGCACTTGGCGCTTTGGCTGGATACTGTGCGACATTT 
GGATCTCGCTGGACATCCTGCTCTGCAGCGGCTCCATCCTCAGCCTGTGCGCCATCAGTCTGGACAGGTATCTCGCCGTCACACAACCACTGACGTACTCCAAGAAGCGCCG 
ATCCAAGCGACTGGCCCTCCTCATGATTCTCGTCGTGTGGATAACTGCCCTGTCAATCACATGTCCACCCTATTTGGGCTGGTACGAGGTCGGTCGGCACCAGGCGGAGTTC 
GTGGACTGCCGCTACAACCAAAACAAGGGCTACGTGGTCTTCTCGGCGATGGGATCGTTCTTCATTCCCCTCACCGTGATGCTCTACGTTTACGTGAAGATTGGTTATGTCC 
TCACGTCACGGCGACAGCGCATTGTGCGCGATGCGTACTCGGAGCGCACGGCAGACTACGATGTGGATGGGGACAACTTCATCTCGGAGTCGGAGCACTATCATTGCACGCC 
AACCAAGTGGCTGCCGAACAGGAAGTCCCGCTGGAGGTTCAACTCCCTGCACGACCCACCACCCAGCGCCGCCACGAATGCAGCAGCGAAGGGACACCAGTCGTCCGTCAAG 
TGCTCCAAGTGCTCCAAGTGCTCCTCTGCCACTGGAACTGCTGGAGCTGGAGCTGGAGCTGGTGTGGTAGACAAAACGCTGACGTTCAAACATCAGCCCACCTTTTACGAGC 
TGGTCGAGGTGTCGCGTCTCTCGTCGCTCATTCACTGCTCGGCAATTAGCTGCAAGTATGGAGGACCCTGCATGCACTCCACGCCCTCGGGCATGCACTACGTTGGCACGGA 
GGCCTCCTTCTCGGACACCTGCCTGGGTGGCAATAAACTGGAGGCGGAGGCCAAGCAAAAGCAACAGCAGCAACGATCCTCCTTCGACCACCATCACCCGCATGCACACCAT 
CATGGCTCCCATGGCCAGCAGAGCCATCACAATCATCACCATCGCATGCCGATGCGAGTTTCGACCACGAAACGTGATAGCAAGACCGCCAAGACCCTGACTATTGTGATGG 
GCGGCCTAATTGCTTGCTGGCTGCCCTTCTTCGTCTATTATCTGCTGATACCCTTCCTGCCGCGACCCGCCGTCCTCGAGGACCTCATGTTCGGCTTCACCTGGATTGGCTG 
GGTCAACTGCGCCATCAACCCGTTCATCTACGCCTTCTACAATCCGGACTTTCGCACTGCCTTCTGGCGGCTCACCTGTCGCCCCATTTGCAAGCAGAAGCGTCCGCCCAAC 
CACCTGGCCATGTTCCGTGGCTAG 
(SEQ ID NO: 125) 



MAALEHPDAMLPYSKLLSTALTVASGNSNSSDGTLLLIQTSAAPGGDVMATGAGAGKLGVPSMNSYLVSMAWPKSLAVAVFLVIILVTVVGNTLVILAVLTTRRLRTVTNCF 

VMNLAITDWLVGTCVMPPSVVLYITGTWRFGWILCDIWISLDILLCSGSILSLCAISLDRYLAVTQPLTYSKKRRSKRLALLMILVVWITALSITCPPYLGWYEVGRHQAEF 

VDCRYNQNKGYWFSAMGSFFIPLTVMLYVYVKIGYVLTSRRQRIVRDAYSERTADYDVDGDNFISESEHYHCTPTKWLPNRKSRWRFNSLHDPPPSAATNAAAKGHQSSVK 

CSKCSKCSSATGTAGAGAGAGVVDKTLTFKHQPTFYELVEVSRLSSLIHCSAISCKYGGPCMHSTPSGMHYVGTEASFSDTCLGGNKLEAEAKQKQQQQRSSFDHHHPHAHH 

HGSHGQQSHHNHHHRMPMRVSTTKRDSKTAKTLTIVMGGLIACWLPFFVYYLLIPFLPRPAVLEDLMFGFTWIGWVNCAINPFIYAFYNPDFRTAFWRLTCRPICKQKRPPN 

HLAMFRG* 

(SEQ ID NO: 126) 

Name: Octopamine receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384349 

TCCATGCATTTATAATTCAACATTCGCTCGCGGCCTTATGAAAAGCTCATTTGGTTACGAGTGTCCTGGGTCCTTAAGTCCTTTCTACCTCGAAAGTGTTAATGAGCCGGCA 
GGCTGCCGTAATTATGCAAATTTTTAATGACTATGAAAATGGTTTCCGCGGAGCAACAAGTGCTCGCGTCTTGGTCCTTGGATCTCTATACCCCTCATACCCCTCTAAAAAT 
GCCAGCTAATTCCATTTGACCCCAGAGGCCCCAAAGCAATCCATTTGCATGGCTACCAATAGCTCGTCGCATTCGTCGTACCAACCCAACCATTTTCCATTTAATCCCTACG 
ATGTGAGAGCTCTGTACTGAGCTTACTGCTGGCATTCGGTCTTTAGATTTATACAGATTTTGTCAAACAAAACATGAATTCGCACTTTTACACACATATGTGTGTCTGTAAA 
TGACTGAAAGGGAGAGCGTCGAAAGCCGATTTGGTGCCATTCAGTTGAAGCCCTTTAGGTCGTGATTCGCATATGGTCGAGGCCATTTAAGCTGATATCAGGCACATTAAAT 
GTAGTAAAGGAACGTAAGTATGATAAAGGCAGCATCACAAAAGTGTCAAGTAAATGCAAATTAACATTAATTTAAGACTTGAATTCGTCTGTACTGAAACAGTACAATTAAA 
ATATAATTAGCTATATATTTGTAACTTTTAAGAACGGATACTTCTGGGACCAATGTTTTTTGAACATAACAGATATCTACCATATGAAGTGAAATAAATAGAATTTTTCGGG 
ACCATTTTGTTTCCAGATTTCAAATGAAATGAAACGCATATTTTTGCCAGTTTTTGATAATAAATATCAGGGGACTTGGCCAAATTTTAATACAATCTCATTTAGAAAAATA 
AAACAATTGCATGCCGATAAAATAATAAGTGATACCGAAAGAGAGCTTTTGATTAGAGTGAAATATGCTCATGCGGTTCAGTCGAGATCCAAAGTTTGCGCAGAATGCTTCT 
GAATATTTTAGCTATAATTCTTGTATTCGTAATTTCAAGCCAATCGGAGGCAGTTATTCCTGGATGTGATTACTTCGACACCGTGGATATATCTCATATCCCAAAGCTAAAT 
GACTCCTATGCGTACGAGGAGCTAATCATTCCTGCTCATCTCACGGGTCTATATACATTTAGGCAACTGGCGGATGGATCACAGGAGCCAGTGAAGAGTCACTTAAGAGCAT 
GTATCTGCAAACTAAAACCTTGTATCCGATTCTGCTGCCCCCGCAACAAAATGATGCCGAATAGTCGCTGTAGCGATGGACTCACAGAGAATCTGAAGCGGATCAATCCCTA 
CTTGAAGATCACACTGGAGGATGGAACTATAGGAAAATATTACCTTCTCACCGACATGATCGTCTTAAGATATGAGTTTCGGTATTGCGAGAAGGTTGTTTCGGTACAGGAG 
GATCAGTACAAGCTATATGAGGTTAAGATTTCAATCTTTTTGAAGTATCTCAGTATAATTTGATATCTATAGAATGGCAGCTTTATGATTAAACCCGATGTAAACTGGACAT 
TGAGCAAGCAGTGGTACTGCCTCCATCCCCGTTTGGAAGATCCGAATTCCATATGGATACTGGAACATGTATATATACCGAAGTCAATGCCAGCCGTACCTCAGGGCAAGCA 
CCTACTATATCATCTAATCTCATCTCATCTAATTCTCTGATTTTTCCAGTCGGTACTATTTCCATGGTTGGTTGTATTCTCACGATTGCAGTGTATCTCTACATAAAGAAAC 
TGCGAAATTTGCTTGGAAAATGTTTCATTTGCTATGTATTTTGTAAGTTTGTGCAGTACCTTATCTGGGCAGGAGGAGATTTGAACTTGTGGAATAATATTTGCTCCCTAGC 
AGGTTAGTTGAATCCATATTAAAGTTTGACTTAAAATATGCGTTTATTCTAAACAGGTTACACTAACTATTTCTTTGCTTTGGCTTCTCACTTTTGGCTCTCTGTTATGAGT 
CATCAAATATGGAAAAACTTGAGGTTAATCAATCGGGATGAACGTAGCTATCACTTTCTCATCTACAACATCTACGGTTGGGGCACACCTGCTATCATGACGGCAATTACAT 
ATCTGGTGGATTGGGCCTGGGAGGATAGACCTGACAAACTGAATTGGATTCCCGGTGTTGGCTTATATCGATGCTGGATAAACAGTGGGTTTTAGGGTAGAGTGTTTAAGCC 
AGTTTAAGACCAGTAATTGATTTTCTTTGAACAGCCTATGACTGGTCTGCTATGATATACCTATATGGACCGATGCTGATCTTGAGTTTATTCAATGTGGTCACGTTTATCC 
TGACAGTAAATCACATAATGAAGATTAAGAGTAGCGTTAAGAGTTCTACCCAACAGCAGCGAAAGTGTATACAGAATAATGAGTAAGCTTTAAGTATTTATAACTTACTATT 
TCTAATTCCATTCTCTTACTATAGTTTTCTATTATACCTACGACTGTCGGTGATGATGGGCGTGACTGGGATTTCTGAGGTAATAACTTACTTCGTCAAGCGGCACAAATTT 
TGGCGACAAGTCCTTCGGGTGCCAAATTTTTTCCATTTGGGTTCCGGTATAGTCGTATTTGTGCTATTTATTCTTAAGCGCAGCACATTTCAAATGATAATGGAGAGGTAAG 
AGCTTTTTGCTTAATAGATTTTAAAGTTACTATATCAATATCAT.TTTAGAATCAGCGGTCCGAGGAGACAACAACCTGCATCTTGATCAACCTACTAGTTTAAATAAATGCA 
AATTATAGATAAGTGAGCTCTTTTTTTTTATAAGTTTAGTATAGAGCAGTCCCTCTATGGTCAATAAATGCTTATAATCTTCTTATTATTGAAAAATTCTTTATTATTGATA 
TTTAATCAATAATATATTTTTATTTTTTTTTTTTTTGTTAGAACAAAATGTCAATTGTCATCAAAAATATGTGTTTGTGAATAAAATCAAGTTCCGTAGGAAAATATGCACT 
TTTATGGAAACTCGCTTTTTCAATTAGTCCTTGCATACGCGCCAGGACCCCAAATCCTTCCACCTTCCCCACAATCCTCGGCTGGTGTCAGAATGCAAATGCAGATTTTTAA 
GCCTTGCGGCAGCTTTTGCGCAAACAGATTGTAAACAAATAAATGCCGCCGGCAGACACACAGATACTCGAACATTCGGTGGTCAGTGGCCAGATTGCAGCCTGAAATTCAT 
TCAATACTTTATCTCACATGCATTGCACTTCACTTTTGAGCCACGTTAAGTTCAGTCTGCCGAAACCTTTTTCGCACTTAAGTGGTTAAATTGGCTGTGATAAATTGTTGCC 
GAGTCAAAGTTGCTGCCGCTGCGGGGCACCTCAAGTCACATGTGGTGATGGTGCGAGTGGGTGGTTGGGTGGCGTGTGCTTTTCTCCTCGGAAAATGCGTGAAAAGTCCACA 
AGCACAAAAAGTCTGGGAGGCTGGCTCTCTCTCATCTCATCACGTTCATTGAGTGCCTGTCGCTGACAGTAATCATAACGCAGTCATCAGCTCGTAATATTACGGCCGCCTG 
ACATATGACATCTGGGAAAGCGGAAAATGGGGGCTCAGGGACCAAGGAAAAGGTATGCGTACTATTAATATGCCAGTGTGTGAAGGAGCTTTCATTC 
(SEQ ID NO: 127) 

Exon: 1001.. 1636 



Start ATG: 1 



FIGURE 
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Exon: 1741. .2208 
Exon: 2283.. 2681 
Start ATG: 1001 

Transcript No. : CT37715 

ATGCTTCTGAATATTTTAGCTATAATTCTTGTATTCGTAATTTCAAGCCAATCGGAGGCAGTTATTCCTGGATGTGATTACTTCGACACCGTGGATATATCTCATATCCCAA 

AGCTAAATGACTCCTATGCGTACGAGGAGCTAATCATTCCTGCTCATCTCACGGGTCTATATACATTTAGGCAACTGGCGGATGGATCACAGGAGCCAGTGAAGAGTCACTT 

AAGAGCATGTATCTGCAAACTAAAACCTTGTATCCGATTCTGCTGCCCCCGCAACAAAATGATGCCGAATAGTCGCTGTAGCGATGGACTCACAGAGAATCTGAAGCGGATC 

AATCCCTACTTGAAGATCACACTGGAGGATGGAACTATAGGAAAATATTACCTTCTCACCGACATGATCGTCTTAAGATATGAGTTTCGGTATTGCGAGAAGGTTGTTTCGG 

TACAGGAGGATCAGTACAAGCTATATGAGGTTAAGATTTCAATCTTTTTGAAGTATCTCAGTATAATTTGATATCTATAGAATGGCAGCTTTATGATTAAACCCGATGTAAA 

CTGGACATTGAGCAAGCAGTGGTACTGCCTCCATCCCCGTTTGGAAGATCCGAATTCCATATGGATACTGGAACATTCCATGGTTGGTTGTATTCTCACGATTGCAGTGTAT 

CTCTACATAAAGAAACTGCGAAATTTGCTTGGAAAATGTTTCATTTGCTATGTATTTTGTAAGTTTGTGCAGTACCTTATCTGGGCAGGAGGAGATTTGAACTTGTGGAATA 

ATATTTGCTCCCTAGCAGGTTAGTTGAATCCATATTAAAGTTTGACTTAAAATATGCGTTTATTCTAAACAGGTTACACTAACTATTTCTTTGCTTTGGCTTCTCACTTTTG 

GCTCTCTGTTATGAGTCATCAAATATGGAAAAACTTGAGGTTAATCAATCGGGATGAACGTAGCTATCACTTTCTCATCTACAACATCTACGGTTGGGGCACACCTGCTATC 

ATGACGGCAATTACATATCTGGTGGATTGGGCCTGGGAGGATAGACCTGACAAACTGAATTGGATTCCCGGTGTTGGCTTATATCGATGCTGGATATGGTCTGCTATGATAT 

ACCTATATGGACCGATGCTGATCTTGAGTTTATTCAATGTGGTCACGTTTATCCTGACAGTAAATCACATAATGAAGATTAAGAGTAGCGTTAAGAGTTCTACCCAACAGCA 

GCGAAAGTGTATACAGAATAATGAGTAAGCTTTAAGTATTTATAACTTACTATTTCTAATTCCATTCTCTTACTATAGTTTTCTATTATACCTACGACTGTCGGTGATGATG 

GGCGTGACTGGGATTTCTGAGGTAATAACTTACTTCGTCAAGCGGCACAAATTTTGGCGACAAGTCCTTCGGGTGCCAAATTTTTTCCATTTGGGTTCCGGTATAGTCGTAT 

TTGTGCTATTTATTCTTAAGCGCAGCACATTTCAAATGATAATGGAG 

(SEQ ID NO: 128) 



MLLNILAIILVFVISSQSEAVIPGCDYFDTVDISHIPKLNDSYAYEELIIPAHLTGLVTFRQLADGSQEPVKSHLRACICKLKPCIRFCCPRNKMMPNSRCSDGLTENLKRI 

NPYLKITLEDGTIGKYYIiLTDMIVLRYEFRYCEKWSVQEDQYKLYEVKISIFLKYLSII* 

(SEQ ID NO: 129) 

Name: mth-like 6 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384207 

AGTATATACGAGTATATCCTGTACGACTATATATCAAACCGAAAGTCGAACTGAAAGTGTCCTCTAAGCGGGACTTTCATCGAGGTCGGTGCCCCTCATTATCGTATTCATT 
TGTGGGGCTCATTTGATTCATTGTGTCTGTTCGTGTGTGTTGGCGTGTTACTCTGTTAAGTTGGATGCAATTCTGTCGCCGATGGTTGTGGTTTTGTGTCCATGGCCCATCA 

atggggatgctccatccatgtggatgggcggtcagccaactaacttggaacattcgtatgtgatgtgtctgtggtggagggcgccgacctcgaggcccgctagctactgtat 
ccaattgaactgtgatttagctaactaactctaggtaccgacacctcgtgtcggcgtggaacaatccggcctctaaggatatgaatttctttacactcgcttacctaatcgt 
actgtgggaatcggaaggaaggttctgtgtgccgtgggttttggttttgggtgggaggaaaagcttgctgctcgtgatatgtgataggttgtgggagcagcccactttgaag 
gacactactctattctggtaaagcgaatcgtatccttggcgttttgcttacgtctactttggaatcataatcgcgtgcttgctgtgtgttgattatatcgctacttggggtt 
tgcttaatgtaattgttaaaaattgtacctagcctacagcttgcaatccaccgttcatgctccacatagcatatatgtatgtatgtagtttagtggtatgtacagatctatt 
tacataagtagctagtagaaccttaaagctattgggaaacgcaacgaacgacaaacgaaacgaaactggagcccaattgagatctctgctgcaacgatggcctccatttggc 
acactttgccatcctctcccgggctttaatccattccacaagggacccatcatccaattgcctattgctataactattgctggtgacgtgctcgccgttgataatcacccca 
tgtgcatgatcttcttgaaggccttgcgaaactcgggattgaatattgtgtagataaccgggttcacaaagctgttgatgtagcccagccaggtggtcatcatgtaggcggt 
gagcccaggtcggcagtctttcttgaacttggcgcacatggcgtccatgatgttgcagctgaagaacggcagccagcagaacaggaagacacctgtttgagggtttgaaggt 
tatatatatgatggcatatggattagaatatggcagagcgcaaggacgtggtccaaaacggctgggggaggtggctcaacttgaacttacccaaaacaatggccaatgtttt 
ggtggccttgcgctcctttttggccgacgatttttcgcgtttctttttcgaggccttgtgcaccttgtatatcgtgaatctgtgcgtcacgcgaaggataataggccatggt 
gtggtatatattgcattttccacaggcaaaaaaggcaaatgatgaagagaggtagagaaggcgagaaggaaatggggtggcattagcatcgactggccaacaagagcccaaa 
taactaataactgtaattaggggcaatggacggcgttactcaccttgacgcctgcgagttcttcttatccttgcgcgaggacgttttcgatgtagtcgatagcgtcgagtcg 
ttgcgtgccaaagtcatggcctcctgcactccgtaccgcacaaaggacaatggcttcatggcgacgctgcgaacggaagtcggtcggattcggaatgagaacttaactgcgg 
cattacgtactactagtCactatttttgggcttgccggattgcctacattgccaacatgcggcttctattttgggggtttcgaggataaagtggcaggtttctcgctaaaca 
ctacgtaataggtctgtgatgaaaacgtgagctcttttttgctaagcaacttaaatcaagcgattgtgcaagaatgccaccaaataatcaattgatgatttcataagcctgc 
tgatatgtaaacatgccgacaataattatgaaaagtagtggcaagtggcagaactaatactgcaaatcactatttaaatgtgcaattggatttcaatctgtttgaaaaatgt 
aatgtgttaaaaaaaaaaaaaacatgtaatagttgacaacaataacccgacatatttaattaaactttgccatttgtttaccaattaatatttggaagcaaagatgttgaca 

TTTTTGACGCGGTCAAACTCTTTGTCTGTTTTTATTGAACGAAAAAAGCCATGCGTGTTATACGCATTTATTGTTTTGTTTATAAATAATGACAATTGATGTGTGCAATTTA 
AGCAATTTTAATATGCTACGTAATCTAGTGTAAGTAGATAAAAAAAAACATTGCGCAAAAAGAGCACTGCCTATAGCTGAAAAAAGTATTATATTATTCAAAAACTATGTAC 
AAAAACTTGAATGGGAATGCTTTATTTTGTCGGGCTGCAAAAACAGTTCATTAAACTTTTAAACTGTTGCGAAAAGTTTCATTTTACCTATGGTACAATCAAAAAGAATGTT 
GTTCAATGATTTAGTACCAATGTGGTACAGATATTGGACGGTATTTTGTTTTTTTTTTTTTTCTAAATATTAGCAATTTCTCGAATTTCCGATATAATTGCCGGAACTTTGA 
TTTTAAAACAATAACTTGGATGAGGCATTATGTTTTTATCCGTGCACGAACTTCTACTACGCCCCCAACTACTGGGGATGCTATTTCTAAGGTGGCAGGCTTACGGATCTGA 
ATGCCAAATACCTGGAGTTATCGACACCGACTGACCTGAGGGCGGGTTCGCCGCGCTTCGGCGAGTCGTCGCCGGTGGGGCGCCGCTGACTGCTGACGCTGGATCGCTGGAG 
GGTGGCGCCACTGGGCAGCGGACTGTCCGGCGGCGAGCCGCTGCTCCGCGGAGCGGCTGATGTGGCAGCAGAGGCGGAGGCGGGCGCCACGCCTGCAAGGACATCGTCAACG 
TTTGAGGCTGCATAACCAGAATCATGATTACCATTCGGATCGGCGACGACCAGCTGCGGCTGTGTGGTGATTTGGGCCACAACGCTGCTGCGAGTGCGTTATGGCAATGTGG 
AGCGATGTGACGGTGGGGTTATCCAATTAATGGTGCCGCGACCATATCCATCAATCAATCAATCAATCGGCGATAGACGACGAGTGGGCGAAGAAGCAGCAGAATCCGGCGT 
TGATGAAGGACGAAAAGATGGAAAATCGGGGTGCAACGTCAATTAAAGGCAAGGCAAGATTGAGTTTTAATAAAGTGACGCATTAATTGAATTATTGAGAATTATTTTTCTC 
TGCCATTGGCTGTGATTTCTTTTGGCGGGTAAAGTTTCGCGGTCTTGTTTGCCATCCATTTCAGCCCATCGGTTCGGTTCGGTTCGGTTCGGATCGGATCCCATCCCTTTGT 
GCATCCAGCCTCCTGTCATTAATGGTTTCATTTCATGTCCTTATTGGATTAAGCAGGCCATCAGGATGCTAATTGGCTGCGCTTAACGCTCGCACTTAATGGCGGTTTCGGC 
CAGGAATTGGGAGAAAGATTCGAATGCCGTGGGGCGGGGACATCCGATTGCAAGGATGATGGGCAAGGGATCCTTTCGGCAGGACCTGATTGAAAGTGCAGCACGCTTTTAT 
GTCGGCAATTGAGTGCCGCTGTGTCCGGACGAAGCCTGCCATTAGATTGACAGTTTTCCATGGGCATGCACCGTCTGGGGGTCCATTGCTTTGCTGGTCTGCTGGTCTGGAA 
GGAAGTGCAGTCGAGACCACTCAGCAGAAACAGGCTTATCTATTACGTGTTTGGGGCACGTGCCAGGGCTGTCAGGGTTGTCAGGTTGCAAGTTGCAAGTTGCAAGCGGACA 
AGGATATGGACACGGAAACGAAACTGCTGAGCTCCGAACTTTTGCAGGGCCATCGGAGATAGCCCCCAGCCAAACATTCGTTCTCCAGGCGATGGAGTTTTCTGCGAAATTC 
GCCGAAAGTTTCCGCTTTTCATACCCCGTTCACGTAAGATGTAAGTGGGTTTGCAAATGTACATATATGTCCAATCCGACTCCATTGGCAACGTGTTTATATTCGTGAAGTA 
ACATTAGCCGCCAAAAGAAGGTCTTACACAGCCAAAAAAAAACTTTGTGAAACGTATATTTCCTTTTTGTATATTCCAAATCTGATTTAAAGCTATCATGTTCTACGCATTT 
CTACACAGCACATCAAAGCCCCTAAATGGTATCGTAAAATGAGTACTCACTTGCCCGTCTCCTCGACAACGGTGGCCAGCATGAACTCGGTGGACTTGTCGTTCACGATGAC 
GTGGCAGTCGTCGATGTCCGGCGAGATGGCGTTCTCGTCCTCCTCCTCGTTGGAGCCGCTGGCCGTGTTGGTGGCGGCCTCGTCGGGCAGAATCCTACTGGCGTGGCGACTG 
CTGTCCAGGGCCGTCTCCGCCAGGCGCCGGGTCTGGGCAATGTTCTCGATGACGCTGCCGCCCGTGAGTTCCGATAGGTGGGGCTTGCGGGCCGCCCGCTGCTTCCGCGCTC 
GACTCCTCAGCGCCTGTGGGTGGGTGGGTACGAGATGTAAAATACAGAAAACAGTATACCAAACATGGGACACAGGATACAGACTACGGACGACGGACCACGGACTACGGGC 
TGTGGGACACGGGATGCAGGATGCAGGATGCAGGATGCAGTAGAGAGCGGATTAGGCACTGCCATGCTGCGCTTGCATCCTACGGATATTCGCCCCCAGTCAGCAGCAAATG 
TGCACAAACAGAAACAGAAACAGGAAGAGAAACAGAAGCATCCCCAAGCCAGCATTGCAGATTATGGATCTGCGCCAGCAATATTCATTCCACTCGACTCGGAGAAATCCGA 



Start ATG: 1 



FIGURE SHEET 




WO 01/070980 
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TTTTAATTGCGGTCTTTTAATAATAGTGCTTGAAATTTTTTTGCGCAATGGAAAGAGCGACAGAAAGACCAARGCGCGGAAAATTCGCTCTAGGAATTAATGTTTGTGGGAA 

GCTTGGAGTGTTCAAAGGAATTAACATGTTTGGflATATATCGTATATATACATATATCGCTCGCTACTCACCTTGAAAATGTTCCAGTAGAGAAACACCATGRTGATGCAGG 

GTATGTAGAAGCTGCTCAGCGAGGAGTAGAGTATAAAGTCGGCGTTGTAGMGGCGCATACATCGGGTTCGCGGTTGGGCGTGTTGTTGAGGCCCAGAACTATCGGCGAGCC 

GATGGCAGCCGATATGGCCCACACCAGCAGTATCGTAAGGCAAACGCGGCGGCTATTTTTGTGCTTGGCGTACTTTATTGGCTGTGTCACAGCGATGTATCTGTGGGATATA 

TCCGTGGAATGTGTGCTTGGGTCATATGGGTCTCCGRAflTAGGGGCCGCTGTGAACTCACCTGTCTATGGAGATGGCAACTAAGTTGAATATCGATGAAGTAGAGCATATCA 

CATCCATGGCTATATAGAAATCACAAACGACGTCAGGCAGGGCCCAGGCTCCATTTACCTGCAAflACAAAAAGAAGGAGGGGGGGGGGGCGTCAGGTAAGAACGAGGATGTG 

TATGTCGATGCGGATGAGGAGCACACTTCATTTGTTCGGCACTTAATCGACTCGGGAATGGAGCTCTATTGGAATGCCCATCCTTTGTCGAATATTAATAACAATATTTTCC 

CATTTCCCGTGCTGCTGCCACAAATTAGTTTTAGACTCTATCTCTTCATCGACGTCTCCGGCAGCATCTATGTTATGGCCCAACCCGAATGCGGACTCCATAAAAACCCCAA 

CAAAAACGTCAGGAATTTCCCAAGCCAAACAATTTGTAGCACACAAAGCACATAAAAAAAGGCCGAGTGGGCCACCAGTTCGGTTGGAACTTAATGTGGATATGTTTGGGTG 

AACACCCAACCTGTCGTGGTTGCACTCCCCATAGCCCGTGTACGAGGTATACATAATGTAGCGCCAACCGATTCGGTATTAGAATGGCATCGATAGCTTTTTGGATGTGAAT 

ACTATTAGTAAGTAGAGGTAGCTGTCACATAGGTATTTGTTGACTGATCTGCGGGTAAGAGGTATCGCATAGTCGAATACACTCGGCGGAGCCAGTTTCCCAGAATCAGTAG 

TTTAACATTTTAAGAAGTCAATATTTGAATAATACTACTACTACATATATTGTACTTATATGCCACCAATGTGTGTGTGCTAACTAAAATGCATGTAGACGGTATCGAATAG 

TCGAAGTTAATCGACCCTAACGTACTATCTTGTTTTTCTTCTTCAATCGGTTTTGCGCCTAAATTATACAAACACATTGAAACTTGAAGCGGGGCTCAACGCGTGGGAAGAG 

TGTAAAATATTGATACACACGCAACCACACCACACGAAAGTAAAAGTGAAAGTGAAAGTCAAGTGAAAGCAGCATAGAAATGTGCTAAAATAGCTGGGCAGTCGCAGACTTG 

GAATTTCTTAGAACTCGGATGAAGACTCCTTAGCCGGAGCTAAGCGAGCATAACAA 

(SEQ ID NO: 130) 



Exon: 5552.-5437 

Exon: 5364.. 5112 

Exon: 4605.. 4307 

Exon: 3111.. 2836 

Exon: 1746.. 1612 

Exon: 1423.. 1323 

Exon: 1212.. 1001 

Start ATG: 5495 (Reverse strand: CAT) 



Transcript No. : CT37739 

TTGCAGGTAAATGGAGCCTGGGCCCTGCCTGACGTCGTTTGTGATTTCTATATAGCCATGGATGTGATflTGCTCTACTTCATCGATATTCAACTTAGTTGCCATCTCCATAG 
ACAGATACATCGCTGTGACACAGCCAATAAAGTACGCCAAGCACAAAAATAGCCGCCGCGTTTGCCTTACGATACTGCTGGTGTGGGCCATATCGGCTGCCATCGGCTCGCC 
GATAGTTCTGGGCCTCAACAACACGCCCAACCGCGAACCCGATGTATGCGCCTTCTACAACGCCGACTTTATACTCTACTCCTCGCTGAGCAGCTTCTACATACCCTGCATC 
ATCATGGTGTTTCTCTACTGGAACATTTTCAAGGCGCTGAGGAGTCGAGCGCGGAAGCAGCGGGCGGCCCGCAAGCCCCACCTATCGGAACTCACGGGCGGCAGCGTCATCG 
AGAACATTGCCCAGACCCGGCGCCTGGCGGAGACGGCCCTGGACAGCAGTCGCCACGCCAGTAGGATTCTGCCCGACGAGGCCGCCACCAACACGGCCAGCGGCTCCAACGA 
GGAGGAGGACGAGAACGCCATCTCGCCGGACATCGACGACTGCCACGTCATCGTGAACGACAAGTCCACCGAGTTCATGCTGGCCACCGTTGTCGAGGAGACGGGCAACAGC 
GTTGTGGCCCAAATCACCACACAGCCGCAGCTGGTCGTCGCCGATCCGAATGGTAATCATGATTCTGGTTATGCAGCCTCAAACGTTGACGATGTCCTTGCAGGCGTGGCGC 
CCGCCTCCGCCTCTGCTGCCACATCAGCCGCTCCGCGGAGCAGCGGCTCGCCGCCGGACAGTCCGCTGCCCAGTGGCGCCACCCTCCAGCGATCCAGCGTCAGCAGTCAGCG 
GCGCCCCACCGGCGACGACTCGCCGAAGCGCGGCGAACCCGCCCTCAGCGTCGCCATGAAGCCATTGTCCTTTGTGCGGTACGGAGTGCAGGAGGCCATGACTTTGGCACGC 
AACGACTCGACGCTATCGACTACATCGAAAACGTCCTCGCGCAAGGATAAGAAGAACTCGCAGGCGTCAAGATTCACGATATACAAGGTGCACAAGGCCTCGAAAAAGAAAC 
GCGAAAAATCGTCGGCCAAAAAGGAGCGCAAGGCCACCAAAACATTGGCCATTGTTTTGGGTGTCTTCCTGTTCTGCTGGCTGCCGTTCTTCAGCTGCAACATCATGGACGC 
CATGTGCGCCAAGTTCAAGAAAGACTGCCGACCTGGGCTCACCGCCTACATGATGACCACCTGGCTGGGCTACATCAACAGCTTTGTGAACCCGGTTATCTACACAATATTC 
AATCCCGAGTTTCGCAAGGCCTTCAAGAAGATCATGCACATGGGGTGA 
(SEQ ID NO: 131) 



Start ATG: 58 (Reverse strand: CAT) 



MDVICSTSSIFNLVAISIDRYIAVTQPIKYAKHKNSRRVCLTILLVWAISAAIGSPIVLGLNNTPNREPDVCAFYNADFILVSSLSSFYIPCIIMVFLYWNIFKALRSRARK 
QRAARKPHLSELTGGSVIENIAQTRRLAETALDSSRKASRILPDEAATNTASGSNEEEDENAISPDIDDCHVIVNDKSTEFMLATVVEETGNSVVAQITTQPQLVVADPNGN 
HDSGYAASNVDDVLAGVAPASASAATSAAPRSSGSPPDSPLPSGATLQRSSVSSQRRPTGDDSPKRGEPALSVAMKPLSFVRYGVQEAMTLARNDSTLSTTSKTSSRKDKKN 
SQASRFTIYKVHKASKKKREKSSAKKERKATKTLAIVLGVFLFCWLPFFSCNIMDAMCAKFKKDCRPGLTAYMMTTWLGYINSFVNPVIYTIFNPEFRKAFKKIMHMG* 
(SEQ ID NO: 132) 



Name: Dopamine Receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384407 

TCCATGCCAGGCCTGGATGCACATGGGCCGGATGGGGATCAGGTATATATCGAGAGCGTCGAGAGTTCGGTTTGCGTTTTGCCCACCAAGACGAAACCCAAAAAGGTCGCCT 
TTTACGGCAGCAATGGTCAGCGCTACACTTTCCTGTTCAAGGGCATGGAGGATCTCCATCTGGACGAACGCATCATGCAGTTCCTGTCCATCTCGAATGCCATTATGGCCTG 
CCGCAGCGACGCACCCGGCAATGGCTGCTATCGTGCCCACCACTATTCGGTTATACCATTGGGACCGCAATCCGGACTGATTAGCTGGGTGGACGGGGTCACACCAGTCTTT 
GCGCTCTACAAGAAGTGGCAACAGAGACGGTCCCAAGTTGCTGGGAATGCGGGAGCAGGTGCTGTGGCAAATGTCCCGCGACGTTTCACAGACTTGTTCTACAACAAACTCT 
CGCCCCTGTTGGCCAAGCACAATATGCAGGTGAGCGATCCACGACGCCAATGGCCAATATCCGTCCTGCTCCAGGTGCTCGACGAATTGTCACAGGAAACACCCAATGATCT 
GCTTGCCCGCGAACTGTGGTGCCAGGCGGGAAATGCCGCCGAGTGGCGGCAATCGGTGCGTCGCTTTGTCCGCTGCATGTCGGTCATGTCGATGATTGGCTATGTCATTGGC 
CTGGGCGATCGTCATTTGGACAACGTGCTTATCAACCTGGGAAGCGGCGACATCGTGCATATCGACTACAACGTGTGCTTCGAGAAGGGCCGCACCCTGCGCATCCCCGAGA 
AGGTGCCCTTCCGTCTTACCCAGAATCTAGTGCAGGCCATGGGCATCACAGGAATCGAGGTAAGACTATTGCTCATCCTATACCAATCAATTATCTCAACAAATCGCTGCTG 
GGGGTGTTTAAGAAACGGTGCTCCTAAAGATAATGTCCACTTTTATTGCTATATCTCCATTTAATTCTACCAAGTTAAAGCGTTGGGTTTCCCAATTCCGTTGATCACTGAG 
TGGCCTTCGGTTTGATCTCCTGAGATGAGGATGTGGCCTCCATGCGGTGCTTGCTGTCCGCCGGCAATTGATCCTCGCACTGCCTCGCCGTTCGGCAGCATAATGTTTGGAC 
AAAGGCTCGCCGAAAGCCAGAGTTCTTCCACGAGTAGATAATCGGATTGAGAGCAGAGTTGGCAATGGCCAGAGAGAACGTCGTCTTGTAGATCATCGAGCTGCTCTGGCAG 
ATGCTAAATAGCTGGGCAATGGCCACACAGAAGTACGGTAGCCAACAGAGCGTGAAGCAGCCCATGATGAAGACCACAATCTGGACGCTCTTCCAGTCTGGATGGAGCAGCA 
GATTTCGCATGGTGGTGGCCTCGTCCGTGTTGTAGACCACCGACTGTCGCAATCGCAGCGCCTGCTTGGATGCCTCGCGCATGATGCGCCAGTAGACGAGGAACATACAGAT 
CCAGATGATCACAAAACCCGGCGTTATCACTCCGGCAATGTAACCAGGCGCGAGAACTTCGTCGAATTCGCACGCCTGTGCATCCGGCCATCGATTCCAGAACACTGGCAGC 
AGAGCCACCAATGCACCCAAGCACCAATTGAATATGATGATGCTATATGCCACTCGACGGGTCATGTATCTACGGTTTCAAAGGGTTTCACATATGTAGCCATTGAATTAGG 
ATTTATCGCGTTCGCTTAATACCTTACCTTCTATAGTGCAGAGCATAGACGACTGCTATATACCGATCCACCGCAATCGAGATCAGAGTCAACATGGACACGCAGCAGGCGC 
AGATGAGCAGGAAGAAGCGCAGCAGGCAGGGACCTCTCATGGCGCCAATGTCCGAGCCCATGTAAAATACCAGATGATAGGGCAGTGCCAGTCCCACACAGAAGTCGGAGAC 
GGCCAGCGACAGGATGAACAGATTGGAGATGACCGATCGCAGGTGGCGGCAGGTGCGCACCGCCACAATGGTCAGAATATTGCCGCCCAGGATAAGGACAAACAAGAAGGCA 
TTGATGGCTAGCCACAGCAGATGGCCGGCACCGAAACTGGTCGCCTGGAGGGTTATTTCCGTTGGCGTTTCCGTGGCGTTCTGGTGCTGCCACAGGAACGGGACGACACCCA 
ACGCTTCCGGGGAGCTTCGATAATCCCAAGCGGGCGATGAGCTGCTGGTAAATGCCATGAGCCACGATGGTAAACGAAGCGCACGCGACGCCAGTGGTTGGCTATCTGGCAA 
CCGTGGGAGCTGGAGTTGTGCTGACTTTTGGCCAGGACTTCGGCTCGGCTCGGATCGGATTCAGATTCAATTGGGTTTCGGCTCGGACCGTGGGCGTCGTTTGTTTACGTTT 
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CGCTGGTGGCTTTACGACATCGTCATCGCGCCAGTTAACTGATAGCGCGCCCAGATCCACCGTGftGAACTGCAATCCGTTGfiTAAGCCCTCCACTTTGGCCCAACCACCATT 

CACCACCCTCATACCTAACCACTCACCCACCCATCGAGTCAATCCAACCACTCCAACCAGCCAAACTATCCATTGCCCCCGAGTATTTCGAGTGTCCGGACTCCCGGGCCCA 

ATGCATGGCCAAACAGATCGCTGGGGGTTTTAGCAGATTAGCCCAGCTCCGCGGACCAAAATAGATTGATAAAAGGCCAGGGCCAGCAACTGGGATACGTGGTAAACAGATA 

CATCATTGATArTCCCAAATTTCCTCGATACATTGCAAGCGAAGAAACTATATGTAAATAGTGATATCATCATAATTGTTATAAATCCGAGACCAACAGTACAACCAGTACT 

ATTAGGATTGACATGGTTATCCACTCTTGGTTCTTTTAGAATTAGAAATCAGGAGAATATATAGTATATATAGGTACAACTCTCTGATCTAATCCCATATTTGAATTATCTT 

TAAGATGAGATATTAAAGATTTCATGGATCAGAACAGACCAGGAGTTTGCTAAGGATG 

(SEQ ID NO: 133) 

Exon: 2186.. 1708 
Exon: 1637.. 1001 

Start ATG: 2186 (Reverse 3trand: CAT) 
Transcript No. : CT38338 

ATGGCATTTACCAGCAGCTCATCGCCCGCTTGGGATTATCGAAGCTCCCCGGAAGCGTTGGGTGTCGTCCCGTTCCTGTGGCAGCACCAGAACGCCACGGAAACGCCAACGG 

AAATAACCCTCCAGGCGACCAGTTTCGGTGCCGGCCATCTGCTGTGGCTAGCCATCAATGCCTTCTTGTTTGTCCTTATCCTGGGCGGCAATATTCTGACCATTGTGGCGGT 

GCGCACCTGCCGCCACCTGCGATCGGTCATCTCCAATCTGTTCATCCTGTCGCTGGCCGTCTCCGACTTCTGTGTGGGACTGGCACTGCCCTATCATCTGGTATTTTACATG 

GGCTCGGACATTGGCGCCATGAGAGGTCCCTGCCTGCTGCGCTTCTTCCTGCTCATCTGCGCCTGCTGCGTGTCCATGTTGACTCTGATCTCGATTGCGGTGGATCGGTATA 

TAGCAGTCGTCTATGCTCTGCACTATAGAAGATACATGACCCGTCGAGTGGCATATAGCATCATCATATTCAATTGGTGCTTGGGTGCATTGGTGGCTCTGCTGCCAGTGTT 

CTGGAATCGATGGCCGGATGCACAGGCGTGCGAATTCGACGAAGTTCTCGCGCCTGGTTACATTGCCGGAGTGATAACGCCGGGTTTTGTGATCATCTGGATCTGTATGTTC 

CTCGTCTACTGGCGCATCATGCGCGAGGCATCCAAGCAGGCGCTGCGATTGCGACAGTCGGTGGTCTACAACACGGACGAGGCCACCACCATGCGAAATCTGCTGCTCCATC 

CAGACTGGAAGAGCGTCCAGATTGTGGTCTTCATCATGGGCTGCTTCACGCTCTGTTGGCTACCGTACTTCTGTGTGGCCATTGCCCAGCTATTTAGCATCTGCCAGAGCAG 

CTCGATGATCTACAAGACGACGTTCTCTCTGGCCATTGCCAACTCTGCTCTCAATCCGATTATCTACTCGTGGAAGAACTCTGGCTTTCGGCGAGCCTTTGTCCAAACATTA 

TGCTGCCGAACGGCGAGGCAGTGCGAGGATCAATTGCCGGCGGACAGCAAGCACCGCATGGAGGCCACATCCTCATCTCAGGAGATCAAACCGAAGGCCACTCAGTGA 

(SEQ ID NO: 134) 

Start ATG: 1 (Reverse strand: CAT) 

MAFTSSSSPAWDYRSSPEALGVVPFLWQHQNATETPTEITLQATSFGAGHLLWLAINAFLFVLILGGNILTIVAVRTCRHLRSVISNLFILSLAVSDFCVGLALPYHLVFYM 

GSDIGAMRGPCLLRFFLLICACCVSMIiTLISIAVDRyiAVVYALKYRRYMTRRVAYSIIIFNWCLGALVALLPVFWNRWPDAQACEFDEVLAPGYIAGVITPGFVIIWICMF 

LVYWRIMREASKQALRLRQSWYNTDEATTMRNLLLHPDWKSVQIWFIMGCFTLCWLPYFCVAIAQLFSICQSSSMIYKTTFSLAIANSALNPIIYSWKNSGFRRAFVQTL 

CCRTARQCEDQLPADSKHRMEATSSSQEIKPKATQ* 

(SEQ ID NO: 135) 

Name: Adrenergic receptor- like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384675 

GCCTAAACGATAAGATTAGAAAATGGAAATGGAAATGAAGACAGCCACCTGACGCAATCTCACATGCCAGAAATTTGTCGTTTGCCATCCAAGGCCCCGATTTCCGAAATCG 
AAAATCCTTGATATCCAACTGGACATGACCAGTTTCCGGAGGAATGCACTCCAGTTACACAACTGGCACAAACATTTGTTTGCTGTTTTCTGACCAAAAACAAGCCACACTT 
ATGCTGGCACTTTTTTGTTGACAGAAATATGCAAGCAGACACAAGTAAATATTTATTTGATTATTTATTTTAAAGAAATGTGCAAGCTCACCGAATAAATATTATATTTTAA 
AATCAATAAGTAGAAACCTAAACAAATACAAGCTGGTGAAGCGATTGCTCTGCAAATTCCACATATTTAAACTATTAATTTACAAAAATAAATGTGGCGGTCAATTTCCAGC 
TGAACAGTTCTCAACAATTAATGAAAATAATTGCCACTGATAATAATAGTTTTTTTTATTATTCGATTACATACTTTAAATATTTTTTTAGTTTCCCAATTAGAAAACTTTT 
TTTTAACAAAATATTTAATATTCCGTAACAGTATTTATGCTTAATTACCAGATGACAGTTTTTATTTCTTTTGACTATACTTACATTATTTTGAAAAAATAGCAATTATTTT 
TAGACTTCTGCCACGAATGCCTTTAATTAATAGCTGATGCAATCGCTTAAACTCAATTAAGTTACACACGCAACCCACATCTTCTGTGTAAAAAAAAATCAATGTATTCGAC 
CATAAATAATTCCGCTGCAATCAGTGCAACAGTATTCAGTCCTGGAACTTTAATAGCGCCAAGCTGTTGGAAAACCAATTAAATTCCCTTTCAAATGGAGCGTAAAACGCAA 
AATACTTACGTATGATGGGGGCGCCACACACTCGCACACGCATTTATGAGCTCGGCAAAACCCATTTGATTGACTAACGAAACTTATTTGCCACTTTATTTCAGCGGACGCA 
GTGCAGCAGTTCCACGACGACCATGAGCGACCAGATTGGCAACCCCAATGCAACATTCAGTGGCAGTGGCAGTGGCAGTGGCACCAACGTCGCCTCCATCGCCGAAAGTGTG 
GCGGAGAGCGGACCGGACTTCGATGCACTTCGGGCCGCCTGCGAAACGCGGCTAAATGCCAGCGGTCAACTGGCGGGTGAGTCGTGCACTGCGAGAAAATATATCACACAGC 
ACATAACTATTAAAGTTTATGCAGTTATTTTATTTGATGCAGTTGTAAAACGCTCACCAGTAAATTTCATTTAAATTTTAGATGTTTTTATAAACATAAATCAAAATAAATT 
TTTTTGCACTGTTTTACTTACATTCCCTTTCTTTATGCCAGTTATTTGTCTTTACTTAAGCGAAAGAAAAAATTCACTCTGAAATTGTAGCACTAGGGAAATATAACAGTTT 
TTGGTATTTCACTACATATTTTTGCCCGTGCTTCTGGCTCTGGCCAACTGCAATTTCGGCGTCCATTATGCACGAGTAACTTTAAATTGCTGACCTAACCACTCCCCTTTCC 
TTTTGGCCCCTTGGCAAACAGGCTCCGGAGGACCAGGCGCAGAAGCAGGAACCCACTGTGCCGGCACCTTTGATGGCTGGCTTTGTTGGCCGGATACGGCTGTCGGCACTTC 
CGCCTACGAACTCTGCCCGGACTTCATCACGGGATTCGATCCAGCAAGTAAGTGCGCCGGGGCAGGAGCTGAATTGTTTATTAAATTCCACAAAGCTACATCCGTTTCGGGG 
TCAGCTGGGGATCTGCATATGACACGAAAAAATATATATATATACATACATATATCACATGTATGTATGTTACATATAGGTTAAATAGCATGCCAAAGGGAAATGTGTTTGC 
ACGAGGCAATGCGGTTGGATAAGTTGGTTGGGTTGTTTGGTAGGTGGAGTGTGCCAGTCTACCCTCTAATTGGGCTCATCCCCGAGCTCATCGTTATATACACATAAATGGA 
ACAGAAAATACGCTCCAAGCCAGGCCAAATCAATTGGAAACTTGTTATGTTTATGCTTTGATTTCGTGTCGTATGGAAACATCAAGTATTCACAAAACCACACGGGCTAAAG 
GCAAAGCTCCTTGGAGCTCCCTGGGCCACTAAATTATCCAAAAGAGCTCGACAAAATGCTAAAATGTTTCGCCTGATTTGGCACAATTTGCCAGAGTCTAACCGAAATCAAA 
TGTAAGGCAGAAAACACGGCCAAATTGATTGAGGCTGCTGGGCAGAGTAGAAGTCAAATATGTGTGTGGGTATTTAAAGATGCTGGAGCACGAAGAAAGGGGATTTAAATTT 
TATCATGTTTGTGAATTTCATATTTTTTTAATCATTAATCAGTGTTCTTGGCAAGGTGTTGATGACGGTCATAATAACATAACTGAAGAAGAAAACCCTTTGAATAGTTTCA 
CCAACTCATTGTCGTATCATTAACATATCATAAATAAAAATATCAAAGGTGACTTGAATTATTTCTCAACTTTCTGCCACACTGCGTGTATTTCATTCAATTATAAATATAT 
ATATATATAAATATAAATATAAATTAATCAACTGATTGGCCAAACGCACAAAGAAAACTAATTAATTATGCTTGGCATTAAAGCAAACGTTTAAAAACAAAGCGCACGCTCT 
AGAAAATCGTCATAAATGTGGATAAATATTTGAACCAATTTATGGCATAAATCATGCGCTGCATGATAATTTGCATTAATGGTATACAATATGTAGCCAGCCAACTTAAACC 
TCAACCTCGTTTTGGCCATGACCAGGGCAACGAAAATAGCGAGAGTTGAATGACTTTTTTTGTTATGACAGAGCGGAGAAAAACAAGGCGAGCGTCAAGAAAGTGCTACTTT 
TGCTACATTTAGCTTGTGGTTAGCATGGCGCCACCCCCTTTTTCATCCTTTTTGCATTTTTTTTTTTGCCTTACGGGGTCGCCGCCAGCTTGTACAAGATTATGTGCTTTGG 
CACTGCGCCGCCATTAACTTTTATGGCACTTCCGGTCGCCGGATGTCGATGTGGTGAGCTGTGGTCGAGAACTCCCTCACCCCCCGGCTACTTATCTGCCACACAGCGCGTA 
TACGTGATTTTCTCGGTCTGCGTGCCAAACGGCGATAAATGGATTCTTGGCCAAAGATCATCGGTCGAAAAAAAAAAGAAAACGGAAGAGCCGGAAAAACGAACGACCTGAT 
AAAACTCACCTTCGAAGTTCCTAACAAAAATTCTCCAGTAATTTTTCTACAAGTGCCACTCGGTAAATTGCATTGCAAACTTGGTAATCTACAATCGAATCGAATGGAAAGT 
TTTTGTCACGAACTTCGAATTGATTCGACTTAAACACACACATGACCCAAACAACAGAATGATTATGCAATTTTCAAATTGATATGCTTGCAATTCGAATGCGAACCGCAAT 
TCTTTTGCGGTTAACTTGGTGGCAGATGCAATAAATAATGGTTAATTCAATTGAATGCTCGGGCCGACCGCAAAATGTAAATATGAGCCGGAATATTTATGCAGTAGTCAAT 
CATGAAATCACACATATTTAGCAAGTGTAAATATTTGTGTGGTGCACAGGATGTGAAAATGTTTGCTAGAAAAAACGGTAAAGAATTAATCGTATATACAATATTTATTGAG 
GAGCTGTAATAAGCAGAGTAAACAGCATGGTTAATTATTTTTTATTTATCGCAAACTGACATTGCTTTAATTATGCAGGATATGCCCACAAGGAATGCGGTCTCGATGGCGA 
GTGGTTCAAGCATCCGCTGACCAATAAAACATGGTCCAACTACACAACCTGCGTAAATCTCGAAGACCTCAACGTAAGCCGATCCCATACATAAATATCCTATATATCCTTT 
ATGTAATTAATTAATCGCAACACTATATATCCAGTGGAGGCACACAGTGAATCTGATCTCCGAGGTTGGCTACGGCACCTCACTGCTGGCCATTCTGCTGTCGTTGGCCATA 
TTGGGTTATTTCAAGTGAGTATGGCCAAGGGCTTTGCACTCCTTCATCTTGGCCACAAGTTGACAGCTTCGTTGGGAGATGGCAAAAACACTTAGGGGTAATGAGGCACGCT 
GGTGACTTATATTGCGACTTATATTATGACGACTGGCGCCATAAGTGAAATTTACTGATGGCAAACAAATGCGACAGTGGCTTAATCAACTGCCAACCTGGATGCTGGATGC 
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CCAAATGCTCATTGCACTCATTGCTCATTTAATGCGGCACAATTGACGCCGGAAATGGGAAAATACATTGTGTCCCCATCTCCTGGTGACTCGAAGCAGGATATCGATTAAT 

TTCCCAGTGCCTGCAGACCAATTAGTGCTCTGGGTTTGTTTATCGCACATTCGAATGGAACTGGGGCTCAAGGATAAAGGATAATGGATGTGCGGACTGAGGGGCGAGATCC 

CATAGACGGTTCGGCCAACTGACAGAACTGTTTTAAATTGTGTGCCAAAGGCAAATAAAATGGGTCCAGATACGTGTGGGAATTTGGTCTGCCCTAAAAAAGTGAATAATTA 

AAAGTTTAATGACAGAGCAATGGGCAACACCAGCCCCGGAAAAACGCATTCCTCAAACAGCTGTAATGAAAATACCAGAGTTTTCATTCAAATAAAATATAAGAGTATAAAG 

GAAATTGAGTCGTTGTGAGCGGCCAAAAATAAATAAAGAAGTATTATTATAAAATATACCATCAGTATATTCTCAGTCAATAATTGTATTGCTTAAAGAAAGTTTAGAAGTG 

AAAAGTAGTATATAAAATAATTTATTTAGACAGAAGTAAATTAAGTAAAGTAGAGAAGGGATTAACGAAAACAATCTAACCTATTGAATTAAATTGCTGAAAAGAAATCAAA 

CTAGATTCAGTTTGGCCAGATGTCAGCAATCAATGCAATTGAGCAAAGGAGAAACGGAAAATGGGAAAATGGACAAATGAACTTTGTCCTTTTGGTCGCAGGACTTTTTCTG 

CCTCTTTCGGCCCTATCCAAGCTGCCTCACATGCCCCTGGCCCCGAATGCAAATGAAGCCATAGACAAACTTGCAAATTTGGCCATAAAAACTTGCCACAAATTGCCGGCCA 

AAGTAATTCGATTCGCATATTTGTCTTTCGATTGAACAGATCCCTGAAGTGCGCCCGCATCACGCTGCACATGAATCTGTTCGCCTCGTTCGCTGCCAATAACTCGTTGTGG 

CTGGTCTGGTACCTGCTGGTCATGCCGAATTCGGAGCTACTGCATCAGAGTCCGGTAAGTGTGCACACCTGGCGAATGCGTAATGGTGAATCAACCTCGGCTTCCGATTTCC 

ATATTCCATACTCCGTGCGAACACCCAGATGCGCTGCGTTGCCCTGCACATAACGCTACACTACTTCCTCCTGTCCAATTACTCCTGGATGCTCTGCGAGGGATTCTATCTG 

CACACCGTCCTCGTCGCCGCATTCATTTCCGAGAAGAGGCTGGTCAAATGGCTCATCGCATTCGGCTGGGGCTCCCCGGCCATCGTCATATTCGTCTATAGCATGGCTCGCG 

GTCTGGGCGGCACGCCCGAGGACAATCGTCAGTAAGTGAATAATGAATGGGTACAGCTTACACCGAAAGAAAATGAGGAATGCCAATAGTTATCAAAAGTGACCGACAACAG 

ACAAGGTGTAACCATATTGTGTTGTCTAGAAGTATCCTAGTTTAATGAAATCCCACAGACTTGAGTATCACAGATTGATATCAAGATATTTCTGTGTAGCTAAGATATTCAG 

TCTGTCGAGCTCTTGACCAAATGCTAATCAAATCAATCAGCAATTCACCCACTGACTCATCTCGCATTCCTTCGCCAACTTGCAGCTGCTGGATGAACCAAACCAACTACCA 

AAACATTCTTATGGTGCCTGTGTGTATCTCCATGTTCCTGAACCTCCTGTTCCTGTGCAACATTGTGCGAGTGGTCCTTTTGAAACTGAATGCCCCGGCCAGTATTCAGGGC 

AGCTGCGGTCCATCGCGAACGGTTTTGCAAGCATTTCGGTAAGTTTTCCCGTCCAAAACAAAAGAAAAAAAAAAGAACTGTTGGGGGGGAGATTGAAACACTTATAAGCGAA 

GCAGAGAACTCTCAACTGGCCCACGGGGCGAATGCGTAATTATATTTTGTTTCCCTTGTATTTAGACTCTTTCCCATCGGCCATCAGGTCTTTTATTGTTTATTCGCTCTGC 

ACTCTGGAAGTTGGGCTCATTTTACATTTTGAATTACGCATACGCCGCGTTGACTTGCATGTTAATACACAGCTGAAAAACGAAAACAAGGCATTATAATTCACCCTCTCAT 

CCTGTTTTAGGGCAACGCTGCTTTTGGTTCCTCTGCTCGGCCTCCAGTACATCCTTACGCCCTTTCGTCCTGCACCCAAACATCCCTGGGAGAATACATACGAAATCATCTC 

GGCGTTTACGGCCTCATTTCAAGTAAGTAAAATGTTTTCGTTTCGTTTCTGTTCTCGTTTCGTTTTTTTTTCGTTTTTTTTTTTTGCTTTTTTCTTTCCGGCTTTTGTTTTG 

CACGCTCTCATAAAATCACTTTACGACTCCGAGCTGGCTAAAAAGTTATGATGTTAGTTCGCCTCATGGTGCCATCCACCACTGGGAATGCAAATAAGCCAGCATTACTGGA 

ATACCGAATACCAAATACCAGGACTAGCAGCTCATTAGATCGGGGCTCTTTTCAAATCAATAATGAAAATACACGCCATGTGAAATACCCACCATTGGCATTTCACTTGATC 

TCAGTTTATTAAATTGCTGGCCGAGATTGGAGCGAAGTTCCCCCGTTGGAAAAATGTAGCAAGTTGAGCATGAAATGCACTCTCGCACACAGGAACTCGTAAAAGCAACTCG 

AGCTTGTCCCTCGGCCAACTCAGTCTCGATAGACCAGCATTTACACATGGTTAACACACATACATATATTTCTTGGCCGCAACTTAAAGACAACGACGGCAAACTTCATCCA 

GCTACGTTGCCTTCTTGTTTGTTGAATACCCAAAGAATCTTTCAGTTCATGTTTGCAGTACACAGATAAAATAATTTGCATATATTTTCGTGTTTTTGTGCACGAAATGGTT 

TTTCATTACAGACTGTTGATATAACAGTTGCCATGGAGGTAGAGGTGCCTAAAAGTATGTTGCAGCTTAGATGGCTTTCCTTTACAAAATTTCAAAAGTAATTTCTAAGGCA 

CCCCCTAGCTGTTATTCGAAAATCTTTTAGTTCGATCTAAAACCCAACTCAATCGGTTCAAAAAGGTAGCCCATTGAATTCATAAACAGCCCTAAAACAACAGGAAAATCCA 

TTATCTTGCGAGTAGACATAAAATCATCTTGGGTGCCACTAAGCTGCAGTTGAGCTCTTGACTATTTTCTATAACTTGGCAAACTGAACATTGTCACCTCGGTCTGAGCCTT 

TTTTCATCATTGCCAATCGATAGGTGAAATCTTTTATTCGACTGCTAAAAGTTTAAAGCCAGTCGACGGGTACTTTTCCTTTGCATTTTCCTCTTGTCGTATTTATTCACTT . 

TTTTTTATGTCGCTGTTTTCCGTTGGCTTATTTGCTGCAAAGA 

(SEQ ID NO: 136) 

Exon: 1001.. 1196 

Exon: 1590.. 1727 

Exon: 3775.. 3881 

Exon: 3955.. 4046 

Exon: 5156.. 5318 

Exon: 5405.. 5631 

Exon: 5910.. 6086 

Exon: 6395.. 6659 

Start ATG: 1031 

Transcript No. : CT38445 

CGGACGCAGTGCAGCAGTTCCACGACGACCATGAGCGACCAGATTGGCAACCCCAATGCAACATTCAGTGGCAGTGGCAGTGGCAGTGGCACCAACGTCGCCTCCATCGCCG 

AAAGTGTGGCGGAGAGCGGACCGGACTTCGATGCACTTCGGGCCGCCTGCGAAACGCGGCTAAATGCCAGCGGTCAACTGGCGGGCTCCGGAGGACCAGGCGCAGAAGCAGG 

AACCCACTGTGCCGGCACCTTTGATGGCTGGCTTTGTTGGCCGGATACGGCTGTCGGCACTTCCGCCTACGAACTCTGCCCGGACTTCATCACGGGATTCGATCCAGCAAGA 

TATGCCCACAAGGAATGCGGTCTCGATGGCGAGTGGTTCAAGCATCCGCTGACCAATAAAACATGGTCCAACTACACAACCTGCGTAAATCTCGAAGACCTCAACTGGAGGC 

ACACAGTGAATCTGATCTCCGAGGTTGGCTACGGCACCTCACTGCTGGCCATTCTGCTGTCGTTGGCCATATTGGGTTATTTCAATAATTCGATTCGCATATTTGTCTTTCG 

ATTGAACAGATCCCTGAAGTGCGCCCGCATCACGCTGCACATGAATCTGTTCGCCTCGTTCGCTGCCAATAACTCGTTGTGGCTGGTCTGGTACCTGCTGGTCATGCCGAAT 

TCGGAGCTACTGCATCAGAGTCCGATGCGCTGCGTTGCCCTGCACATAACGCTACACTACTTCCTCCTGTCCAATTACTCCTGGATGCTCTGCGAGGGATTCTATCTGCACA 

CCGTCCTCGTCGCCGCATTCATTTCCGAGAAGAGGCTGGTCAAATGGCTCATCGCATTCGGCTGGGGCTCCCCGGCCATCGTCATATTCGTCTATAGCATGGCTCGCGGTCT 

GGGCGGCACGCCCGAGGACAATCGTCACTGCTGGATGAACCAAACCAACTACCAAAACATTCTTATGGTGCCTGTGTGTATCTCCATGTTCCTGAACCTCCTGTTCCTGTGC 

AACATTGTGCGAGTGGTCCTTTTGAAACTGAATGCCCCGGCCAGTATTCAGGGCAGCTGCGGTCCATCGCGAACGGTTTTGCAAGCATTTCGGGCAACGCTGCTTTTGGTTC 

CTCTGCTCGGCCTCCAGTACATCCTTACGCCCTTTCGTCCTGCACCCAAACATCCCTGGGAGAATACATACGAAATCATCTCGGCGTTTACGGCCTCATTTCAAGTAAGTAA 

AATGTTTTCGTTTCGTTTCTGTTCTCGTTTCGTTTTTTTTTCGTTTTTTTTTTTTGCTTTTTTCTTTCCGGCTTTTGTTTTGCACGCTCTCATAAAATCACTTTACGACTCC 

GAGCTGGCTAAAAAGTTATGA 

(SEQ ID NO: 137) 



MSDQIGNPNATFSGSGSGSGTNVASIAESVAESGPDFDALRAACETRLNASGQLAGSGGPGAEAGTHCAGTFDGWLCWPDTAVGTSAYELCPDFITGFDPARYAHKECGLDG 
EWFKHPLTKKTWSNYTTCVNLEDLNWRHTVNLISEVGYGTSLLAILLSLAILGYFNNSIRIFVFRLNRSLKCARITLHMNLFASFAANNSLWLVWYLLVMPNSELLHQSPMR 
CVALHITLHYFLLSNYSWMLCEGFYLHTVLVAAFISEKRLVKWLIAFGWGSPAIVIFVYSMARGLGGTPEDNRHCWMNQTNYQNILMVPVCISMFLNLLFLCNIVRVVLLKI, 
NAPASIQGSCGPSRTVLQAFRATLLLVPLLGLQYILTPFRPAPKHPWENTYEIISAFTASFQVSKMFSFRFCSRFVFFSFFFFAFFFPAFVLHALIKSLYDSELAKKL* 
(SEQ ID NO: 138) 

Name: Calcitonin receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384816 

ATTCGTTCGGTATTTTCGCATTTGGCATGTTTGTGGCTAAGATATCATCAATTCAACACAATGATCAACAATATTCAACGACAAACACTCGACTTGGGACCCTGGATGCTTC 
ATCGGCGGCGCTCGGGAAACTGAATTTGACCCACTAAGGGAACTAGGCAAAATTGCAACGCTCTGTTGGTAAGCCCACCAAATCGAAGCGTTCAGGGGCGGCCAAAAGGAAA 
ATGAATCATTCTACCGATTTGAACAACGATCGAAACTTCACACTAGCCTATCATTTCAGTATCATAGTTGTATGTTGTATGGCTAAGAACTACTGGAAATGTCTTTTTGGGC 
TTATTTGCTTTTTGGATTTTAACATATTTGCTGTGTTTTATATTCGTTTAGTGGGCCTTGGTTTTTCGTTTCTTAAGCGCAATTCGCAAAGCGTGCGAAGATTGAAGATTTT 
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CCGAATTTTGAATTCTCATATTTCGGCCCGTCGTCftAGTTTGGCAAATTGTTTAATAAACGCTRCACGATTTCATTTTGTAAAATTCTAAATGCTftCTTRATftTGGACTTTG 
AAAAAGGTTTATACTTATGCAATCTTTAGATAATAATCCTGCTGCTCGAGTAGCATATGATTTTCGTATATTGTTTTTGTTGTAGCAAGTTCGAATTGTAAACCGGAAACGG 
AAATGTTGCTCTTGCCTAGTGAATTGTCTCTTGTCTACGAATTAGAATTAGAATTAGATCGCTGCTACATATGTATATGTATATGGCTAGAGTTAAGTACAAGTATATATAT 
ACGTGGTAAGCTCATAAATACACAATAATAATCGTCAATCAATTCAATCAATCAATCGGTGAATCAATCATAATTGGATACGCTAflAAACTCTTAGCTTAGCTTAGGTCTAG 
AAATATGAATATGTCTTTCAACGTGTCTGTGGTTTTGGTGTTTTTGGTGTTTCTGTGTGGTTTGCTGTATGGTGGCTGAGTGGTTGAGTGGTTGCGTTTTTAGTTTTCAGTT 
TTTCTGTGGCTGTGACTGTGGGGTGGTGCTTTCA'ITGGCATGTCGCGTAGCGCCAAACGGCCCGTAAATATGCAACTATATGGCAAACAGGCAAGTAAGTGTTTATATATAT 
TGAGAGAGAGAGATATATAGAGAGTTAAGGAAATGGCTCAATGCAATTACAACCAATTATTTACTTGCAGGACTTTGTAGTTCTTCTCAAAGGGATTACGGAGGATGGGTTT 
GGGAGGGGGATTTTTACGGATAATTATTGTATTTTTTTTTGAGGAGAATGAAAATCTAAATCTAACTCTAGTCATCTGGGTCCAACCCGATTAGCTCTGCTTGCAAACGGCC 
CTTAGTCTTACAACAAATGGTTAAGCAAAATACTCTCAAAGCGATACGGAATTGCGGCGACATACTGGTCATGATCAGAATTTTCCAAAACGAATTCAATATGCCGATCCAG 
ACGACGGCGAAGTTGATAAGAGTACTTTGGATAACATCGCCCGTGACCACCTCGTACAGACGCAACAGAATCCATGGCAGCCAGGACACCCAGAATGCAAAGACTAATGCGA 
ATGACATCATATGACTAGGGTTCGATAAATTTTCAGCCAGAGCAGTTGCATATTCTTTATCGTGTATCGGCTCGCCGGACCTAATCTTACGCATCATTACAAAGATATAGCC 
GTAGTTGTGTACGATTGAAATGAGGCTGGGACCTAAGACTAATATCGCCAAGGTGGCGCTATATGCTGCCATATTCCCCCAGTCCAGCATGCAAATGTGCGTCATGTTGTTC 
TCGATGGGCATCGAGTAGCCCAGGATTGGGGGACAGCACAGCAGGGCCGCCGATATCCAAGTGAACACCATCCAGCATTGGCATCTGCACGATAAAGGTCACAAGAGATCAA 
AGATTAATCCAGTTTTAATCCAGATGAGGAGGAATTCGGGGTTCCATTGGCGCTACTCACCTCGTTTTCGTCTGCACCGTTTCATATCTGAGTGGCTTCCGCACCGCAAGAT 
AGCGATCCACTGATATCCACATGAAGGTGTATACCGATACCGCCCACAGAGTGACCTCCAGATAGCCGGTGAAGCGACACACGATGTCGCCGTACATCCATTCTCCGGTCAA 
AGCGGGATACACGGAGAAGGGCACCACCAGGAGTCCGCAGAGCAGATCGGCGATGGCCAGGGACAAGAGGTAGTAGTTGATCACCTCTGTGGGTCCTGCGATGACAGCAATA 
TCGTTAGAAAACGAAGTAGAGGTGTATTCAAAAATAATAGGCAAATGCAAACAATAATATTTAAATGGATCATTCAATAATTTGGGCGAGTCTATTTGTACGACATTCACTA 
GAACTATTTAGAAACCTAAGGTTCTTAAATTTATCCAATAACAATCAATTTATAGGATAAAAGTTTAGGTACAAAAAGGGGAAAGAAATAATATGCTTAAGCACAATTGGTT 
TTTGAATTTTTTCAAAGCAATTAAGACAACAAGAAAACAACACATTTGAATTTTTGGGGAGGATGCCGTGTTACTTCATTCCTTTCACTTTCACATCAAAATCATGGATGCA 
TTTTAAGTATAAACATTGGACTTAAGTGGTGAAAATAAATTATATTCAACACATATGTAGGTTACTTATGAAAATATTATGCTTGAGGCCAAACCCAACCGCGAGATTCGAG 
ATGTTCCTGTGGAAAGTGTTATGTGAGTGTGATATTCGTGCAACGTACCCTTGAAGTTGGCATAGGTGGCGATAATGAGGAGGTTGGACAGGACGATGGCCACGCCCAGTAT 
CGATATGAGCACCGCCTTGGTGAGGGCCTCCACCTTGGAATTATCCTGTAGGTAGCTCATTTCCTGCATTTTTACGGGCTGGCACTGTCGGTTGAAATGGGCGCTCCTGTGC 
TCGTTGTATACGCTCCCTTAGCTCCTTGTCCGTTTCCCTCCTCCTTGGCTCCTCCTGCTTTCACCTGCTTTCGCTGCTGCTTTCACCCGCTTTCTCCGCACCTTCTTGGCGG 
GCTGATTGTTATGCAAAGGCCAGTGGACGTGGACGTGACCGTACTTTACTTCCGGTCATGGCGGACGCCCATGCAATTGTCCTGTGCCTTCGGTTGTGTGTGTCGGTGTGTC 
TGTGAGTGTCTATGTGTGCTTAGGTGAGTGTGTCGATGTGTGAGTGTGTGTGTGTGTCTGGGTGCGAGTGTTGGTGGCGTCCTTTATTGTGCAACGAGGCTTGCACACGCGA 
AGGTCGTCGGAGTCCTTGGCGCTGTTTTTTCGTCCTTTCTCCAACGTTGGCGCCAAATTTGTATAAATTTTCTACTGCCTACTTTTATTTGGGCTGCAACTTCTACTACTGC 
CCACCTTTTGGCGCCGAGCCAAGCATTTGATTGTTACCCTCACAATCCTGGCGTCGTGGTTAAGGTTAAGGATAAGGTTGTGCTGGTGTGTGTGTGTCTATGTGTGTGTCTG 
AAGCTCGTTTAAAATGCTGCTTGGGCTAAAATCTTACACGTAAAATCTAGTAATTGCGTACAATTTGGTTTCTATCTTAGAGGATCGTTAATGATTTAAAGAATTCGTTTTG 
TTTTGCTAAATTCACAACGCTCTTTGCTAAGTTTTAATGAACGTCAAAACTGTTAGTTGTATGCTTGAATAATTCGCTACTAGTTTTAATATCGATTTAAAAATGTACACAT 
TTCAAACTCGCAAGTGGAAGAATAATGATTATTAATTTAACTATTTAATGTGTAATAATTACGCGGTTTTCAGGGCTCTTTTTCTTTCGGGGTTAGTTGAAATATCTTTCGA 
CTATAATTAAAGCACAGCACAAATGGTTTTGTTGTTTTGTAAGGCTTCTGTTGTGCTACATTTAAGTTTGTTAAAATAGTTCGATAATAGTTCGGTTGTTAAGTATCGGTTT 
. TGCTATTTAATTTCGCTGCTTGGTGATTGTGATTATCCTTTTGCTTGTAAATTCTGGCTCTTGTTGCTGTTGTTGGTGTTTAAACTCCACGTCCTGCTGTCGTTTTAAGCTA 
ATTGTGCGCATCTGAAAGGACAAAGGATAGCAATTGTATGCCTTTAGCGCTTCATAAATAGTTTACAGAATCTTCCATCTTGAGCCAATCAATAAATTATCCCACATACGGC 
AGCAGGCGGAAAAAGCCTTTTAGCTGCACAAACGCTTTACA 
(SEQ ID NO: 139) 

Exon: 3185.. 2737 
Exon: 2223.. 1965 
Exon: 1876.. 1001 

Start ATG: 2869 (Reverse strand: CAT) 
Transcript No. : CT41076 

ACACACTCACACATCGACACACTCACCTAAGCACACATAGACACTCACAGACACACCGACACACACAACCGAAGGCACAGGACAATTGCATGGGCGTCCGCCATGACCGGAA 
GTAAAGTACGGTCACGTCCACGTCCACTGGCCTTTGCATAACAATCAGCCCGCCAAGAAGGTGCGGAGAAAGCGGGTGAAAGCAGCAGCGAAAGCAGGTGAAAGCAGGAGGA 
GCCAAGGAGGAGGGAAACGGACAAGGAGCTAAGGGAGCGTATACAACGAGCACAGGAGCGCCCATTTCAACCGACAGTGCCAGCCCGTAAAAATGCAGGAAATGAGCTACCT 
ACAGGATAATTCCAAGGTGGAGGCCCTCACCAAGGCGGTGCTCATATCGATACTGGGCGTGGCCATCGTCCTGTCCAACCTCCTCATTATCGCCACCTATGCCAACTTCAAG 
GGACCCACAGAGGTGATCAACTACTACCTCTTGTCCCTGGCCATCGCCGATCTGCTCTGCGGACTCCTGGTGGTGCCCTTCTCCGTGTATCCCGCTTTGACCGGAGAATGGA 
TGTACGGCGACATCGTGTGTCGCTTCACCGGCTATCTGGAGGTCACTCTGTGGGCGGTATCGGTATACACCTTCATGTGGATATCAGTGGATCGCTATCTTGCGGTGCGGAA 
GCCACTCAGATATGAAACGGTGCAGACGAAAACGAGATGCCAATGCTGGATGGTGTTCACTTGGATATCGGCGGCCCTGCTGTGCTGTCCCCCAATCCTGGGCTACTCGATG 
CCCATCGAGAACAACATGACGCACATTTGCATGCTGGACTGGGGGAATATGGCAGCATATAGCGCCACCTTGGCGATATTAGTCTTAGGTCCCAGCCTCATTTCAATCGTAC 
ACAACTACGGCTATATCTTTGTAATGATGCGTAAGATTAGGTCCGGCGAGCCGATACACGATAAAGAATATGCAACTGCTCTGGCTGAAAATTTATCGAACCCTAGTCATAT 
GATGTCATTCGCATTAGTCTTTGCATTCTGGGTGTCCTGGCTGCCATGGATTCTGTTGCGTCTGTACGAGGTGGTCACGGGCGATGTTATCCAAAGTACTCTTATCAACTTC 
GCCGTCGTCTGGATCGGCATATTGAATTCGTTTTGGAAAATTCTGATCATGACCAGTATGTCGCCGCAATTCCGTATCGCTTTGAGAGTATTTTGCTTAACCATTTGTTGTA 
AGACTAAGGGCCGTTTGCAAGCAGAGCTAATCGGGTTGGACCCAGATGACTAGAGTTAGATTTAGATTTTCATTCTCCTCAAAAAAAAATACAATAATTATCCGTAAAAATC 
CCCCTCCCAAACCCATCCTCCGTAATCCCTTTGAGAAGAACTACAAAGTCCTGCAAGTAAATAATTGGTTGTAATTGCATTGAGCCATTTCCTTAACTCTCTATATATCTCT 
CTCTCTCAATATATATAAACACTTACTTGCCTGTTTGCCATATAGTTGCATATTTACGGGCCGTTTGGCGCTACGCGACATGCCAATGAAAGCACCACCCCACAGTCACAGC 
CACAGAAAAACTGAAA 
(SEQ ID NO: 140) 

Start ATG: 317 (Reverse strand: CAT) 

MQEMSYLQDNSKVEALTKAVLISILGVAIVLSNLLIIATlANFKGPTEVINYYLLSLAIADLLCGLLVVPFSVYPALTGEWMYGDIVCRFTGYLEVTLWAVSVYTFMWISVD 
RYLAVRKPLRYETVQTKTRCO^WMVFTWISAALLCCPPILGYSMPIENNMTHICMLDWGNMAAYSATLAILVLGPSLISIVHNYGYIFVMMRKIRSGEPIHDKEYATALAEN 
LSNPSHMMSFALVFAFWVSWLPWILLRLYEVVTGDVIQSTLINFAVVWIGILNSFWKILIMTSMSPQFRIALRVFCLTICCKTKGRLQAELIGLDPDD* 
(SEQ ID NO: 141) 

Name: putative adrenergic receptor 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384685 

TAAATATGCATATTGAACGAGGCAATGCAATGCGATTGAGGTCCGGCTGTGCGTTTCAGTTGCGAAATGAGTTGCGAACAGGATTTATATTCAGATTTAGCCATTAGCCAAA 
TGCAACACCTTCGACGACGTGTGGACACGTCGACTTTGATTCGGCCAACTGAGTGATCCGGGGTAGTAGTGTCAGATGCCAACATTACTTATGCACTGAAAGAAATAAATAA 
AGGATTTACATGTGTAGTAATGTATAATTGGGCTGCATGACATGCATGATAATGATCAACTTTCAAGCAATAATAATTCTATGACATCACGACATTCATACGGACAGAAAGA 
CGAACATGGATTTACCATTAAAATGTAATTATAATATAATATAATAAAGTATCTAAAGAATTTTTTACTTGGATCTCCTAGCAATTTTAACCATATTCTCAAATGCAAAATG 
TGAAGTTCTTGCAAATAGATACACAGACTAATGGGTCCGCATTTGCCAATTAATTGCCTACAACGCCCAGGTTGCTTTGAATCACAGGTGGACATTTGGACAGGTTTGCTCA 
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GCTCACTGGCAGCTCCACTTTGGCCATTCAAAATTGTTTflTCGCTTTGACGAGTTGCAGTAGCAATTTCGACACCGTfiTGCCCATAATGAAGAAACGGCATTGATAATAACA 

AAGTAATGATAAATTAGATGCTGTTGACGTCTAATTCATTTTTGATGATAGTATTGATTTTATACATAAATGACACAATTTCCCTTTTATTTTGTAAAACATTATGGCATTT 

TCACTTTAATGGTCCTGTAAAACACAAGTTAGTTGTAAAGCACAGAAAAACTTTCCATTTCGATGACAAGAAGTCCTTTAAAATTCAAATAAATATATAATTCTAATTCTAA 

ATAAATAATTCTAAGCAATGGTTTATGTATTTAAACTCCTTAAATTTTGTTCCAGTTTTTAACTAGCACATATACATCAAAAGAGCGGAACTAGATAGTCTACAATGATGCA 

AGAGACCGGCAATCAAATGGGACAGACCCATATGCACCAGCGGGTTCCTTTCAACGACACGGTTCTGAAGGACTATCATCTCACGAGCACCGATATCGAGAAGTTTGTAAAG 

CTATGGCAGGAGTACCAGATGAAGAACATGACCCCTCAGGTGGATGAGTGCCAGGGCTATTGCCAGGGGGAAATATACAACTGGTTGCGAGCCTACAACAGCATCCATGGAT 

ACGTTTCGCTGATGGTAAGTAATACTCGTGATTACAACTCCCCCTGGGCACGAAACCCGCAATGTTGGCCAAAACAAAGGAATCGGTTGACTAATGATGAGCAAGGGAAGCC 

AAGACGATACACACAGACCTTCGCCGACAGACCACTGATGAATGTAGGGGACTTCTTTACCTTTGAACTCTTGCCAAAGTCGTCCTTTTTTCGAATTAAAGTTATAGCATGC 

GCTCAGCTTAACCATCAAAATCACAAATGAAATTAGTGGCCGGGAATCTCGGTCTTTTGTTTATGTCCGGGTTTCCCTTTTCAGGGCTTTTTTGAAAAGGATATAAATTATA 

TTAGATGTAGTTAAGCCGTTGAGGCGTTAATGTGCTTTTTTCATTATAATTTTCAGTCGTAAGTCATTTTGCGTTGTAGCAGACGGCCTTTACCACAGGAATTGGGTCAGAA 

AATACCGAGAATCCATTTCCATCAAGGCTAGTTCATAGCCCATCTTTTCAAATAAATATACAATAAATCAAATACAGTTGAAAAATGTAACTTTATTGCTTTCCTTCGTTCG 

CACAAAAAATAGTTTACCTCTTAAATTGAAATAAAATTCTGAACCAAACTTCGGTTACTGTAGTTCCATTTTTTTTTTTTTTTGCTTGTGTGAGCAATGTTAAGAAATGTAA 

ATACGAGAGGTCACGACAGTTGAGTTGCTGCTCACGAGCTGAATAAATAAAATATTCATCCACTTTGGTGTCCAAGGGGAGTGCATTCAGTTGAAAGTATGTGAGCGCTCAC 

AGTTTCGATTTGACAAAGGACGCATTTTATGTTGATTCTTTCTTGTTTCTTTAAATCACCTAGATTTGCATTTTCGGGACAATAGCAAACATCTTGAACATAATGGTCCTGA 

CCAGAAAGGAAATGGCCAAGACGCCCATAAATAATATACTTAAGTGGCTGGCGGTGGCCGATATGTTTGTGATGCTGGAATATATACCCTATACATCGTATCAGTATATATA 

TATGGGGCCAGGTAAGCATACATTTAGTAATAATATTTAATAATAGTCTTGGTTATAGCTTTACTTGTTATTTATTTATTAGTGTTATCAGCTAGATTATTATTATAAAGGT 

ATACTCTTGATCAGGATCTACACACGTCTGTCTGTCCGTTTCTAAGGGTATATTTTGTTAACAATTTTGTTAACAATTCCCATATATCCTTTCCCCTTTGCACTTACTTTTA 

GTrGAGTAACGAGTATGTGGTAGTCGAGGCAATCGGCTATGACCTTCTCTGTAGGTTTCCTTTCTTTTTTAATTTTAAGCCGTTTAAATTTTTTTGAATTATTTAATATTCT 

TTGTATGGCTGTTGCAGGTGAAAAAGATTTGAGCTACACCTGGGCGGTTTGTCTGCTGGTTCACATGCACTTCACCCAGATCCTGCATACGATTTCCATCGGATTGACTGTC 

ACGCTGGCGGTGTGGCGGTATGTGGCTATCAGGTGAGATGATCCTTCGGCCAGCTATGCATGACTTATTCCGGAATTCGCCGTCATCCATGTAGAGTTTGTCTCCGGTTCAC 

ATTCATGATAATCCACATCCGCAATTCGAACAGACATCCGGCTAATCTGATTACCACATCATTGTGGCAATGTGGCTTCTCTTCTCTTCCCCCGTCTTTTGTTTATAGCCAA 

ATGTTACCGTCTCTATTCCACAGACATCCGAACGGGGGCTGTGCCAATTTCCTGCTAGCACATTCCCGGGAGGCGATCCTCCTGCCCTTCATCCTGTCGCCGATCCTCTGCC 

TGCCCACGTACTTTGTGTTCCAGGTGCGTGAGACGTACGACGTGGACAAGGTCAACTCGGAGGCCATGTATCATGTTTATTTCGACAAGGACTCGGTGCTCTACAGGTGCGC 

TTAGATAAGAAGTCTGATGTCATTTATGCATCCTGCGAAGAACCAATTGTAAAATATTTCATCGTGAAAATCCTTAACAGATTCAACTTCTGGATACATTCCGTGCTCATCA 

AACTTCTGCCGTGCGGCATATTAATTGTGATCAGTGCAGTACTTATGCACGTTTTGTGCGAGGCATCGAGACGTCGCTTGAAGCTGAGGGATTACAACAATCCCGCCAAATA 

TGCCATCCAGCTTAATCTGAATGAAACCAAATCCAAGAAGTAAGTACGCACTTGACTCGTTTAGTGCTTAAGAATTCGCCTCATTTGCAGTTTGCCTTGTTAGTTTCATGAT 

CTTTGATTTTTCTGTTTTTGAATGGTCTCGGATTTTGTCAAATCTCTTCTTAGGTTCCTTTTTGCCATCTTTTTTATAAATTTGTCTTTTTGTTTTTGTATCTTAGGCCGCC 

TCGCTGTGATCGCCGTAACGATCGGACTACCCTTTTGCTGGTTGCGGTACTGGTTTTGTTTCTGATCACGGAGTTTCCGCAGGGATTACTTGGTCTGCTGTCCGGAGTGATG 

GAGAAGTGCTTCTTCGCCCACTGCTATCCGCCGTTTGGAGAGCTGATGGATCTGCTGGCACTGATCAACGCCGCCGTCGGATTCGTGCTGTACGGTCTGATGTCGAAACAGT 

TCCGGACCACCTTCCGATCGCTGTTCATGAAGCGGCACTTTGGCAGCACCGAAATGACCAGATTGACCCGCGTAACCACGACTTGCGTCTAGTTGGGCACATCTAAAGATGT 

GATCATGGGCAGGAGCAAACACTAGTGAGGGATGAAGTAGGTGAGGGACATCCAAGATGTTCCGGCCCGGCATTGAATTTGAATTTTCTAAAGCATAAACTTTATAAAAGTT 

TCGTTGACTAAACAAAGCCTAAATATGAGAAACTGTGAGTGTGGAATTATTGTACGGGCTGACCAGTTGATCAGTTGATTTCCACAATGGTTTACTTTTAGGTGTAGTCCTA 

AGGCGTTGACTGGTGCATCGATCGGTTTAGAAATTGATTCTGCTTGTGATCTGGCTGTTTCGCAAAGTGCCATTTCATTTCCAAATAATTACATTGATTAGTTTACCAGTTG 

GTGTTTCAAATTCGTATTAGTATATAAGTTTCATAGGCTAAGTCAGCTGTACAAAGCTGTACAAATACGAGACATTTACTTTTCCATAAGATGCAGAAAAAAAAACAGGAAT 

CCATCATCAGCAAAACTGTAGCCATGCTAAGAGGACTGAAGAACTCACTGAGCCCCGAGTTGCTCTCTCTTCTTAGCTGGAATCACGAAAGAATTCTATTTTGATTGTATGG 

CGTTAACTGGTTTTCAACAAAGTGCCAATGCATTTTAAATTGGCTAATTGAATAACCATATTTTGAGTACTTTTTAAGTCGCTTATACTATTTATAAATAAATGTGAACACT 

TTCAAAAAAAATATGTTTGATGTGAGTTGCATACAAATGAAACGTTTATTGCTTTCCATTACCGAAAAATGAACATCTAATTCTGACTGACAACAGCATTCACCTCCAGTGG 

CAATTCGTCCGGCAGAAGTTCCGTGGCCCGATTGTGTCCCAAATCAGACTCGCCCATATCCGCGGAATCCAGCAGGATGCCATTGCCCTGATTCGAGATGCGCAGCAGGGAT 

TTCAGCTTGGTGGCTTGGAATTTGTACTGCAGGATAATGAGATTGATAAAATCTGCTCTCGAGGGAGTTTCGACCATAATGTAA 

(SEQ ID NO: 142) 

Exon: 1001.. 1246 

Exon: 2080.. 2251 

Exbn: 2594.. 2720 

Exon: 2936.. 3130 

Exon: 3217.. 3399 

Exon: 3579.. 3900 

Start ATG : 1001 

Transcript No. : CT32473 

ATGATGCAAGAGACCGGCAATCAAATGGGACAGACCCATATGCACCAGCGGGTTCCTTTCAACGACACGGTTCTGAAGGACTATCATCTCACGAGCACCGATATCGAGAAGT 
TTGTAAAGCTATGGCAGGAGTACCAGATGAAGAACATGACCCCTCAGGIGGATGAGTGCCAGGGCTATTGCCAGGGGGAAATATACAACTGGTTGCGAGCCTACAACAGCAT 
CCATGGATACGTTTCGCTGATGATTTGCATTTTCGGGACAATAGCAAACATCTTGAACATAATGGTCCTGACCAGAAAGGAAATGGCCAAGACGCCCATAAATAATATACTT 
AAGTGGCTGGCGGTGGCCGATATGTTTGTGATGCTGGAATATATACCCTATACATCGTATCAGTATATATATATGGGGCCAGGTGAAAAAGATTTGAGCTACACCTGGGCGG 
TTTGTCTGCTGGTTCACATGCACTTCACCCAGATCCTGCATACGATTTCCATCGGATTGACTGTCACGCTGGCGGTGTGGCGGTATGTGGCTATCAGACATCCGAACGGGGG 
CTGTGCCAATTTCCTGCTAGCACATTCCCGGGAGGCGATCCTCCTGCCCTTCATCCTGTCGCCGATCCTCTGCCTGCCCACGTACTTTGTGTTCCAGGTGCGTGAGACGTAC 
GACGTGGACAAGGTCAACTCGGAGGCCATGTATCATGTTTATTTCGACAAGGACTCGGTGCTCTACAGATTCAACTTCTGGATACATTCCGTGCTCATCAAACTTCTGCCGT 
GCGGCATATTAATTGTGATCAGTGCAGTACTTATGCACGTTTTGTGCGAGGCATCGAGACGTCGCTTGAAGCTGAGGGATTACAACAATCCCGCCAAATATGCCATCCAGCT 
TAATCTGAATGAAACCAAATCCAAGAAGCCGCCTCGCTGTGATCGCCGTAACGATCGGACTACCCTTTTGCTGGTTGCGGTACTGGTTTTGTTTCTGATCACGGAGTTTCCG 
CAGGGATTACTTGGTCTGCTGTCCGGAGTGATGGAGAAGTGCTTCTTCGCCCACTGCTATCCGCCGTTTGGAGAGCTGATGGATCTGCTGGCACTGATCAACGCCGCCGTCG 
GATTCGTGCTGTACGGTCTGATGTCGAAACAGTTCCGGACCACCTTCCGATCGCTGTTCATGAAGCGGCACTTTGGCAGCACCGAAATGACCAGATTGACCCGCGTAACCAC 
GACTTGCGTCTAG 

(SEQ ID NO: 143) ... 



tWQETGNQMGO/THMHQRVPFNDTVLKDYHLTSTDIEKFVKLWQEYQMKNMTPQVDECQGYCQGEIYNWLRAVNSIHGYVSLMICIFGTIANILNIMVLTRKEMAKTPINNIL 
KWLAVADMFVMLEYI PYTSYQYIYMGPGEKDLSYTWAVCLLVHMHFTQILHTISIGLTVTLAVWRYVAIRHPNGGCANFLLAHSREAILLPFILSPILCLPTYFVFQVRETY 
DVDKVNSEAMYHVYFDKDSVLYRFNFKIHSVLIKLLPCGILI VISAVLNHVLCEASRRRLKLRDYNNPAKYAIQLNLNETKSKKPPRCDRRNDRTTLLLVAVLVLFLITEFP 
QGLLGLLSGVMEKCFFAHCYPPFGELMDIXALTNAAVGFVI.YGLMSKQFRTTFRSLFMKRHFGSTEMTRLTRVTTTCV* 
(SEQ ID NO: 144) 

Name: TRH receptor-like 

Classification: G_protein_linked_receptor 



Start ATG : 1 
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Celera Sequence No. : 142000013384808 

GTTRAATGATTTACAAACAAGCCGRCATCTCGGGAACCATTAAAGAGACATTTAAAGTCAAAAAGTGGCCftACTGAAACCCGTTAAflATCCGCTTGCGACCGCAACTTGGCC 

CGACTAAGGCAAAAAATAACATAATCCTGTCAAGTCATGCAGACAGGGCGATTTACATTATAATTTTAAATGGCAATGGATTAGTCATCATCAGTACCTGCTTATCTGGTTT 

TCCTGATTTTTAAATCAAATCTGTGACTGTATCTCrTACCTTTCCTCTTGTACCTCGTTATTATAAAATGCGAAAACCCTCTGCCAAAGGACACTATAAAACGAATTTTTTA 

AAAGACGCCAAACAAAATGGCACTTTTTGTTGAGAAGAAAATATGCCGATAAGAACGTTTTACTTGTAAGACGCTGCAGAAAGATTAGGAACGAAAACTATCGACAATGTTT 

ACTTGTATAATCTGAGCTTTCTAAAACACAAAAATAGTGTACTTTTACCATACGCTTAATTTCGAAATTAATCCGGGATATACAGTTGGGGATCCAGATTTTAAATTCGTGA 

ACACAGCTGCTGGAGTAGGGCATTTCCTGATCGATCTGTTAGTTTTCGGACTTTTACTTTGGTTGTTGTAAACAATATCCGCAAGTTTCTCGCAAAGTGGGTTGCGTTGTGC 

CGTGCCTCAGTAGAGCACTACACACTACCTTCGATTGAGTTCGCTGGGATTTGGATTTCGTTCACAGTCCAAGTTTGGACCGTGCTAGGAGAAGATAATGATCCGATCACTC 

AATTAACTTCATGCACCCAGTGACGTCATCAGTGCTTTACTATTCGGTGTCTATATAGTCTATTTTGTTTAATTTCTTACACCCTAGATTTCGATACGATTCGATTTTCGTT 

TGGAAATTGTCTGAAGGTAAACAAGCATAAATTGGACGTTCAAGCGTAAACAAATCGGTTTTCGGAGCAAAATACAAAAATCGTCAGTTGAGAAAGTCTCTCAAATGGCGCA 

GTTCTGCATTCTTGGAGTCCTCCTCATTCTTTCTGGAACACACTGCTCGTGGGGATTCCATGAAGAGACACATTATCCGTGTGCCTTTATCGATACGGCAAACATAACGGGG 

AGCTACGGCTTAGATGGCCCATTTGTGCACAACTGGACGGTGATTCCCCGTCACTTCGTAGCCGTGTACGACTTTGTCATCGAAAATGGCATCCGCATCCCCGCAAGCAGGC 

ATCTCAGAGCCTGTGTCTGTAAGACCAAACCCTGTGTACGGATTTGCTGTTTACGGGGAGAAATCTACGACCTGGAGAAGAGGCAGTGCTTAGTCCCTGTGGCCGGGGTGTC 

CAGCCTTCCCTCCCATAGCCATATGGAGGTGGAGTTGGGCAACGGCAGTCTGCGTCTAGTGAAACTGCAACCCCGTTTTAGCATCCACGTGGAAACACCATGTGAGCACATG 

AAAGCCGTAACCAAGGGCTCGGAATACGTTCACTGGACACTTCATGAGGTCGGTTGTGAAATAAACCTTCCATCACATTAATGGCTTTACATCGCCTTTTCAGAACGGAACC 

ATCAGCCACCGAGGACACATATTTTCTAAGCATTACTGCTTCACTCCGCTGCTCCATGGGAACTCCACCTGGGAGTGGCAGCCATTGGCCTGTGCACCAGAAAAACTATATT 

TCGTCTTGGGGGTGCGGGAGTGGACATACGCCATATGTAAGTAGCAAGAGGAAACATTTTTCCAGCAACTTTCAGCAATTCATTATCCCAAATATGCACAATACTGTTTTCA 

CAACAATACATAAAAGTAATTGCTAAAGCACCAAAAAACAATCGATGATAGCATTAATAGTTTGATAGTCAGAATTTAATTTACCCAGCGTATTCACTTTCCAGGTCTCTTG 

ATAGCTATTCTATCAATGTTTATCGTGCTGATGGTCTACCTTATGTGTTCGGAAATGCGCAACAGTTTCTACGGTGTGGCCATAAAGGCGTACGCCATTTGTATGATCTTGG 

GATATGCCCTACTGGCTTATCTCACGCTCCACAACCCAGCAAATCTCTCAAATGCGGCTTGTCGTATTCTACGTAAGTGGTAAATTTGAGTATTCTGTGATATTTAAACATT 

CCCTTCTTTTTTTTTTTTAGCAAGTTTGGCGTTGATGAACTTGGTGCTTTCCTTTTACATTCTGAGCTTTATTGCATTTAAGCTGTACTTGAGCTTTTATGGCGTTGTCTTC 

ACAAAACTGATGTTTTGGCTAATATTTACTCCCATCGTACTGGTAGCCCTGGGCTGGTCTTTCTTTGTGGGTTTCAGTTATTACGGGTCTAGGCTCATCTTCGGAGGCGATA 

CATGCTGGTTTGATCGTAAGCACTGCCAATCCATATATGTTTTATAATAATTAATTGGTATTCCACATCGCTAGCACGCAATTGGTCCGTTATGATATATTTTTACGCTCCT 

GTTTTCGTGGCTTGCGCCATTAGCGGTTTCTTCTATGTCCTGAGCCAGATCTACATCCGTGATCAGCCAGATATCGAGACGGAGAAGAGCTTCGAGTCCATAGAGAAGAACC 

GATTCAAATCATTTTGGAAGTACTTTGGCTACACGGCAGTCGTATGGGTCGTTTGCATATGCTCCTTTGCCTTTAACTACTACTGGGAGAACAGATCCCACCTTAACTACGC 

TGTAAGCTTCTGCATGGCCTTCCACGGGTTTGCGGCCCTCTATGCGCTGATTGGAAAGAATCAGCAAATCCAAAATTTTCTGCGGCGCATAGATAAGTAAGTTGATAACTAT 

GAATTTTAAGTACTACTGTTAATATATACTGTTGCAGTGGCGAAGATACCTGTGAAAACTCAGTTCCGCTCTCGAGTTTTGGTTAACTTTTCTTGTTGTGCCCACTGTAGCG 

TTAGTGTAACTGGAAGCAGTTGAGTGTATCGACAATATCTGTGCAATGTTTTTAAAATCACTTATCAGCATAGACAAAAACCATGTATTTTAAATGTAAAATTTTGTAGGAT 

GTGTGTAAAATATATTTGTAAAATTTGAACTCGAACAAAAAAGTACACTTCTACCCTTTAAACGAAATTACGATAATAAAGATACAAAAAATAAATATATCTGAAAAAAAAA 

TCAGGTCTGGAATTTCTACAATTGTTAAATATGATAACAAGTCCCAACAATCCTTTCAGTATAGAACAAAACATGGAATATACTTAAGCCATGTGAAAGAACACATAACATA 

TTAATATTCTAAACAATTTGAGCAGTTTTCTTGAGTTTACGCCTCCAACAATTTATCCTTCACCTTTCTTGAGTTCCAAATTCCTGGACCGTGATCAAGTTTCTCCAGTCTT 

ATTTCAAGTCCCGATTTTACTGTACCCTATCCGCATCGCCAAGCCCGGTGTTACTTCCCCGATCTCTCTATATTATAAATGTGCGCCGATTTTATTTAACATCAGAGGTCAA 

TGGCCGCAAAACAGACAGAGAGAGAGAGAGAGAAAGTTGGTCAGAGCCACGAGAACTGCATCCCGACAGGGGGCTGTCAAGAGCAAGGAATCAAGTGGAATGTTGGTGGTGC 

TCTTTTGAAAAGGCTTCGCAAACTTTCCCCCTGTACACCCCCTTGCCAAGGCCTAATCGCAGAAAACCCACCCGGGTCGGAGTCCCCAATTCATTTTCATGCCGATTATGCT 

TCGGGTTTTGGTCCTGCTCTTTTTGGCTGGGCGTACAGCAGCAGGACTCGGGGTACACACTAAGAGAGGTCTAAGAATATAGAGATAGACAAGGAAAGCAGATGGATCGATA 

AAACGTAGTAAAAATGCTCGTAGTAATGGAAGCTGGAAATAACATGGCACAAACAGGCCCCAAACTGCATCGACAACACCATCACAATCCAGACGTGCGCACTGGAAAATCC 

CGAGAGGTAGGACATGTTTGGCGCGTTTTTCGACTGCATGTGTGCGAGTGTTCTGTTTTGGTTAGATACGTATGAAAAAGTGAAAAAGCCTATATTGACCGCGCCCGGTAGA 

TCGCATAGGCAATATCGATATGGGAAATCATTGACTAACATTTGCAGACATTTAGTTTGGGACACATTAAAGTTAATTTGACTC 

(SEQ ID NO: 145) 

Exon: 1001.. 1504 

Exon: 1560.. 1716 

Exon: 1897.. 2088 

Exon: 2149.. 2367 

Exon: 2427. .2784 

Exon: 2838.. 3116 

Start ATG : 1001 

Transcript No. : CT32762 

ATGGCGCAGTTCTGCATTCTTGGAGTCCTCCTCATTCTTTCTGGAACACACTGCTCGTGGGGATTCCATGAAGAGACACATTATCCGTGTGCCTTTATCGATACGGCAAACA 

TAACGGGGAGCTACGGCTTAGATGGCCCATTTGTGCACAACTGGACGGTGATTCCCCGTCACTTCGTAGCCGTGTACGACTTTGTCATCGAAAATGGCATCCGCATCCCCGC 

AAGCAGGCATCTCAGAGCCTGTGTCTGTAAGACCAAACCCTGTGTACGGATTTGCTGTTTACGGGGAGAAATCTACGACCTGGAGAAGAGGCAGTGCTTAGTCCCTGTGGCC 

GGGGTGTCCAGCCTTCCCTCCCATAGCCATATGGAGGTGGAGTTGGGCAACGGCAGTCTGCGTCTAGTGAAACTGCAACCCCGTTTTAGCATCCACGTGGAAACACCATGTG 

AGCACATGAAAGCCGTAACCAAGGGCTCGGAATACGTTCACTGGACACTTCATGAGAACGGAACCATCAGCCACCGAGGACACATATTTTCTAAGCATTACTGCTTCACTCC 

GCTGCTCCATGGGAACTCCACCTGGGAGTGGCAGCCATTGGCCTGTGCACCAGAAAAACTATATTTCGTCTTGGGGGTGCGGGAGTGGACATACGCCATATGTCTCTTGATA 

GCTATTCTATCAATGTTTATCGTGCTGATGGTCTACCTTATGTGTTCGGAAATGCGCAACAGTTTCTACGGTGTGGCCATAAAGGCGTACGCCATTTGTATGATCTTGGGAT 

ATGCCCTACTGGCTTATCTCACGCTCCACAACCCAGCAAATCTCTCAAATGCGGCTTGTCGTATTCTACCAAGTTTGGCGTTGATGAACTTGGTGCTTTCCTTTTACATTCT 

GAGCTTTATTGCATTTAAGCTGTACTTGAGCTTTTATGGCGTTGTCTTCACAAAACTGATGTTTTGGCTAATATTTACTCCCATCGTACTGGTAGCCGTGGGCTGGTCTTTC 

TTTGTGGGTTTCAGTTATTACGGCTCTAGGCTCATCTTCGGAGGCGATACATGCTGGTTTGATCCACGCAATTGGTCCGTTATGATATATTTTTACGCTCCTGTTTTCGTGG 

CTTGCGCCATTAGCGGTTTCTTCTATGTCCTGAGCCAGATCTACATCCGTGATCAGCCAGATATCGAGACGGAGAAGAGCTTCGAGTCCATAGAGAAGAACCGATTCAAATC 

ATTTTGGAAGTACTTTGGCTACACGGCAGTCGTATGGGTCGTTTGCATATGCTCCTTTGCCTTTAACTACTACTGGGAGAACAGATCCCACCTTAACTACGCTGTAAGCTTC 

TGCATGGCCTTCCACGGGTTTGCGGCCCTCTATGCGCTGATTGGAAAGAATCAGCAAATCCAAAATTTTCTGCGGCGCATAGATAATGGCGAAGATACCTGTGAAAACTCAG 

TTCCGCTCTCGAGTTTTGGTTAACTTTTCTTGTTGTGCCCACTGTAGCGTTAGTGTAACTGGAAGCAGTTGAGTGTATCGACAATATCTGTGCAATGTTTTTAAAATCACTT 

ATCAGCATAGACAAAAACCATGTATTTTAAATGTAAAATTTTGTAGGATGTGTGTAAAATATATTTGTAAAATTTGAACTCGAACAAAAAAGTACACTTCTACCCTTTAAAC 

G AAATT ACG AT AAT AAAGAT AC AAAAAAT 

(SEQ ID NO: 146) 



MAQFCILGVLLTT.SGTHCSWGFHEETHYPCAFI DTANITGSYGLDGPFVHNWTVIPRHFVAVYDFVIENGIRIPASRHLPsACVCKTKPCVRICCLRGEIYDLEKRQCLVPVA 
GVSSLPSHSHMEVELGNGSLRLVKLQPRFSIHVETPCEHMKAVTKGSEYVHWTLHENGTISHRGHIFSKHYCFTPLLHGNSTWEWQPLACAPEKLYFVLGVREWTYAICLLI 
AILSMFI VLMVYLMCSEMRNSFYGVAIKAYAICMI LGYALLAYLTLHNPANLSNAACRILPSLALMNLVLSFYILSFIAFKLYLSFYGVVFTKLMFWLIFTPIVLVAVGWSF 
FVGFSYYGSRLI FGGDTCWFDPRNWSVMI YFYAPVFVACAISGFFYVLSQIYIRDQPDIETEKSFESIEKNRFKSFWKYFGYTAVVWVVCICSFAFNYYWENRSHLNYAVSF 
CMAFHGFAALYALIGKNQQIQNFLRRIDNGEDTCENSVPLSSFG* 
(SEQ ID NO: 147) 
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Name: mth-like 8 

Classification : G_protein_linked_receptor 



Celera Sequence No. : 1420D0013384832 

AGTCAGTGCCTGGACGGATCGCACGAGTCCTTCTACTAAGCGCTACCAGTGGTGGTCGTAATCCCCCAATCCAAACCCATACCCAAAGCTTCGCTGCATTTTCGAATACAAA 

CACTGAACTATCCGTCGTTGTAAAGCTACCrTATCCACTACGATTACTACACTATCCTACAAAAAACTATGCTGGTTAAAAGACTATACACCTAGCTCCATATACTATATAT 

ATATATACAATATCGAGGCATGTGTTGTGTGGGCGAGTGAGTGTGTGCGTGTGACTTCTGATTGAGTTGCCATTTGATTCCAAGTATTTTCGAGTGCTTAGTTTTGTAATTT 

GTAGCTAGCCGTTAATGTTTGTATAAACTGTTTCATTGGTTTCCCACTCCGCTTTATTCGATAAGCTCGTTTTCTAGATTTTAGCTTTAGCACTCTTAGTTTAATGTTTAAA 

GTTAGTTTCGCTGTTAAGTTTACACCGTAGTCGAGGCGTCTTGAGTAGTTCTTTTGTACATTAAGCTTACACTTAGCTCTACATTAGATATACCATGTCGAACCTACGAGTC 

TACGAACATACGAACATATATATGTGCGTTGAAAGCGTTGTATTTTTTGAAGCCGAACTTATGCAACCGAAAGGAGACAATAAACAAATGTGCTATAAAGGACGAAAGTGTC 

GCAAATAAATGTGAGCCACTGCTAAAGAAGACCGATTTGCGCAAGGTGTTTTACTCGGCGGCGGCGGCAAGAAGCGTTTGGGGCACAAACGACGGATGAGTGATTGCCACTC 

GTACCATCATCATGACGATGGCGTGTGGAACGACGTGCGCATAGTAATGACAGCGGATGGCAATCTGAACGATAAGCGGCACCCGflTAATGAAACATTAAACTAAACCGTCG 

GTAGTCGCTGGCCATAAATCTGTCGGCAGTCTGCTCGCAGAAGCTTTTAGGTGCGGACCCCTTGCGCGGCATACTTTCGGGCTTCCGTTCCGTTCCGTTCGCCAATGGCCCT 

GCTCCACTACACCTTCGACCAGCTCGAGTTGTACCTAGAATGGGCGTTCGCCCAGCACGGAGAGGCCACGCCGATTCCCTCGATCCAGCCGTATCCGGGCGTGTTCGTGGGC 

GATCTCAGCCAGCTGAACCGCTTCAAGCGACATGCCTTCTCCGCCGTCGTGGGCACGCTCTTCGTCCTCGCCTTCTGCGGCAACCTGAGCACCCTCTATGTGAACAGCCGCC 

GGAAGCTGCGGCCCTTCTTCCGGGCeTGCCrCATCTCACTGGCCTGCAGTGACCTTGTCTCCAGCATATTCTGCACCGTATCCTACATGGCCCAGTTCCAGGCGCAGTATCT 

GCAGCTCTGGGTGAGGCGGCAGGGATTGAGCAGACTCAACACAAGCTAATTACAAATCTCCTTAGACAATTGGTGGCTTCATGTGCAAGTTTGTGCCGTTCATAACCACCAC 

GTCGGTGTTGTCCGGCAGCCTGACCCTGGTGGCCATCGCCTTGGACCGCTACTTGGCCGTGATGCGACCCGTGCTGGGGTTCTGGAGTCCCGACAAACGCTTCAGCACTCTG 

AGCATGCTGCTCATCTGGGCATGTTCGATAGGATCCTCTGGCCCGCTGCTGGGCATCTACGACTACCGAAAGATCTATCTCCTGGATGTGGAGGACTCCAGCGAGGAGAGCG 

AGGAGGTCGTGACCGCTGTGCCCGAGGAGCTGGTGGTCACCGAGCTGGAAATGGTTCACATGTGCCTGGCCGGGGACGTAAGTCAACGACTGGATTATCGCTTAATACAGTT 

GACAGCGATTAAGGGGATTTGGACGGGGGGATGTCAAGTTGTTGAATCAATGTTCCGATTATCTTATCAGCCGCGTCGAAAACAATTTATGCAATTCTATAAATATTCCGTT 

TTAGCACGACGTCGGACTCTACTACGTCATACTGTTCACCCTGATCTTCCTGCCGTGCATTGTGTCCTTCCTATGGCTGAATGCCGTGATTGCGAGGCAGCTCTGGCTGCGG 

AGACACTACCACCAGGAGCAGCAGGAGCAGCATCAGGAGCCCAAGGAGGGTCAGTTTAAGACCATGGCGAATGGCGGGGATCTCTTGATGCCCTCCACCTTGGTCAGTGCCA 

TGGGAGTGGCCGTTCCCTTTGCGCTGGACAACACTCCCTTGCCGCCCAAGAGTACCGTCAACGCTCCTGGCAAGAAGACCACTGCTGCTGCTCTAGCCAGGGAGGCCAGGCA 

CCGCAAGATGGTGGTGGTGGTGCTGCTAATGATGGCGGTTTTCATATGCCTTCGACTGCCCGCCTGGGTCTTCCTGATCATGCGGCTTTATGGATCCTACTCCGAGCCCATC 

GACrGGCTCCTGTACTTCAGCTTCGGCATACTGAACCTTTTCAGTTGCGCCCTAAATCCCATCTTCTACACCTTCCTCACCCAGACCATTAGGACCTTGACGCTGGTGAAAC 

ACAAAATTCAGGGATTTCTTGGCTGTCCGCCTGGTAAAGTGCCGGATGGTATGCCCACGGATCAGATGGACAAAAGCGGGTGCTGCTGCGGCCTGAGGCCACCCACCTTCAC 

ATGGAGGTGCCATCCCAGCCGCGATAGGGCAGCAGCCACCGTAATCCGGGATGTGGACCAGCCAGATCCGCCCTCCGACCAGGTGCAACCGGATCCGAGTAGCCTACGGCGG 

TTCCTCAGCTACAAGCAGGAGGTGTTCACCATTTACAAGCAGTGCGGTGACTCGTCGAGTGCGTCAATCGAGTCCTCCGCCTGATAGGCGGTGGTCATAAATCACCCGGGTG 

CATTTGTCGGATACGGGGGCCAACGGTTAGCAGAACCCCAGAACTTGATTGATGGATGGATCCAGCCCCCATTCACTCTGTACAGGCGCAAAAACGGTTTGTCACGGTGGGT 

ATATCACCCACGAATACACTAGCACGCATATAAGATATAAGCrCACATATTATGTACCTCATCAAATGTGTAAATATGAGGAATAAACGAATGGGTGTTGGCAGCTAACTAA 

CCATTAACACATGGCGCAATCGAAATCGGGAAGGTCTCAGCTATTATTGTGACGAAAACCTAAGACCTTTCAAGTACCACTTAAAACTATATTTAGAGAACTTAAGCAAATT 

GTGAGATTAAAATGCAATGCAACTCCTAAAGGTGCGAATATACCTATTCTTAAATATTTTTACTCTAGAAATACTGACATCTTTATGTGCTTAGTTTTCGTTCAGACAGACT 

ATCTTTGTTATTTCTAAGAACGTAGTAATTAGTTAATGGAATTAATTAGTAACCGAAACTTAAAATTGAGTCATGCCAAAAAAAGAGAATTGGCCTGACAAGACACACATTA 

TAGAGTCAGCAGATCGTGGACGAGACAATGAGCCCGCAATGCAATGGAACTAGATTGTATCATATACTAGATACTTCCGCTTGGAACAATGGTTAACAGATCTCGTGAGAGA 

TGGGGCTGAAAGCAGTTGTTAATTGTCAAATGTTGAGCACGCCAGTTAAATTTAGCTATGCAGCGAAAGCTCTCTCCCATACTCTCTTCCCATGAACTTTTTTAAGCCGCAG 

AGCCCGAAGCTCGACTCAAAGCTGAATTCTGCGTTTTCTGGCATTCAGTGTCGGCGGTTTGCGGCGAGCGGTTGGATTCGGATTCTTTGGTCGGTAAAACGCGAAATCTTCT 

GTGCGGCGGGTGAAATCTTCGGCAGAAGAAACAACAAAGCGACAAGTGTGGAGTGTGGAGTGCATACCGTACCACG 

(SEQ ID NO: 148) 

Exon: 1001.. 1354 
Exon: 1410.. 1757 
Exon: 1909. .2772 • 
Start ATG: 1001 

Transcript No. : CT32957 

ATGGCCCTGCTCCACTACACCTTCGACCAGCTCGAGTTGTACCTAGAATGGGCGTTCGCCCAGCACGGAGAGGCCACGCCGATTCCCTCGATCCAGCCGTATCCGGGCeTGT 

TCGTGGGCGATCTCAGCCAGCTGAACCGCTTCAAGCGACATGCCTTCTCCGCCGTCGTGGGCACGCTCTTCGTCCTCGCCTTCTGCGGCAACCTGAGCACCCTCTATGTGAA 

CAGCCGCCGGAAGCTGCGGCCCTTCTTCCGGGCCTGCCTCATCTCACTGGCCTGCAGTGACCTTGTCTCCAGCATATTCTGCACCGTATCCTACATGGCCCAGTTCCAGGCG 

CAGTATCTGCAGCTCTGGACAATTGGTGGCTTCATGTGCAAGTTTGTGCCGTTCATAACCACCACGTCGGTGTTGTCCGGCAGCCTGACCCTGGTGGCCATCGCCTTGGACC 

GCTACTTGGCCGTGATGCGACCCGTGCTGGGGTTCTGGAGTCCCGACAAACGCTTCAGCACTCTGAGCATGCTGCTCATCTGGGCATGTTCGATAGGATCCTCTGGCCCGCT 

GCTGGGCATCTACGACTACCGAAAGATCTATCTCCTGGATGTGGAGGACTCCAGCGAGGAGAGCGAGGAGGTCGTGACCGCTGTGCCCGAGGAGCTGGTGGTCACCGAGCTG 

GAAATGGTTCACATGTGCCTGGCCGGGGACCACGACGTCGGACTCTACTACGTCATACTGTTCACCCTGATCTTCCTGCCGTGCATTGTGTCCTTCCTATGGCTGAATGCCG 

TGATTGCGAGGCAGCTCTGGCTGCGGAGACACTACCACCAGGAGCAGCAGGAGCAGCATCAGGAGCCCAAGGAGGGTCAGTTTAAGACCATGGCGAATGGCGGGGATCTCTT 

GATGCCCTCCACCTTGGTCAGTGCCATGGGAGTGGCCGTTCCCTTTGCGCTGGACAACACTCCCTTGCCGCCCAAGAGTACCGTCAACGCTCCTGGCAAGAAGACCACTGCT 

GCTGCTCTAGCCAGGGAGGCCAGGCACCGCAAGATGGTGGTGGTGGTGCTGCTAATGATGGCGGTTTTCATATGCCTTCGACTGCCCGCCTGGGTCTTCCTGATCATGCGGC 

TTTATGGATCCTACTCCGAGCCCATCGACTGGCTCCTGTACTTCAGCTTCGGCATACTGAACCTTTTCAGTTGCGCCCTAAATCCCATCTTCTACACCTTCCTCACCCAGAC 

CATTAGGACCTTGACGCTGGTGAAACACAAAATTCAGGGATTTCTTGGCTGTCCGCCTGGTAAAGTGCCGGATGGTATGCCCACGGATCAGATGGACAAAAGCGGGTGCTGC 

TGCGGCCTGAGGCCACCCACCTTCACATGGAGGTGCCATCCCAGCCGCGATAGGGCAGCAGCCACCGTAATCCGGGATGTGGACCAGCCAGATCCGCCCTCCGACCAGGTGC 

AACCGGATCCGAGTAGCCTACGGCGGTTCCTCAGCTACAAGCAGGAGGTGTTCflCCATTTACAAGCAGTGCGGTGACTCGTCGAGTGCGTCAATCGAGTCCTCCGCCTGA 

(SEQ ID NO: 149) 



MALLHYTFDQLELYLEWAFAQHGEATPIPSIQPyPGVFVGDLSQLNRFKRHAFSAVVGTLFVLAFCGNLSTLYVNSRRKLRPFFRACLISLACSDLVSSIFCTVSYMAQFQA 

QYLQLWTIGGFMCKFVPFITTTSVLSGSLTLVAIALDRYLAVMRPVLGFWSPDKRFSTLSMLLIWACSIGSSGPLLGIYDYRKIYLLDVEDSSEESEEVVTAVPEELWTEL 

EMVHMCLAGDHDVGLYYVILFTLIFLPCIVSFLWLNAVIARQLWLRRHYHQEQQEQHQEPKEGQFKTMANGGDLLMPSTLVSAMGVAVPFALDNTPLPPKSTVNAPGKKTTA 

AALAREARHRKMWVVLLMMAVFICLRLPAWVFLIMRLYGSYSEPIDWLLYFSFGILNLFSCALNPIFYTFIiTQTIRTLTLVKHKIQGFLGCPPGKVPDGMPTDQMDKSGCC 

CGLRPPTFTWRCHPSRDRAAATVIRDVDQPDPPSDQVQPDPSSLRRFLSYKQEVFTIYKCCGDSSSASIESSA* 

(SEO ID NO: 150) 

Name: Neuropeptide receptor-like 
Classification : G_protein_linked_receptor 



Start ATG: 1 



FIGURE SHEET 66 





WO 01/070980 



PCT/US01/09341 



67/89 



Celera Sequence No. : 142000013384832 

RGGACACATTGGCGTRCTrAGTCTAGRCCftCTGGGTGAGCTTAATGTTTTGCCGTTCGGTTflTATATCATGCGGTGATTTGTGGACTTAGCATAAMCCTAACTGMCGGCC 
CGGCGACATCCCGACTCTCTGGCGACTTTGGACGGGCCACGATAAATCAAGGCGCAGATCCGGGAGCCCGGCATATAGTGGATATAGTATGCGTGGCGAAACTTAATAAGAG 
ATGCGACAACAAACAAATGATACAGATAGACAGACGGCAGGCGACAGTCGCGGTGGTGCGAGAGGTGGGTAGGTGGGTAGGTGGGAAATGGGATTGTGGAAAGCGGAAAAAT 
CTTTAAAGCAATGCCGCTGCACAGGACATTTTTCACCGCGCCATCCATTCGCAGACGGATCGCCAAAGAGAGACATCAACGTGACTCGCGGGGAGAACAAGTGTTTTGATTT 
ATCGCCAGCAATGGCAATGGCAGGGCGGACGGGGATTGCCATCCAATTATACTGCTAACTTTCATTATGTCACTGTGCTATTTCGGGACCGGATTCGCCGCCATATTGGCCC 
ATTGTTGCGCCATTGTCTATGCACAGCTCTCTGTTTAGCAGTGACACACTGCGGTTTCGGTGGCAGGACTCATCTCCATCCTGCCGCCTGCCTTAGTCGTCCCTTGCCGCTT 
CCTCGGTGGACTCCGTCTGATAAATCGTCAGCGCGGCTTCCCTTTCACATGCTTTGACTTCATTATGGCGTAACAAATCGTCGGCATTTTGCATTCCTCGCTGCGGAAACGG 
AAGCGGAAGACATCCCCGAAAGAAACTGCGAACTGGGCCTTTGTGGCCACACGACGAATTGGTCACGCGCCATATAGACAATTCCATCTCAGTTTTCCACATGTTTTGATTG 
CTCAACGTTGTCAAGGGACTTTGTGCTGCTGTACTTTTTTCGCTTACACTTTAATAGCTTCCGAAGAGACTCATATCCTTCATATCCTTTCTTGCAGCTCTGCCATGGATCT 
AAAGACTGGCTCTCTCCTGGTAGCACCCGCCGTGAAGAACTCCAGCTGCTCGCATCCGCGCTACTCCGGCCACAATTTCATCGCCCACATAGGGATTGCGGAGACCATCGAG 
GCGGTGCTCATCCTGGTCCTCACACTGGGCGTCATCGGGGCCAACTGCCTGGTCATCTTCGTCATCAACAACCGCCGCTATGCGGCCTACATACACCAACAGGTGGGAGGCT 
GAGTCACTGGCTTAACACCTTGTACATCTTATGTCTCTCCTTGTAGCCCCGTTATCTGCTCACCTCGCTGGCCCTGAATGACCTGACAATTGGTCTGCTAATTACACCATTC 
GGTCTAATGCCGGCATTGTTCCACTGCTGGCCATACGGAGAGATCTTCTGTCAGATACAGGTGAGCGGGTCAAGAGTGGGTCAACACCAGGTCAACAAGTGGGGCGGAAGTG 
CGATAATTTGTTATGGAACACCCGTCTCCCGTTTGCGGCTGTTGCCACTGCGGTCAATTAACTCTCTCTTAACAATGCACAGTAATTACTGGACATAAATGGCGGTGATGGA 
AACGGACAATCCCGATTAACGGGCGGAACAATGTGTAATGCTAGTATTAACGTGGTCGAGAGTGTGTAGGATTCTGCGGTAAGAGTCAGGGTTCCGATGCAGAAGGTGTGAC 
AAGCGGAAAGTTATCCCAGCCATATGGTTAGGAAGCTAGTGCACATGGAAAAACCATTGTGCCATTAATTATCATGACGGTGGACAGGGGGCATCGTGGTTAGGGTGTGCAA 
CAGAAACTGTTAGTTTATCACTTATCAAGTGTCAAAAAGCATGCAACGAAATCACCTAGAATCCATCACAACAATTGTTGCATACATGCTTAAAAAGGTTGGGTTTTTATGA 
AAATCAGAGTTGTAAACATTGTAATAAAATCCAGTTAACACTCATACAACTATTTTCATGGGAACCAGACTCTTGTTATATATTTTATACGATGTCGTTTCACAAAATTTGA 
ATTTTCCAACTGCCACTTTAGGGTTATTGGTAAAACATGCACGCATTCCACGCATAACTCTCACAAATATTGAAACACATACTGACGATTAATAATTTTATTAAAAGTTTTT 
CGATTCGCCTCGACAAGCGAAAAAGTTTGCCGAAAGCAATAATACAAATAATAAACTTCTAAACGAGCACTAACGCATTTTCTGGCTTTTTCGCGTTTTCCAGGCCCTGCTC 
CGAGGGGCGCTTTCCCAGCAGAGCGCCGTCATCCTGGTTTGCATGGCAGTCGATAGATACATGTGTGCCCTGCATCCGCGCCGCTACTACCAGCACTCCAGCAAGAAGGTAA 
GATTCCACGTCCTATCGCCTCGCATCGATAGGGAACTTCTATTTTGGGTTGCCCGGATAAGGCACGCTCCTCGGCGATCCTTATCCTGTTGTTCTTGCACTCGCGCTCGCGC 
TCGGAAGGACAATAAACAAAGCTCCGGCATAATGACGAGAAATCACGCGCCAATAAAATCGACTATAACAATGGGAAGAGCACCGCACAGCAACTACACAATATGGTCGGCG 
AGCCAATTACGCAAGCATAAATAAGGATAATGGGCTTTAAGCAGCTTGACAGTCATTAAACAGGCAAAAAATGGGTGGGCAGCGCCAAAGAGCAAGTGCAAAACAAAGCATC 
ATGGAGAATGAAAAAGCAAAGTTGGCAAAAACCAACAAAACCGAAAAAAAAAACAACGGGGGCCAAGTCGAATAGACATTCGAAAAGTGTATAAAATGTGGATAAAAAAAAG 
GGAGCAAAGTTGGCCTAGAGATTTTTTACGCGCAATGACTTCGGCACAGGGCGACGACCACAGCAAAGTCCTTTTCCAGGACGCAGGCAGGCAAATGAAGCGGCACCTCCCA 
CGGAAACCGCAGCCAGGACCAAAGAGCAGCAAACGAGCTCTCAACCAACCCTATTTCCCATTCCCCCCTTCGTAATTTATGGCTCCCGGACTGTAAACAAACGAAAACTAGG 
AGGCCAACAGTCGGCTAAGAACACCGAAGTCCGAATGTCCTTGTCCCAAATGAACGGGACTGTTTCAGCTGAAATTGGCGATCAATAAATGTATAGTTTAGAATCCATTTCC 
CGCAAAATTCAAGCTAACAAACAGATTTTAGCTTACATATATGATTATTTAACCTTTTTGATTTACAGTCTTAATTTCTCGCTGTTTGCTTGGGGATTTTGCAAACAGTTGT 
AATTACTAAACTAACTAAAAAAGGAAAATGCCATCAGGCCAAATAAAACAATTGGAAAACGGGAAAGTACCAATAGTGGAAAATAAATTTACATAACAATTTTGTATGCTCC 
GAAAATCGGCGCCATTTCGGTTTTTTCATGCCATCTGGACGAGTGGGCGAAATTGGCGCACTGCCACGCCCCATTTGTAAAAATGATGGGCGCAGTGTCGAAAATTAATAAA 
TGGATGCTGAATGCGAAATACTGGGGATGCTGAAAAACCAATGCTCGGCGGAACTCGGTAATGGTAATTGGGAACTTTTGACTTTCGTTTTTGCTTACTTTCAAAAGTGCCA 
TAAATTCAAAGCCAGTCTTGGACATATCTCTTTATTTATAGTGAATTTAACAGCTTGACTCAAAACTCGTCTTGTTGTTCGCCACTCACAATTTGTATTAAATGCACTTGAA 
AATTCAATTATTTTGTTATCTAGGATATTTGCTGACTATCCTTACAACTTCAGCAGTCGCCGGGTTGTGCTCTACGAAAAACTCCTGCAATCCTGCAAAAAAAAAAAATAAT 
ATTTACATTTCAAAAATTGTACCCAACTGTTTGAGATCGCCCTGCTTTGTTTGTTTCGTTCGGGAAAAGTTTTAAATGTGATAGCGTAAATTAAATTAATCACAGCCATGGT 
CCGTTTCAAAGAGACACGGACAAAAGTTGAGCTTCCCTTTTTTCCATGACTTTTTGGCACTTGGGATGGGGCCATTTTCATTTTTTCAGGTGGACAGGTGGACAACATTTTC 
CCATTCAGGATGTCAGAATGCTGCATTTAGTAACTCAGCCAGTCAGTGCGATTGTGGCAGCACTTTTCCACTTCCAAAGAGCGGAAGTATCGCTTTCCCTCTGTGGTCTGCA 
CCCTTTACCAAGTGATTTTGGCATAATCAAAAGTTTGTAATTCTATTTTTGGCCTCTTGGACTCCCTTGTTGAATGTCAGACGAAATGTGCAGTGAAGCGAGTGGAATTTTA 
GCAAATTAGAAAGTTGAAGAGGGAATTTCCGACTTATCTGCACGCCTTAGTGTGATATAAGCCACTTCCGGCTAAATAAACACTTAATATGAATTACCCAACCTTCATTCAT 
TTTCAATTCAGTATTGGCCATATTGGCCTGAAATGCAGGGAATACGTGCCTGAAGCCCTTCGCCAGTCATAAATCAGACCGCACCTGGCGGAAATGTGCTCCACTAATTGTG 
TATTCATTTTGCAGGGCTGCGTGGCCATACTGAGCTTAACCTGGATTATCAGCCTGACGGTGTTTGGCTTTCTGGTGCTGCCCAAAGGTGAGCTGCCCGAGATTGGCCCGAC 
ATCAATGTGCTCTCGAACACGTCAAACCATAAACTATATGCGCAGGTTACTACTTCAACAATACGGGTCTAATGGCCTGCGAGCCCTTCTACAGCAAGCCATCCTACAGGAT 
ACTGTCCACGTGCGCCCTCTACTTTCCCACGACGATGGTGCTGATGTACTGCTACGGCTCCAGCTTCCACATGAGCCGCTTCCGGCTAAACGATCCGACCATGCCCCTCACG 
GCGGCCGCCCACCACCCCCATCCGCACCCACATCCCACCGCCGCCCAGCAGCTGCAGATGCACCAGCACCAGCAACATCACCAGCAGGCGGGAATGCACTCGCACCTCTACC 
ACGGCCACTCGCATCACCCCAGCCACCCCAGTCACCCAAACCACCCCAACCACCACGGCCACCCCCACCATCATGGACCGCCCGTTATGGGCCATCTCAGCATGGCCATGAG 
CATGGGACTGGCCGGCATGCCGAACATGACAAACAAGATTACCAAAAAGGTGAGCCAGCTGAACCGGGTGAACTGAGGAAGCCAGGTGGCCTGCTGGCCAAAAGGATGAAGA 
TGGATGAAGGAGGATGAAAGAGGATGAAGTGGGCGGGCTCCACAACAGTTTGCTCAGGCGGGCGATTTGGGGATTAAGTTAGCAGTAAATAGAATTTTCTCGGCATCGCCAA 
ACAAATTGTTGGGTCTATCAGTATGCACTCGCAGAACTTAAAGAAGTGTTAACAGCTTTTAGTATTTGTGAACGGGCTATTCAATATTGGTACCAGCTAATTCAATTAAATT 
CCAGAACTCAGGTAAACTAAACTAAAACTCACTTAAAAAAGAACTCGAGTTGTGCAAGTTACTTTACTTACTTTCCGTGCATAAAAATATTTCCAGAAGCCAATATTACATT 
TTAGCAAAATAGAAAGCAGCAGAAGTTGCATTTGAAAATATATAGAATACTCAACTATATTGCTAATAAACAGGAATAGAAATTTCGTTTGGGCAGTATCTACACATAATTG 
AATCAACAATAGTCACTTTAATTCAATCGCCGCAAACATTCCCACAGATTGTTCCCATCCAAGAGAAGAACTCCAGCGGATCCACCTCGCGCTCGATGGCCGCCATTTCACT 
GGGCTTCATAGTGATGGTCACTCCGTGGACGATTCAGGAAATAGTCACCGCCTGCACGGGATCCAAGGCAAGTGTCCTGTACAAAGGATTATTGTGTGACCGCTTTGTGCTC 
CGTACAACGCGGCGCATACATAATGAGGTCACGCGTGGCTCATCAGTTTTGATTTTGTGACATTTAATGTTCTATGTCCGTCCACAGCTGCCGCCGTTCCTCGATTTCCTGG 
TCACCTGGACGGCGCTGAGCAATAGTCTGTGGAACCCGTTCATGTACTGGCTGCTGAACTCCGATTTCCGCCGCATGAGTCGCCAGCTGATGCCGAACAAAGTGAGTATCGG 
GTATCCAACAGTGGGAAGATGAATCTCCTCTTCTGGCGCTAATTTACTGCATCTCTGATCTGGAGCGATGTTCACTGTCCGTTTCAGCAGCAAATCTGCCATAAGCCCTGAT 
TGATTGATTCGCTTATCCCTATTCCAAACACCACGTGCACACTTTCATTGACCTCCCCGAACGCCAATCCCCAAGCCCAAATATGGGTATTTCGCCCAAGTGCATCCTGCAG 
CTCGCAAATTTTCCTCCTGCACAGCGCAAAGTTTGCAAAAACAGGAGCTACATCCTCCCGAAAGCCCCCATAAATCCGTAACCGAAGGTGTAAGCTCCAGAATGCTGGCAGA 
CTGACCCGGGCAAAGTAAACAATCAACCGACGCAGCAACAACAATAGCAGGTCCTGCAAAGTGATTTTGAATTATGAAAGAGCAAATCTAAGACGGCATTTACATCCTCGTA 
GCCCCACAAACCCCATCCCATAGGTCCTGATTCCATTTTGGTAAACGCAGTCAAGTTGCATGATTTACTTCCGGAGAGCCGGGGTCAGTCTGTGGAATATTTATGGCCCAAC 
CTTGTGCTCTGCCAGAATAAATGCGCATTTAAAATCCTTATTAATTTTTGTGCGAGTCGGAATTTATGATGCCTCCCTGCCGTTTCAGTGCTTCCCCCACGAGGATACGCCC 
GAGCACAAATCAGGCTGTTGTCACATCAATGCTAACGATTTCGAAATCACAACGCTTCCGATTCCCCCGGAGCCGCCATCATCCCGTCCTCCGGGCGGAGCAGGAGGAGGAG 
GTGGAGGTCCATCCAGTGGAGGTCCCAGCTCTGGCGGCGGAGTGGCCAGTTCCATTGGCGGCTCCGTCCTTGGAATCTGCGGCCGCTCGCGCACAAACAGCCTCAGCCGCAG 
CGCCTCCCAGTACATCCGAGGGACAATGGGCGGAGGAGTCCACAATCACCAGGCCACGGAAACCTTCTCCACCACGAGGCCTGACATTGAGGGCCTCTCCGAGAAGTACTGG 
GGCGAGATCCTGGAGAGAACCGTCAGCTCGGGCAACCTGAATGCCATGCAGAAGCACCTGCCGCCCCACCTGCCCTACGCCCACCACCTGGTCCACCAGATCCACCAGCCGC 
AGCAGCACCACCAACAGCAGACCACCTCCTTCAGCAAGGCCAGCGATCTGCAGCTGAACCTGAACCTCAGCCAAGCAGCCACGGCTGCCGACCTGGGCAAGTTCTCCAACTC 
GGAGCCGAAGCTGTGCGAGCACCTCTTTCACGACGTAAACTGCGCCAAGAGCAAGGCCGCCAACGGGGATGGGGAGTCCTCCGGCAAGCTGGCCTGCGGTCGCAGCATTCCG 
GACATCTAGTACGCCGAGGACGTGATCCTGGCCAAGAACCAACTGCTGGCCAGGCAGGCCAAGTGCTCGCACCATCCGCTGCACCAGCAGGCCAAGACGCGTCTCCATTCCC 
ACACGGGCAGCCTGCGCTTCAGCCGGATTCGCGCTGCCTGCCACAGTCCGCCGCCGGAGAAGTCCGCCTTTCCAATGGCCGAGTATCGCAAGTAGGATCACAGAGGGCACTG 
GGATCAAGGATGCCGGGATACGGTACTCCAGGATGTCCAGGATATGGTATAACCATGCACAGAATAGGGAGTGAGAGCGAGCGAGTATCTATAAGATTCCATAGGTGTATGT 
GCGTGTGAGTGTCTATATAATRCCGCATGATCAAACCATAATATGTGTGTTTTCTATTAGGCTTAAGATCTAGGAGGTCCTTTGGTTGGGTTTTCAATTTAAAGTCTCTGAG 
ATTCGCAAAATACCCATGGCCTGTAGAAAGCGCCCTGAAGCCTAACTAAAAGAATATGAAAGATAGCACCTATCCTCAGATATTCGACTCTTTATATAAAGTAATCTGTGTT 
GGTTATAACCTGAATGATTGCTGTCCATCTGAGAACTTTCAATACCATTAGCTAGACTTTCTTTACTATACATGGATCTGGGGAATCGGAATTAGCCGAGTTAAGCAGAAGT 
TAACCATAGTAAGTGTTCATTCAATGTTTGTGATACCCTAAGTTTCCCAGCCCAACATACATACCTACCAGTAACATATAATATAATCCTAGTTGTTGTGCGTGCGTCGGCT 
AAATTAAACGTAGGATATATGGAAACAGAGATATCAGACTTTCCTATCAGAACCCGCTCGACATTCGGAAGCCCTTCGCCAAATTCTCCACCACTTAGGGACTAAGGATCTA 
CGTAGACTCTGATATCGGCGGACCTTAAAGCGTTGTGGTTAATCATAGTTGTAGTTCCATAGCTCGAGTCGTATACACCAAATTACTCACTGTTCGACGAATCAAAATCAAA 
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TCCTAACTTTGTGATTGTCAAGRATTTAAGTCGGTTGATGTGGACATTGCTGTTAAATTMTTTTTATTTGTAGCTAAGTAflTTGflCTAGTTAARGCCAAATAAflTTTflCCG 
TTTAAGTAAGTCTAACTTTAGTTAAATAAACGAGCGGTGTGTAACAAATGTATTTCATAGGCAGAGGATTCAAATCGAAATAAATAA 



(SEQ 


ID NO: 


151) 


Exon : 
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. .1222 
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.7599 


Start 


ATG: 


1001 


Transcript 


No. : 



ATGGATCTAAAGACTGGCTCTCTCCTGGTAGCACCCGCCGTGAAGAACTCCAGCTGCTCGCATCCGCGCTACTCCGGCCACAATTTCATCGCCCACATAGGGATTGCGGAGA 
CCATCGAGGCGGTGCTCATCCTGGTCCTCACACTGGGCGTCATCGGGGCCAACTGCCTGGTCATCTTCGTCATCAACAACCGCCGCTATGCGGCCTACATACACCAACAGCC 
■CCGTTATCTGCTCACCTCGCTGGCCCTGAATGACCTGACAATTGGTCTGCTAATTACACCATTCGGTCTAATGCCGGCATTGTTCCACTGCTGGCCATACGGAGAGATCTTC 
TGTCAGATACAGGCCCTGCTCCGAGGGGCGCTTTCCCAGCAGAGCGCCGTCATCCTGGTTTGCATGGCAGTCGATAGATACATGTGTGCCCTGCATCCGCGCCGCTACTACC 
AGCACTCCAGCAAGAAGGGCTGCGTGGCCATACTGAGCTTAACCTGGATTATCAGCCTGACGGTGTTTGGCTTTCTGGTGCTGCCCAAAGGTTACTACTTCAACAATACGGG 
TCTAATGGCCTGCGAGCCCTTCTACAGCAAGCCATCCTACAGGATACTGTCCACGTGCGCCCTCTACTTTCCCACGACGATGGTGCTGATGTACTGCTACGGCTCCAGCTTC 
CACATGAGCCGCTTCCGGCTAAACGATCCGACCATGCCCCTCACGGCGGCCGCCCACCACCCCCATCCGCACCCACATCCCACCGCCGCCCAGCAGCTGCAGATGCACCAGC 
ACCAGCAACATCACCAGCAGGCGGGAATGCACTCGCACCTCTACCACGGCCACTCGCATCACCCCAGCCACCCCAGTCACCCAAACCACCCCAACCACCACGGCCACCCCCA 
CCATCATGGACCGCCCGTTATGGGCCATCTCAGCATGGCCATGAGCATGGGACTGGCCGGCATGCCGAACATGACAAACAAGATTACCAAAAAGGTGAGCCAGCTGAACCGG 
ATTGTTCCCATCCAAGAGAAGAACTCCAGCGGATCCACCTCGCGCTCGATGGCCGCCATTTCACTGGGCTTCATAGTGATGGTCACTCCGTGGACGATTCAGGAAATAGTCA 
CCGCCTGCACGGGATCCAAGGCAAGTGTCCTGTACAAAGGATTATTGTGTGACCGCTTTCTGCCGCCGTTCCTCGATTTCCTGGTCACCTGGACGGCGCTGAGCAATAGTCT 
GTGGAACCCGTTCATGTACTGGCTGCTGAACTCCGATTTCCGCCGCATGAGTCGCCAGCTGATGCCGAACAAATGCTTCCCCCACGAGGATACGCCCGAGCACAAATCAGGC 
TGTTGTCACATCAATGCTAACGATTTCGAAATCACAACGCTTCCGATTCCCCCGGAGCCGCCATCATCCCGTCCTCCGGGCGGAGCAGGAGGAGGAGGTGGAGGTCCATCCA 
GTGGAGGTCCCAGCTCTGGCGGCGGAGTGGCCAGTTCCATTGGCGGCTCCGTCCTTGGAATCTGCGGCCGCTCGCGCACAAACAGCCTCAGCCGCAGCGCCTCCCAGTACAT 
CCGAGGGACAATGGGCGGAGGAGTCCACAATCACCAGGCCACGGAAACCTTCTCCACCACGAGGCCTGACATTGAGGGCCTCTCCGAGAAGTACTGGGGCGAGATCCTGGAG 
AGAACCGTCAGCTCGGGCAACCTGAATGCCATGCAGAAGCACCTGCCGCCCCACCTGCCCTACGCCCACCACCTGGTCCACCAGATCCACCAGCCGCAGCAGCACCACCAAC 
AGCAGACCACCTCCTTCAGCAAGGCCAGCGATCTGCAGCTGAACCTGAACCTCAGCCAAGCAGCCACGGCTGCCGAGCTGGGCAAGTTCTCCAACTCGGAGCCGAAGCTGTG 
CGAGCACCTCTTTCACGACTACGCCGAGGACGTGATCCTGGCCAAGAACCAACTGCTGGCCAGGCAGGCCAAGTGCTCGCACCATCCGCTGCACCAGCAGGCCAAGACGCGT 
CTCCATTCCCACACGGGCAGCCTGCGCTTCAGCCGGATTCGCGCTGCCTGCCACAGTCCGCCGCCGGAGAAGTCCGCCTTTCCAATGGCCGAGTATCGCAAGTAG 
(SEQ ID NO: 152) 

Start ATG: 1 

MDLKTGSLLVAPAVKNSSCSHPRYSGHNFIAHIGIAETIEAVLILVLTLGVIGANCLVIFVINNRRYAAYIHQQPRYLLTSLALNDLTIGLLITPFGLMPALFHCWPYGEIF 

CQIQALLRGALSQOSAVILVCMAVDRYMCALHPRRYYQHSSKKGCVAILSLTWIISLTVFGFLVLPKGYYFNNTGLMACEPFYSKPSYRILSTCALYFPTTMVLMYCYGSSF 

HMSRFRLNDPTMPLTAAAHHPHPHPHPTAAQQLQMHQHQQHHQQAGMHSHLYHGHSHHPSHPSHPNHPNHHGHPHHHGPPVMGHLSNAMSMGLAGMPNMTNKITKKVSQLNR 

IVPIQEKNSSGSTSRSMAAISLGFIVMVTPWTIQEIVTACTGSKASVLYKGLLCDRFLPPFLDFLVTWTALSNSLWNPFMYWLLNSDFRRMSRQLMPNKCFPHEDTPEHKSG 

CCHINANDFEITTI»PIPPEPPSSRPPGGAGGGGGGPSSGGPSSGGGVASSIGGSVLGICGRSRTNSl»SRSASQYIRGTMGGGVHNHQATETFSTTRPDIEGLSEKYWGEILE 

RTVSSGNLNAMQKHLPPHLPYAHHLVHQIHQPQQHHQQQTTSFSKASDLQLNLNLSQAATAAELGKFSNSEPKLCEHLFHDYAEDVILAKNQLLARQAKCSHHPLHQQAKTR 

LHSHTGSLRFSRIRAACHSPPPEKSAFPMAEYRK* 

(SEQ ID NO: 153) 

Name: Biogenic amine receptor-like 
Classification : G_protein_linked_receptor 



Celera Sequence No. : 142000013384618 

GATAAAGCCCAATTACAGAACCTAATACATACACCTAAAATGACTAATTGACATCCATTGTGCGTATTACATAATAATGAATAATTTAAGTGATAGTTGTTTGAACTTACAG 
AACACTTCTTAGTATTGCGGACTCATCCCATCCAACTGTTGACTCATTTCAAATTCTATTTAATTATATTTCTGGTCCTGGCACGAACAATAAATTGAGTTTATATGTATTA 
CGCCCCTTCTTGTGGTCTCACTTTATGAAAAAGTTAATTGCGTGTAGTTGTTGGCTTTGCTAGATGGATTTTTATTGTGCCGCTTTACTGTCTGCTTTTTTCTTCATTTTCC 
ACTCGCCTTTCATTTTTCATCCGCCGGAAAGCGAATCATTTTTTCGCACGCTGAGCAGTGCAAAGAAAGCAAACACAAGCAAACGAGCAGAATTTCATATAGTTGATTGCAT 
TGTCTGCAGGAAGATGTTCGCACGATGATGCAGGACATGCAGGACTAAGGACTTCGAAGCGAATGGTGTGGGCTGGGGCAGGGATAGTCTCATGAAATGCTGATAGCATGCG 
CTCAATATTGTCAGTAAATAATACGCGCTTGTTAGAGCCAACTGCATACCAACTAAACTGAACACAGTGAGAATAAACATAACAGAAAGTTGAATTTGCTTAATATATAATT 
TCATAAATTTGTCACACCAAAATTCGGATTTAAATTAAAAGTATAATTATATTAATAACATTATTTAATATAATTATAATTATATATAAAAATATATTTTTATATGTATGTT 
TCATTTGGTTGCTGAAAACCAGTAACTAACTAATCTTTGTGGGCTATCCAAATCTCTTCATTGCAGAAACCGCAGCGTGCAACATGTGACGTCACAGGCATAAAGTATGGAG 
AGCATATTAAAATATTTGCCATTTAGCTGGTTTAGCATGCATTGTTGCATTGACTGCCACATAACAAGTGGAAATTAGCAGGCAGCTACTGTAGACGAATCGAAATGGAAGG 
TGGATGGTGGCGAGGAGGAGGAGGAGGAGGACGTTTGGGCGGGAAAGCCATCATGGAAGGACACTCGACACCAAATGGGGCAGCTGCAAGCCACAGAAACAACAGCACAAGG 
ACCAATATCGCGACCAACGGCTGTGCCCATTCGGGCATCCTGTTATTTGTGCTGACAGCTATGACACTAACGAGCTTAATAACGCCCACAGAGCAGCTGGCAGTGGCGCCAA 
ATGGCACCACACTGCATCAACTGGAGTCCGTGGAGTCCGAGTCGTATCCATCCATAAATGGCACTCAGAATGAAACAATGGTGACCAGTGTGCGACCCCACCTGGACCACAG 
GAATCGACCGACGCAGCAGAATGGCAGTCACTATTTGGAGTACGACGATGATGGCCCGGACTGTTCGTACAGCTACAACTTCATCCTGAAGCTCATCACGATGATCCTGTAC 
GCACTGGTCTGCATCATTGGACTCTTTGGCAACACCCTGGTGATCTATGTGGTGATGAGGTTCTCCAAGATGCAGACCGTAACCAACATATACATACTGAATCTGGCTATCG 
CGGATGAGTGCTTCCTGATCGGCATTCCCTTCCTGCTCTACACAATGCAGGTGGGCAACTGGCCCTTCGGCAACTATATGTGCAAGGCCTACATGGTGAGCACCTCGATCAC 
CTCCTTCACCTCCTCGATCTTCCTGTTGATCATGTCGGCGGATCGCTACATAGCCGTTTGCCATCCCATATCCTCGCCTCGCTACCGAACGCCCTTTGTATCCAAGTTGGTT 
TCGGCCTTCGCCTGGATGACATCCGTGCTGCTGATGCTGCCGGTTATCCTTTTTGCCAGCACCGTGCAGTCGAGCAACGGCAATGTGTCCTGCAACATCGAGTGGCCAGACA 
CTCAGAACTCGCACACCGACTCCACCTTCATTTTGTACTCGCTGGTCTTGGGATTCGCCACTCCACTGACTTTTATCCTGGTGTTCTACTGCCTGGTGATCAGGAAACTTCA 
CACCGTGGGACCGAAGCACAAGTCTAAGGAGAAGAAGCGCTCTCACAGGAAGGTCACCAAGTTGGTGCTCACGGTAATAATATATGACATTTCGATTAGTTGTGGAACTAAC 
TACT ACCTGACCACTCTTGTGTTCAGTTTGATAAATATTATTATTAC ATT ATAAATATATATTTTTTTTTGACATTTTGAATTCGACTATAAATTAATTGTTTTAAATTGAT 
ATATAGAAAGAATTTTATAATATTTATTCAATACAATTTATGGTTTTTTGATTTCTCCACAGGTCATAAGTGCGTACATATTTTGTTGGCTTCCACACTGGATTTCACAGGT 
AACTAATTGTATACATTACAGTCTAAAAACATTTATCTTTGTGTACTTTAAAAGGTTTTAAAACCATAAGCTATTTGCCTACTATTATCCTTTTGGATTGATTTTTAAATTA 
TTATAGCTAAAACAAATTACGAGATTTGTGGAAATTGACATGAACTGGACTTCTTTGGCAGATCATTTGTAGATTCGCTTTTTATTGACTATTTTCCAGGTGGCTTTGATCA 
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GCTCCGCTCCTCAACGCTGTGCATCTCGTCTGGAGCTGGCCGTCTTCCTGGCCTGCGGATGCCTCAGCTACTCCAACTCGGCCATGAACCCAATACTGTACGCCTTTTTGAG 

CGATAACTTCAAGAAGAGCTTCATGAAGGCCTGCACGTGTGCTGCCCGCAAGGATGTGAATGCCCAGTTGCAGCTGGAGAACAGTTTCTTCCCCAAGTTTGGCAAGGGCAGG 

CAATCGGAGCGTCTTCTTGGTGGCAATGGAAAAGGTGGCGCCCAGCGTGGGGCATTAACCAAGAAGAAGTGCTTGGCGACGAGAAACAACAATGCTCCGATGGCCACTACAA 

CGACGACGACAACCACCACAACGGGCACGGATGCAGTGACCTGTCTCCAGCCGCCAGTACACCAGGTGCCAGCCGAGATCCAGGTGGGAAATCCGGCCACCGTGCTGGTGGT 

CAATGCTGAGACCAACAACTGTAAACCGCCCGTGCTCCACACGGACTTATAAGACCGGGCTCCCTCGATGCCCCTGGAAACGGTGGTCTTCATCGCGCGGAGGTGAGACCAT 

CAGGTTGAGCTGGACCTCGATACGGCCATCGATTGCCAGGCAATAGCGAGGCAGCCGGAATGCTTGCTATAGGTCGCCTAAAATGATATTAACATTTTATACCAATATGGCA 

GAGCTTTGCAAAAACGGGTAAACTTAATCCCATTTTTATATGATCGAATCTCAGCAAGAAGAAATTTTTAAAAACCCGAATAAA 

(SEQ ID NO: 154) 

Exon: 1001.. 2332 
Start ATG: 1001 

Transcript No. : CT33159 

ATGGAAGGTGGATGGTGGCGAGGAGGAGGAGGAGGAGGACGTTTGGGCGGGAAAGCCATCATGGAAGGACACTCGflCflCCAAATGGGGCAGCTGCAAGCCACAGAAACAACA 

GCACAAGGACCAATATCGCGACCAACGGCTGTGCCCATTCGGGCATCCTGTTATTTGTGCTGACAGCTATGACACTAACGAGCTTAATAACGCCCACAGAGCAGCTGGCAGT 

GGCGCCAAATGGCACCACACTGCATCAACTGGAGTCCGTGGAGTCCGAGTCGTATCCATCCATAAATGGCACTCAGAATGAAACAATGGTGACCAGTGTGCGACCCCACCTG 

GACCACAGGAATCGACCGACGCAGCAGAATGGCAGTCACTATTTGGAGTACGACGATGATGGCCCGGACTGTTCGTACAGCTACAACTTCATCCTGAAGCTCATCACGATGA 

TCCTGTACGCACTGGTCTGCATCATTGGACTCTTTGGCAACACCCTGGTGATCTATGTGGTGATGAGGTTCTCCAAGATGCAGACCGTAflCCAACATATACATACTGAATCT 

GGCTATCGCGGATGAGTGCTTCCTGATCGGCATTCCCTTCCTGCTCTACACAATGCAGGTGGGCAACTGGCCCTTCGGCAACTATATGTGCAAGGCCTACATGGTGAGCACC 

TCGATCACCTCCTTCACCTCCTCGATCTTCCTGTTGATCATGTCGGCGGATCGCTACATAGCCGTTTGCCATCCCATATCCTCGCCTCGCTACCGAACGCCCTTTGTATCCA 

AGTTGGTTTCGGCCTTCGCCrGGATGACATCCGTGCTGCTGATGCTGCCGGTTATCCTTTTTGCCAGCACCGTGCAGTCGAGCAACGGCAATGTGTCCTGCAACATCGAGTG 

GCCAGACACTCAGAACTCGCACACCGACTCCACCTTCATTTTGTACTCGCTGGTCTTGGGATTCGCCACTCCACTGACTTTTATCCTGGTGTTCTACTGCCTGGTGATCAGG 

AAACTTCACACCGTGGGACCGAAGCACAAGTCTAAGGAGAAGAAGCGCTCTCACAGGAAGGTCACCAAGTTGGTGCTCACGGTAATAATATATGACATTTCGATTAGTTGTG 

GAACTAACTACTACCTGACCACTCTTGTGTTCAGTTTGATAAATATTATTATTACATTATAAATATATATTTTTTTTTGACATTTTGAATTCGACTATAAATTAATTGTTTT 

AAATTGATATATAGAAAGAATTTTATAATATTTATTCAATACAATTTATGGTTTTTTGATTTCTCCACAGGTCATAAGTGCGTACATATTTTGTTGGCTT 

(SEQ ID NO: 155) 



MEGGWWRGGGGGGRLGGKAIMEGHSTPNGAAASHRNNSTRTNIATNGCAHSGILLFVLTAMTLTSLITPTEQLAVAPNGTTLHQLESVESESYPSINGTQNETMVTSVRPHL 

DHRNRPTQQNGSHYLEYDDDGPDCSYSYNFILKLITMILYALVCIIGLFGNTLVIYWMRFSKMQTVTNIYILNLAIADECFLIGIPFLLYTKQVGNWPFGNYMCKAYMVST 

SITSFTSSIFLLIMSADRYIAVCHPISSPRYRTPFVSKLVSAFAWMTSVLLMLPVILFASTVQSSNGNVSCNIEWPDTQNSHTDSTFILYSLVLGFATPLTFILVFYCLVIR 

KLHTVGPKHKSKEKKRSHRKVTKLVLTVIIYDISISCGTNYYLTTLVFSLINIIITL* 

(SEQ ID NO: 156) 

Name: Somatostatin receptor-like 

Classi f ication : G_protein_linked_receptor 



Celera Sequence No. : 142000013383997 

GATAAAAGCATTACATTTTCGAGTCTGCACGATCGTATCATCAGTTGTTTGCAGGACCTTGAGTTCTTCATACAAAGTTTTAGCTGCTCATTTTTAATCTGTTCCTTGTGCA 
GAGATTACACTTCTCCGAAAAGAGCCGACACGTAAATTGGAAAAGGGTTAGTGCAACTGTTGTTGCAATTACAATTCATCACATAGCCCGTAGTTCCTCTTCCTGCTGCACA 
GAAAGAAATTGGTATGTGGCGATCGTTGCAATGCCATGCAATTCAATGCCCAATTCAAAAATATTTGAAATGTGTAAACAACTCACTTGTTTTCTTTTCTGCGTGCATGTGG 
TGCTCCTGGCCATTTGCCATTTCGCCAGGCATAATCCCGACAATCGGTGCGATGTGGGTGATACGAGTGCGAATGCGAATGCGAATGCCAATGATATGTGTATTGTATGTAT 
GTGCATACATATGGCGTCGTAGCGTGTGGGCGTGATGTTTGCACTCCGGTGCAGCGGGACTATAAATAAGCCGGGTATTTATTAGGGGCAATCATAAACTGCGTTAACGACG 
GGCACTCTGCCAACGAGAATAAGATCCGGTAAACAAAGGCACCGGTCGACGTACGTCTCATTGTGCGATAAGCTTTCAATTAAATCTGACCCGGATAACGGGGCGCATGGGT 
GCGGGAAATCGGAAATCGGAAACGAAAACGAAAACGGAAGCGGAAATCGAAATCGAAATGGAAAGAGATCAATTTTCAATTGCAGCCAATGCATGCATGTCCATGGGTCCAA 
TGCTAATATCGAAAGACAAAGCTCCGTGTTCTGGAGGTCGTGTTCGCCATGCAGACTCATTGGTTGGTTGGGTTTAACTCGTACAAAGTTTGATGGGTTTGGGATTGTTTTT 
TTTTTTTTTTTAGAACGATATAACTCGATATAATGCTGGTGCGTAAATAACAACGATGTTCATTGTTCGCTTTGCAGCACATTTACTATGCGGTCGATGGCAAAATGACCCT 
CCTGTCGAACATTCTCGACTGCGGAGGCTGTATTTCCGCCCAGCGCTTCACCCGCCTGCTGCGCCAGTCCGGCTCATCAGGACCATCCCCATCTGCACCGACGGCCGGAACA 
TTTGAATCAAAATCCATGCTGGAGCCAACATCCTCGCACAGCCTGGCGACCGGACGCGTGCCACTACTGCACGATTTCGATGCCTCGACAACGGAATCGCCGGGAACGTATG 
TCCTCGACGGTGTCGCCAGGGTGGCCCAATTGGCCCTGGAGCCCACCGTCATGGACGCACTGCCCGATTCGGACACGGAACAGGTTCTCGGTAATCTCAACAGCAGCGCGCC 
CTGGAATCTAACGCTTGCGTCGGCAGCGGCCACCAATTTCGAGAACTGCTCGGCGCTTTTTGTCAATTACACCCTACCACAAACAGGTACCTAAAGCCTAATAACTTTTAAA 
TCATATTGCAGTATTTTTTTTTTTTTTTTTGTTGCCGGGATAGGACTCTACTGCAATTGGACCTGGGACACATTGCTCTGCTGGCCACCCACTCCGGCTGGAGTCCTTGCAC 
GGATGAATTGTCCTGGCGGCTTTCATGGCGTAGATACGCGCAGTAAGTAACTGAAATACACTCTAAAAAAAAAGCAGCCGAAAAAGAACATGCTGAGTACAATTAGCATAAT 
TAATCGTTGAAGCTGTTTAGCTGTTTAGCAGCCACAAGAAGCTTTATGACTTCTGCCGGCTTATTAGGAGTCAAAAGACATGTCCTACGGTGTTTATCCCATCCACACGGCA 
TATCTGGCGGTCACAATCATACGGATCATTAGCTGGGAAACGGTTCAAGTGGTGCACGGCAAACAATTACTTGATTCCGTATTCATCGAAAGGATGTCAATATTTACATGCA 
TGACTGGGGGCGTCCAATCAACCGCACTCCCAGTCGGGGCCGCCCCTCCTTGGCTTCCATCGCCGGGGGCTGTCCAGGTGTTTCATGTGCACCCTCGAGCTCCGCCAGTTGA 
ACAGGGTTTAAAACACCAGCCAATCGATTGCTTAATTACGATCCTAAATGCTTGCTCTACCCCTCCGATAAATTAAACGTTCTGCAATGGGGAATTGCTATTTAATAACCAT 
GGGCGATATACACTGAGCGAAAAGATTCTAAGTAGCTGACTTACCACCCAATATTTTACAAACTTAAAATGCTCTTCCATTATCACTAGTGAATGTATTATCTGCGCTTGTG 
GTGCAAAATCTAGGCATATCGAACTATATCTGTTAAGAAATAATTATTCGAGTGCATTGTTGAGCTACGGCCTCATTTCTCCGGGCCGTGTCTAGATTTTTAGCCCCAATAT 
TTTCAACTGGACGAGGTCGGTCGCCCATTCGATTGGCATAATTGCACTTGTCAGTCAGTGTGTCAAAGCTCGATTACTGTCATTAATCCAACCCAACCTCCCTCTTTTTCGG 
TTCGCAGAATTCGCCATCCGAAAGTGTGAGCTGGATGGTCGATGGGGCAGCAGGCCAAATGCCACGGAGGTGAATCCGCCGGGATGGACGGACTACGGGCCGTGTTACAAGC 
CGGAGATTATCCGTCTCATGCAGCAGATGGGCAGCAAGGACTTCGATGTGAGTTGAATTCTTTCCACACTACATCATAATACCTGTTTCAGTTATAGTTAAGAGCATTGCTT 
AAATGCCAACCAAAATCCCAAATCAAGGGTATGGTTTTGAAATTACCTAATAGGGTTCCAAGTTTTTTAGACACCCTAAAAAGTGATTTCAAAGGCCTGATTACATTTGTTG 
ATAGTGGTTTATAGTAGTAGTTTTAATTCCATTTCTTCCAACTGGTTTGCGATATCCCTTAGGCCTACATAGACATTGCCAGGAGGACTCGAACCCTGGAGATCGTGGGCCT 
CTGCCTCTCCCTGTTCGCCCTTATAGTTTCCCTGCTGATCTTCTGCACATTTCGCTCGCTGCGAAACAATCGCACCAAGATCCACAAGAATCTTTTCGTCGCCATGGTGCTG 
CAGGTGATCATTCGCCTGACCTTGTATCTCGACCAATTCCGGCGGGGAAACAAGGAGGCGGCCACCAACACGAGTCTCTCTGTCATTGAGAACACGGTAGGCTTAATATTCC 
CGCATAAGGATAAGGTATAAGGCTTATGTTCGATATTATATGTTATGTTCTACAGCCCTATTTGTGCGAAGCATCCTATGTACTTCTGGAGTACGCTCGTACCGCCATGTTC 
ATGTGGATGTTCATCGAGGGCCTTTACCTGCACAACATGGTCACCGTGGCCGTTTTCCAGGGCAGCTTTCCCCTCAAGTTCTTCTCGCGACTCGGCTGGTGTGTGCCCATTC 
TGATGACCACCGTGTGGGCGAGATGCACGGTCATGTATATGGACACCTCGCTGGGCGAATGCTTGTGGAACTATAATCTCACGCCCTACTACTGGATCCTCGAGGGGCCACG 
ACTAGCGGTCATACTGGTGGGTAATATCAACTGTTGATCGCTTTAATCACCTATTCACCCAACTGAAAACATCCGCATGCAGCTAAACTTCTGTTTCCTGGTGAACATTATC 
CGAGTGCTGGTAATGAAGCTGCGTCAATCGCAGGCCAGCGATATAGAACAGACTCGCAAGGCAGTTAGAGCGGCTATAGTCCTACTACCACTTTTGGGTATAACCAATCTCC 
TGCACCAGCTGGCTCCTCTGAAAACGGCCACGAACTTCGCGGTCTGGTCGTATGGCACCCACTTTCTCACCTCGTTTCAGGGATTTTTTATAGCGCTAATTTACTGCTTTCT 
AAATGGCGAGGTAAGACATCAAGTCACTCAGTTGTCCATTTGCCTAAAACCCTCACTATTTCCAACCAAAATATATATTTTTTAGGTTCGTGCCGTGCTACTAAAGAGTCTG 
GCCACCCAGCTGTCGGTGCGAGGTCATCCGGAATGGGCGCCGAAAAGGGCATCTATGTACTCGGGTGCTTATAACACGGCGCCGGATACGGATGCAGTGCAGCCTGCAGGAG 
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ATCCATCGGCCACTGGAAAGCGGTAAGCATCCGCTTTGTGCACACCCCACCTTGAAACTCCTTACATCAATTTAAAACTCCCAGAATATCACCGCCGAATAAAAGGCTGAAT 

GGAAGAAAGCCGAGCAGTGCCAGCATTGTGATGATTCACGAGCCTCAACAGCGCCAGCGACTGATGCCCCGGCTGCAAAACAAGGCGCGGGAAAAGGGCAAGGACCGGGTGG 

AGAAGACGGATGCGGAAGCGGAGCCGGATCCGACCATCTCCCACATTCACAGCAAGGAGGCGGGCAGCGCGAGATCGCGAACTCGCGGCTCCAAGTGGATAATGGGCATCTG 

CTTCCGGGGTCAAAAGGTACTAAGAGTACCGTCAGCGTCATCCGTGCCACCCGAGTCAGTTGTATTTGAGTTGTCAGAGCAGTAGACCAAAACCTAGCCTAACTAATAGATG 

TAGCCATAGCCGTAGCCAAATAGCCGTAGAAGTAGCCAACGCCATGTAGCCCACCACCACAGTAGCCAGCACCCACACAATCGGACCGGAAATGATCCCAAAGATAACCCAG 

AACCCGACTATTCCTGCATCTCCCATCGACCCACCAAGCAAGCCAGCTGGAAACAGCACATATGTAAAATACCCCAACAAAATATCGCAGTTCTTATACCCAAGATCATTTT 

GAACAGAAAGTAAGCTCATTGTTTTGTAATTCAAATTAAACCAATAGACATTTAATTTAAATAGTTATTTTCGCTTGCCTTCGGTTCTTGCTCCACTGTACTTATGTAAATA 

CATACATACCTTTATTTGTGTAAGCCATATCCAAACCCTTTCCAAAATCTAGAAATCTCCTATCCTTTCGAAATCGGCTAATCCTACGAAAATTCAAATTTCCCTACAAGTT 

TTACAAAATATTTTTGATGACAATAAATTTCCAATAGCGTCACCTTCTGTTCTGGACACGCAAATACATTCAAAAACACTCGCACACGGTTCTTACTCGATGAATTTTTCAG 

CAAAACTTCAGCGGCATGTTCTTTTACTTCTACTATTCATATTTAAATATTAGATTGAAATATTTAAGTGAATTAACACAAACTTGCGGTAACTGCGCAGTGTGGCCAGGCT 

TTGTGAGCCATTAGAACTTATTACATAACATAATTGAAAAGTGACATTGTTTAAAGGAACTCACCTTTAATCAACTGTTGAACTTGCTGTTCATCGGATTCCGTTATTAAGT 

TTTAAGATTTGGGTTTACTTGTATTTACATTTTCTTTACCACTTTGATTAGCTTACACTTTCCGTTCCTGAGCTGGTCACACTGCCAGCTACAATCGATCACGTTCACTGCA 

ACTGCCTTGAATTTTGCTAGCTGCGCTTTGTTGTGCAACAAAAACTTTGCCACAATCGATATTTAACTAATGGGGTC 

(SEQ ID NO: 157) 

Exon: 1001.. 1322 
Exon: 1500.. 1610 
Exon: 2472.. 2623 
Exon: 2863.. 3120 
Exon: 3192. .3488 
Exon: 3555.. 3818 
Exon: 3894.. 4054 
Exon: 4117.. 4453 
Start ATG: 1001 

Transcript No. : CT33238 

ATGACCCTCCTGTCGAACATTCTCGACTGCGGAGGCTGTATTTCCGCCCAGCGCTTCACCCGCCTGCTGCGCCAGTCCGGCTCATCAGGACCATCCCCATCTGCACCGACGG 

CCGGAACATTTGAATCAAAATCCATGCTGGAGCCAACATCCTCGCACAGCCTGGCGACCGGACGCGTGCCACTACTGCACGATTTCGATGCCTCGACAACGGAATCGCCGGG 

AACGTATGTCCTCGACGGTGTCGCCAGGGTGGCCCAATTGGCCCTGGAGCCCACCGTCATGGACGCACTGCCCGATTCGGACACGGAACAGGTTCTCGGACTCTACTGCAAT 

TGGACCTGGGACACATTGCTCTGCTGGCCACCCACTCCGGCTGGAGTCCTTGCACGGATGAATTGTCCTGGCGGCTTTCATGGCGTAGATACGCGCAAATTCGCCATCCGAA 

AGTGTGAGCTGGATGGTCGATGGGGCAGCAGGCCAAATGCCACGGAGGTGAATCCGCCGGGATGGACGGACTACGGGCCGTGTTACAAGCCGGAGATTATCCGTCTCATGCA 

GCAGATGGGCAGCAAGGACTTCGATGCCTACATAGACATTGCCAGGAGGACTCGAACCCTGGAGATCGTGGGCCTCTGCCTCTCCCTGTTCGCCCTTATAGTTTCCCTGCTG 

ATCTTCTGCACATTTCGCTCGCTGCGAAACAATCGCACCAAGATCCACAAGAATCTTTTCGTCGCCATGGTGCTGCAGGTGATCATTCGCCTGACCTTGTATCTCGACCAAT 

TCCGGCGGGGAAACAAGGAGGCGGCCACCAACACGAGTCTCTCTGTCATTGAGAACACGCCCTATTTGTGCGAAGCATCCTATGTACTTCTGGAGTACGCTCGTACCGCCAT 

GTTCATGTGGATGTTCATCGAGGGCCTTTACCTGCACAACATGGTCACCGTGGCCGTTTTCCAGGGCAGCTTTCCCCTCAAGTTCTTCTCGCGACTCGGCTGGTGTGTGCCC 

ATTCTGATGACCACCGTGTGGGCGAGATGCACGGTCATGTATATGGACACCTCGCTGGGCGAATGCTTGTGGAACTATAATCTCACGCCCTACTACTGGATCCTCGAGGGGC 

CACGACTAGCGGTCATACTGCTAAACTTCTGTTTCCTGGTGAACATTATCCGAGTGCTGGTAATGAAGCTGCGTCAATCGCAGGCCAGCGATATAGAACAGACTCGCAAGGC 

AGTTAGAGCGGCTATAGTCCTACTACCACTTTTGGGTATAACCAATCTCCTGCACCAGCTGGCTCCTCTGAAAACGGCCACGAACTTCGCGGTCTGGTCGTATGGCACCCAC 

TTTCTCACCTCGTTTCAGGGATTTTTTATAGCGCTAATTTACTGCTTTCTAAATGGCGAGGTTCGTGCCGTGCTACTAAAGAGTCTGGCCACCCAGCTGTCGGTGCGAGGTC 

ATCCGGAATGGGCGCCGAAAAGGGCATCTATGTACTCGGGTGCTTATAACACGGCGCCGGATACGGATGCAGTGCAGCCTGCAGGAGATCCATCGGCCACTGGAAAGCGAAT 

ATCACCGCCGAATAAAAGGCTGAATGGAAGAAAGCCGAGCAGTGCCAGCATTGTGATGATTCACGAGCCTCAACAGCGCCAGCGACTGATGCCCCGGCTGCAAAACAAGGCG 

CGGGAAAAGGGCAAGGACCGGGTGGAGAAGACGGATGCGGAAGCGGAGCCGGATCCGACCATCTCCCACATTCACAGCAAGGAGGCGGGCAGCGCGAGATCGCGAACTCGCG 

GCTCCAAGTGGATAATGGGCATCTGCTTCCGGGGTCAAAAGGTACTAAGAGTACCGTCAGCGTCATCCGTGCCACCCGAGTCAGTTGTATTTGAGTTGTCAGAGCAGTAG 

(SEQ ID NO: 158) 



MTLLSNILDCGGCISAQRFTRLLRQSGSSGPSPSAPTAGTFESKSMLEPTSSHSLATGRVPLLHDFDASTTESPGTYVLDGVARVAQLALEPTVMDALPDSDTEQVLGLYCN 

WTWDTLLCWPPTPAGVLARMNCPGGFHGVDTRKFAIRKCELDGRWGSRPNATEVNPPGWTDYGPCYKPEI IRLMQQMGSKDFDAYIDI ARRTRTLEI VGLCLSLFALIVSLL 

IFCTFRSLRNNRTKIHKNLFVAMVLQVIIRLTLYLDQFRRGNKEAATNTSLSVIENTPYLCEASYVLLEYARTAMFMWMFIEGLYLHNMVTVAVFQGSFPLKFFSRLGWCVP 

ILMTTVWARCTVMYMDTSLGECLWNYNLTPYYWILEGPRLAVILLNFCFLVNIIRVLVMKLRQSQASDIEQTRKAVRAAI VLLPLLGITNLLHQLAPLKTATNFAVWSYGTH 

FLTSFQGFFIALIYCFLNGEVRAVLLKSLATQLSVRGHPEMAPKRASMYSGAYNTAPDTDAVQPAGDPSATGKRISPPKKRLNGRKPSSASIVMIHEPQQRQRLMPRLQNKA 

REKGKDRVEKTDAEAEPDPTISHIHSKEAGSARSRTRGSKWIMGICFRGQKVLRVPSASSVPPESWFELSEQ* 

(SEQ ID NO: 159) 

Name: Calcitonin receptor-like 
Claasification: G_protein_linked_receptor 



Celera Sequence No. : 142000013385192 

GCTTACAAAACCAAAAACCAGATTTTGTGCTTCAAATCACAAACATAACGAAAGAAATATGAATGGAAAACAGAGGATTTTATCATCACTTGAAGGCTGACAAAACATTTGG 
CACAATTTACATTGTCCGGCGATTTATTGCCGTCTAGAGACAAAAAACACTCCGGAAAATTGTCACTTTGCTTGCAGTAAGAAAGTAACTTTTAATTAGAGGTTACTGCAGC 
TATGCAGCTGCTACTGCGGTTGCTACTAAAAGCAAAAGTGAAGATATATCCTTTGAATCTCAGCACAAATTGCATTCTCCGTATTTTCTCATTTGTCTAATGGAGTTATGTC 
ACCAAGAAATGCGTGAGTGTTACACTCACAAAATATGTTCTATTCCCAAGAAATTCTTCTCTGTAAGTAAGTGTAAAAGTGTGTACACTAACTCTGTTTGTATTAAAGACAA 
ATGCATAAGAATGCCTTAAATTCGTCTACGTGTAACATGTAAGCCAGAATTTTTGTTGCATGTTACACGTTTTCCCACAAATTGCCAAAATCATCTGCTTCTGCCGGCCCAC 
TAGTCGTATGCGTATTATTGATTACGGAACGAGGCCTTGTTTTCTTACCATTTCGGACAAATGGTAATGACATTTCATTGTTGATTCGTTTGAAATCGTACCTTTGACCTGC 
CCTGAACTTGGACATCGTTCTTTGCATGTGGCATGCAATGCCATTGTATTTGGGGAATTTCGAAGCTTAAATTCTGGTAAAATTCAAACCACAACACTGAATTCGTTTTTTA 
TTCACCTAACGCCTGATGTTTATTTGCAGGTAGCAGGTAGCACTTTCGCCACGCAGCCTGCATATTTGAGTGTTGGAATTTCCGAAAAGGGAATTTAAATGAAACGCCAGAG 
GAGCAGCGGGGCGCACTGAGTTGGCAGCCGGACACCGCGGTCAATCAAACGGAGTCCTTGCAGGACATCCCACCACCGACTACCATCTGGCCTTCATTGACACCATGGTCAC 
GAACATGTCGCAGCCGCATTATTGCGGCACCGGCATCGATGATTTCCACACAAAGTGAGTAGAACGAGGACGTGGACTTAGGCCTGGGCACATAGCCCCTGGGTATGTCTGC 
ATATTAATTAGTTTTTTAAGTGTAATTTACATAATGATTTGCCAGCCGGGGGGCGCATAATGAAATCATGTGTGTAATTGCCTACGGCCGTGGTGGCAAAAAGGCTAACGAA 
GTGTGCCTAATGTGGATGGGCCCCTTTCGGATTAAACTGAACAAATACTCGCATAAATCCATAAAGCAGATGTTAAAGGTCGGAGATGAACATTTTGAAAAATATATTGAGT 
TCAATCTAACCAACCATGTTGTTTATTCGGTTTCTTTCTCTACATAAGCTCTAATTACTTTAAGAATTTAATCATACCCAATTTTGTGCTCAAGTGAAACTGGTTTTAAATA 
TTTATCCCAATTTTTTTTGTCTTTCTCCCCAACAGCTACAAATACTTTCACGGTTAC7TCTCGCTGATTGTCTGTATCCTGGGAACCATTGCGAATACCCTAAATATCATAG 
TGCTGACCCGACGGGAGATGCGCTCCCCCACAAATGCCATACTCACGGGTCTGGCTGTCGCCGATCTGGCTGTGATGCTGGAATACATACCCTATACGGTGCACGACTATAT 
CCTCAGTGTAAGGCTGCCGCGAGAGGAGCAGCTCAGCTACAGCTGGGCGTGCTTCATCAAGTTTCACTCGGTATTTCCCCAGGTGCTGCACACCATCTCCATTTGGCTAACG 
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GTGACGCTGGCAGTTTGGCGGTACATAGCGGTAflGCTATCCGCAAAGGAATCGCATCTGGTGTGGAATGCGAftCTACTCTGATCACCATAGCCftCGGCCTATGTTGTCTGTG 
TCCTGGTAGTGTCACCTTGGCTGTACCTAGTCACAGCCATTGCCAAGTTCCTAGAGACTTTGGATGCCAATGGCAAGACGATTGCCTCAGTGCCATTGAGTCAATACATTCT 
GGACTACAATCGGCAGGATGAGGTGACCATGCAGGTCATGTCGAGTACAACGCCAGATGTTTCCTGGGCGATACCAAGTGATTCGGCCAATGGAACTGCAGTTAGCTTGCITA 
AGTCTAACCACAGTGATACCCCTAACCACATTAAGCACTGGAGTAACCACATCCTCGTCGTTGGGTGAGCGCAATGTGACTGTCTATAAGCTGTATCACAGCGCACTGGCGC 
TGCGTGATCGGCAGTTCAGGAATGCGACCTTCCTTATATACAGTGTCCTGATCAAGCTGATACCCTGCTTCGCACTGACCATTCTGTCTGTGCGGCTCATCGGTGCTCTGTT 
GGAGGCCAAAAGGAGGAGGAAGATCCTGGCCTGTCATGCAGCCAACGATATGCAGCCAATTGTCAATGGAAAGGTGGTGATTCCGACGCAACCCAAGAGCTGTAAACTGCTG 
GAGAAGGAGAAGCAGACCGATCGCACCACGAGGATGCTTCTGGCGGTACTGCTGCTCTTCCTGGTCACCGAGTTTCCACAGGGCATTATGGGTCTGCTGAATGTGCTCCTGG 
GCGATGCCTTCTTTCTGCAATGTTACCTAAAGCTGAGTAAGTATTTTAGGATTAAGACACCCACTCCTCCGCTGTCTGTAACCCGTAATTGCTTTATTGGCTATGATAATTG 
ACTGAATTTATGGTTGGTGTTTAGGTGGGCCCAGTTTGGCACATTCATCAATCATACGCACCGTTGGCTGCCATAATTGCACCACAAATTTGAGGCTTCAATGAAGGCCTGG 
CATTGACGTGTTCAATAAGTGTACATTTTGTTTATCTCGTACACCCAAAATAAAGTAGTCTTACATTTAAAACTTCGCAGATACTATTAAATTGAGAGATTTTTTTCTTTAA 
TTTGAGTAAAAGTATAAAATTGAAAAAAAAATATTTTAAACATTTAATATGTAGTTTTTAGCTAGCATATGTTTTTTCAAGTGTAGGAGGTATAAAATCTAATCGCCCTTGA 
ACTTTATGAAAGCCCTGCCCATAGATGGCGTGTTGGTGGAACCCGACATTTATGGCATTTCGATTTTTTCGTTTCAGGTGACCTTATGGACATCTTGGCGCTTATTAATTCG 
AGCATCAACTTCATCCTGTACTGTTCGATGAGCCGCCAGTTCCGGAGCACGTTCGCGCTCCTCTTCCGTCCGCGCTGGCTGGACAAATGGCTGCCGCTGTCGCAGCACGACG 
GCGAAGGGAGGGTGGGCGGAAGTGGCGGCCTGGGCGGCTACGGCGGATATGGACGGCAGCGGTTGCTGCACACGGATGCCGTTAGCAAGAGCATGGCCATCGATCTCGGGCT 
GACGACCCAAGTGACAAATGTGTAGCAGGAGAGCAGCGGCCGGGCGGCGATGTCAGCCGCAGCCGGCGGAGCAGCTGCATCGGTGGCTCTGGCCCTGGCAGCCACTGATGTT 
GATGGATGTCCGCCTGCCACCGATGCTGCTGTTTCCACTAACGACATCAGCCTGGTCGAGAAGCTACATTTGCAGCCCAGTCCAAGGGGGACTGCGATATCCAGTGGCCAGC 
ATCGAAGGCGGCGCAGTGGGAGTGGCACCAAGTGCATCTGGCCCACGACGGACTGGCTGAGAAAGCTGCGTAACCAGAAGGCTAGAGAAACGGAGCAATCCTCCGAACAGGA 
CATAGAGCTGGGCAAGAGCTCCATCAACAGGCGCAGCAGTGTTCTTCTAATGGTATTACTAAGCAGCTCCGATGAGGTTAAAGCCAAGGCGGTTTTGGTCAGTGAACAGCCG 
CCGAGTCCAGCGGACGAGGATGTGGAAGACGCCATTGACGCCCTCTGGCTGTGAGACGACCCCTCACTTCCATTTTGTACGCCCTGTAAATTATCCAAGTACCCACTAATAT 
TTAACTACCGTGTACAGACCAATTATTGAGACAAATTGTTATGCGCTGATTCCATTATTTTCCCGCCAGTGTTGTCTTGGCGAATCGAGAGCCCCTTTCGCTGAGACTTTGC 
TTTGCTGGAAAATGAGTGGATTTTCCTTGAAAGGACTCCCTGGAAGCAGCCTGCAACATTTTGTGACCACATTTGATAAGTGATATACTTGACTAAGTTTCCCCCCTTAATA 
GTTGTGTAAGTGCTAAGCGCGTCTCTAGATATAAACGAATTAGTCATAGTCTAACTATTAATTAAGATATATTGATGAAATAATTCGGAAGCTAAGAGATTAAATTTCTTTT 
TATATTGAATAAATGTTTTTTCGAAGTAAACTTGATTCAATTCACTTATTGTAAGTGTTATTGCGACACTTTGTTTTAATTAAACGGAATTTATATTGAACTTTGATAAACG 
AATGCTTTTGAATGCAT 
(SEQ ID NO: 160) 

Exon: 1001.. 1062 
Exon: 1492. .2093 
Exon: 2172.. 2612 
Exon: 3102.. 3385 
Start ATG: 1001 

Transcript No. : CT33298 

ATGGTCACGAACATGTCGCAGCCGCATTATTGCGGCACCGGCATCGATGATTTCCACACAAACTACAAATACTTTCACGGTTACTTCTCGCTGATTGTCTGTATCCTGGGAA 
CCATTGCGAATACCCTAAATATCATAGTGCTGACCCGACGGGAGATGCGCTCCCCCACAAATGCCATACTCACGGGTCTGGCTGTCGCCGATCTGGCTGTGATGCTGGAATA 
CATACCCTATACGGTGCACGACTATATCCTCAGTGTAAGGCTGCCGCGAGAGGAGCAGCTCAGCTACAGCTGGGCGTGCTTCATCAAGTTTCACTCGGTATTTCCCCAGGTG 
CTGCACACCATCTCCATTTGGCTAACGGTGACGCTGGCAGTTTGGCGGTACATAGCGGTAAGCTATCCGCAAAGGAATCGCATCTGGTGTGGAATGCGAACTACTCTGATCA 
CCATAGCCACGGCCTATGTTGTCTGTGTCCTGGTAGTGTCACCTTGGCTGTACCTAGTCACAGCCATTGCCAAGTTCCTAGAGACTTTGGATGCCAATGGCAAGACGATTGC 
CTCAGTGCCATTGAGTCAATACATTCTGGACTACAATCGGCAGGATGAGGTGACCATGCAGGTCATGTCGAGTACAACGCCAGATGTTTCCTGGGCGATACCAATAACCACA 
TCCTCGTCGTTGGGTGAGCGCAATGTGACTGTCTATAAGCTGTATCACAGCGCACTGGCGCTGCGTGATCGGCAGTTCAGGAATGCGACCTTCCTTATATACAGTGTCCTGA 
TCAAGCTGATACCCTGCTTCGCACTGACCATTCTGTCTGTGCGGCTCATCGGTGCTCTGTTGGAGGCCAAAAGGAGGAGGAAGATCCTGGCCTGTCATGCAGCCAACGATAT 
GCAGCCAATTGTCAATGGAAAGGTGGTGATTCCGACGCAACCCAAGAGCTGTAAACTGCTGGAGAAGGAGAAGCAGACCGATCGCACCACGAGGATGCTTCTGGCGGTACTG 
CTGCTCTTCCTGGTCACCGAGTTTCCACAGGGCATTATGGGTCTGCTGAATGTGCTCCTGGGCGATGCCTTCTTTCTGCAATGTTACCTAAAGCTGAGTGACCTTATGGACA 
TCTTGGCGCTTATTAATTCGAGCATCAACTTCATCCTGTACTGTTCGATGAGCCGCCAGTTCCGGAGCACGTTCGCGCTCCTCTTCCGTCCGCGCTGGCTGGACAAATGGCT 
GCCGCTGTCGCAGCACGACGGCGAAGGGAGGGTGGGCGGAAGTGGCGGCCTGGGCGGCTACGGCGGATATGGACGGCAGCGGTTGCTGCACACGGATGCCGTTAGCAAGAGC 
ATGGCCATCGATCTCGGGCTGACGACCCAAGTGACAAATGTGTAG 
ISEQ ID NO: 161) 



MVTNMSQPHYCGTGIDDFHTNYKYFHGYFSLIVCILGTIANTLNIIVLTRREMRSPTNAILTGLAVADLAVMLEYIPYTVHDYILSVRLPREEQLSYSWACFIKFHSVFPQV 
LHTISIWLTVTLAWfRYIAVSYPQRNRIWCGMRTTLITIATAYVVCVLVVSPWLYLVTAIAKFLETLDANGKTIASVPLSQYILDYNRQDEVTMQVMSSTTPDVSWAIPITT 
SSSLGERNVTVYKLYHSALALRDRQFRNATFLI YSVLIKLI PCFALTILSVRLIGALLEAKRRRKILACHAANDMQPIVNGKVVIPTQPKSCKLLEKEKQTDRTTRMLLAVL 
LLFLVTEFPQGIMGLLNVLLGDAFFLQCYLKLSDLMDILALINSSINFILYCSMSRQFRSTFALLFRPRWLDKWLPLSQHDGEGRVGGSGGLGGYGGYGRQRLLHTDAVSKS 
MAI DLGLTTQVTNV* 
(SEQ ID NO: 162) 

Name: TRH receptor-like 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384809 

ATAATTCTTATGAAAAATATAAAAATGGGCTGATTAAAATGATTGGTATTATGGAAATTATTATTGAAAAAATATGAAATTTACATTTTAAAAAGTGTTATTCGCGTTATTA 
CTCAGTTTGTAAATAGCGGGCATCAATTGTGCATAGTTTGAAGGTAAGTTGCCCAATAAATGGCGCAAAGCTCGTAACCGCGAATTATATATGTATGTACATATACGATAAA 
AATAAAGTGTGGCCTTTGCTTGGATAAAAAAATCGTCCGAGTCGCCGGATATCCATATTATCGCTTTCAGAAAATTCGATATCATCGATAGGTGTTTCCCTTTCAAAAGTCT 
TCGTCCGATTTGGTCCGTTTGACTACTTTTATCTGGTGCATACCCGATTTGACTGCTTGGACGGAGTGGGATGTGTGAATGAACACTAAAACGCTTGAAACTTGAGAAATTA 
GGGAAGTAGACGGGTAGCGTCAGTTTGTTTGAGAAGATTAACAAGCCCACAAACTGCACAAAACTTGCAAGGCCATTTTATTATTTGTCTATCCATTTTCGATCGATATGGC 
AACAACATTTAAGCCATGCCCACTCTAATGCGTCAATAAATGCGACTAATCCAGGCGGGCCCCCAAGTTAACTTGGCCCTCCTCATCGAGTCAAGCGCTTCCTTCGAAGAAT 
GATTTACCGTTGATCATACCGGGAAGCCTTTTTTGGTCTTCAGGGGAGGTAGAGACTACATGTGGTATTAAGATGAAAAAACTAAATGCGTACATCAAATCAAAAGTATTCT 
■TATTAATCCAAGTGAATTATAATATTTATTTTGGCACCTTGTAAATGTGACAAGTGATTAGAAATGTGGTTCTTCTTAAAGATATAAATGTACATCATAATATGCTAGAGTA 
CTGAGGCACTATTATTTATATACTCTCCCGTGTGCGACAGACTAAATGATTCATAATGAATATGAATACCAGAGTTCACAGATGAGCAATAATCTCAGTTCCTGTTAAAGTG 
ATGTCACTGTCGGCTTGTAGGGAGAGTCTGTGAAGGAAAATATCAATGTTAAAAGAAGATAAATTAAAGCAATAAGATGAACACCAACCTGAAGATTTTAGATCGCCTTTCT 
CATCGAACTTGTATACGCGAGCATTAATAATACTCAGATCTGTCTGACTTGATTCCCTGTAAAAAAAAAAAATGGGCAATTAAAATATTATAGCTTCTATTCGAATATTTCA 
ACTTACACATTAATGGCCCACCAGGATTTGACGTGCAAATTCTTGCCAAGGAGCAGAATAAAACCAGCTATGGCCAGGCCCAGAATGCAAATTCCACTCCAGAAGTCAATGA 
TGAAGAAATGCTCTCTCACTCTGGCCATTATATACACGACTATTTCTCGGATCCAGACCGCAAACACGCCCACTATGTAAGCGCTAAAGGCATAGAAACTGCAGGAATATAA 
AAGATAAATAGGTCTATAATAAAAACAGCTTTTAGGAAATCTTACTGCGTCTTGACGATTTTCAGTTGAGTGCCCAATATGTACTGAGTATCTGGTGGCAGTTCGTAGATGG 
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AAAAGTACGCCTTGAGACAGCATACAATGCTGAATATCAGGAGGAGTGCAATGGGTCCGTAGTAGTAGATCAAAATACCCCAGGTGCGAACTGAAAATGCATAGATAATTCG 

AAATTAGACTTTGTTGTACTTTCTTATTCAACCTACCATCGAACCAGCAGTAATCTTCTCCCATGCCTGGCTTGAAGTGCTTGGGCAGCTTGGAGTCCTGGGCTAAGGAGAC 

CAAAAACCGAAGACCAATCACCGCCAGAACTGCCAGAGCAAGGGACAATCGACGGACGGAGCTCCTGACAGCCTTTTGTTTAAATTTGAGCAGAAAGTCAAGACTGCAAATG 

GCCAGAAAGACGAAGGAGAGCATGATGCAGAAGTAGGCCAGAAATCCTTTTGGGATGCATAGAACAAAATAAGTTATCCAGTTGAAGTAGACAAAATTACTCAACCTATGTT 

TCTGCAAGCCACATGCGAGAGCTTCATGGGATTCTTCAAGGCCAAGTAGACCAACAGGGAGTAACCAACGATCATGGACAGCAAGTAGTAGATGATTAACTGGCCATAGTGG 

CTTTTACGGGCATCTCTCACAGATCCAAGTAGGGAGAGTATGAAAATGTTTATAATGATGGCTATAATACTGCCTAGAAGGGAACAAGAGTTAGAATCTCTTCGAAAGTATT 

CAAACAACACGACTCACAAATGGCGTAAATCCAAACCCGGTACCCCGTTTGAAAGCGTTCGCAGTTGAGCGGAGTGAGCTCCCACTGCTCCGGATTGTGCTCCAACGGGCTA 

AAGCAATACTCATCCGTTGACCACAGACGATTGTCCCGCCGCAGAGTTCCATTCTCGAAGAGGTCCCATTGCCAGAAGTTATCGTGCTTTTTGTCCACGAATTTGTTCCGGC 

AACCCAGCTCTGTGCGCACCACAAAACGATCGCGTA.TGCGTACTTGATCCACAGTCCTGTTAGCGTAGGTGAGCTCCACGTGAGTGGACTCGCGGACTTGGTGTGGCATTGT 

GCAGTTCCAGTGCTTGGCATCAAAGACGAGGTTCTCCGGACMCAGAAACTGATGCAGGGCTTCAGCAGACAGACGCAGCCGCGCAGATGTTTCTTGGCCCGGTACTCCACT 

CCATCGATCACCTTGAAGGAGTACTCCGCCATCAGATGCGGTGGGACCACTACTCCAGCGTATGAGTAGGAGCCATCCTTCATCCGCAGTCCATCCGTAATGTTGACCGTGT 

GGGCGTAGGCACAGGGATGGTTTATACTGGCGATCTCCACTGACTTTGCGCCCACGGAGAGCCAAATAATTAAAAGAATTATCAATGGCGAGACCATGGCTAACTGATCCAC 

GCTTGTTGGATTTTTGGTCGATTCGTTCGTCGGCCTGACGTCACACGCTGTTCAGCTCGATAATTAAGTAAACAAATATGAGAAGCGATCGCCGTTCGCTTTGAGCAATGCA 

CGCACACTCGCCGAGTCAGAAATCTGCACCAACAACAATGGAAACCCAAAATTGACAACGAATGTGACGCGGGTCTAAAAACGGCTGCGATTGCGACAACGACACCGACAAC 

ACCAAAAAGAACAACCATAAACATAAAAGTTTGGTGGAGAGTCTCTCCTAGCTAGTCGTCTTGCCTCTCTCTAAACGACAAACCGGCAACCGCAACGGAGCTCCATGCTGGC 

CAflGAATAAAAAACAGACTAACTGATTAGCCGTTCCCAAAACCAGACGCGACCACAAGCAGCTGGCAAAAAGAGCGATTGGGAAGACTCACAACCCGTCCCGCCCCTCCTCC 

TAGAAACCGGCGTGGAATCTCAACCAGCTCGGAACTTGCACATGCATACCTTAAGTTGTAAGCATATAATCCATTCGAGCCATTTCCAGACTTATCTTGAGATAGATATAAA 

AGTTAAGAATATTAGATTAGATTATTTCCATCCATTTTCTATAATTTATTGCCTTTGCGACCGTAAAGAAAATCCAATTTTTAACCGACAATCCAAATGGCCCAAACGAATT 

TACCACAGCTTTAGTTTTTAAATATTTAGGTGCATTTCCAGGCTATACAAAGTTTGTCTACATGTAGGTGTGTGACCTGGCTTGATATCGTTTACGGTCTTCAAGAATATTC 

GAAAAGATTCGTTGATTTTTGACAAACAGATCCGCTGAGGAAAAAACGACGTTCATTTTATCTTTGGCTTTATTAAGCTTGAAATACAATTATAATTCTTGTATTGTGTTTT 

ACATTAGTTCAGAGATAATGCAGATATGGAGAATGGTACATTTATGCAAAGAGGGTATACAATCAACACCAATCTCGTTTTTAACAAGGCACATCTGTATGTTACGTATTTT 

TGCTGGATGTTTACCTTGGACTTAGGCTAATGGAGAATTACAAATTTGATCTCTCAGCTGGGTTCGGAGAACCTTACAATTTTGCTTGCATAAACAGTAGATATGGCTCTGC 

ATAAATTTCTTATACAACTAATCAGGTTGTTCGTGGTCACATGGCATTAAAAAATGAAAGAGAATTGGGCGGGGTGGGGTGGGTAAGGGTTTTCGTGGTAGTTCTGGGGGAT 

AGGGACTTTCTCTTATAGTAGTGTCTTTCATAAAGTTCTATCCGTATGTGGATGCTACTTCGCGAATTTA 

(SEQ ID NO: 163) 

Exon: 3214. .2258 
Exon: 2201. .2010 
Exon: 1950.. 1717 
Exon: 1658.. 1502 
Exon: 1442.. 1239 
Exon: 1176.. 1097 
Exon: 1035.. 1001 

Start ATG: 2897 (Reverse strand: CAT) 
Transcript No. : CT33414 

TCGTTTAGAGAGAGGCAAGACGACTAGCTAGGAGAGACTCTCCACCAAACTTTTATGTTTATGGTTGTTCTTTTTGGTGTTGTCGGTGTCGTTGTCGCAATCGCAGCCGTTT 

TTAGACCCGCGTCACATTCGTTGTCAATTTTGGGTTTCCATTGTTGTTGGTGCAGATTTCTGACTCGGCGAGTGTGCGTGCATTGCTCAAAGCGAACGGCGATCGCTTCTCA 

TATTTGTTTACTTAATTATCGAGCTGAACAGCGTGTGACGTCAGGCCGACGAACGAATCGACCAAAAATCCAACAAGCGTGGATCAGTTAGCCATGGTCTCGCCATTGATAA 

TTCTTTTAATTATTTGGCTCTCCGTGGGCGCAAAGTCAGTGGAGATCGCCAGTATAAACCATCCCTGTGCCTACGCCCACACGGTCAACATTACGGATGGACTGCGGATGAA 

GGATGGCTCCTACTCATACGCTGGAGTAGTGGTCCCACCGCATCTGATGGCGGAGTACTCCTTCAAGGTGATCGATGGAGTGGAGTACCGGGCCAAGAAACATCTGCGCGGC 

TGCGTCTGTCTGCTGAAGCCCTGCATCAGTTTCTGTTGTCCGGAGAACCTCGTCTTTGATGCCAAGCACTGGAACTGCACAATGCCACACCAAGTCCGCGAGTCCACTCACG 

TGGAGCTCACCTACGCTAACAGGACTGTGGATCAAGTACGCATACGCGATCGTTTTGTGGTGCGCACAGAGCTGGGTTGCCGGAACAAATTCGTGGACAAAAAGCACGATAA 

CTTCTGGCAATGGGACCTCTTCGAGAATGGAACTCTGCGGCGGGACAATCGTCTGTGGTCAACGGATGAGTATTGCTTTAGCCCGTTGGAGCACAATCCGGAGCAGTGGGAG 

CTCACTCCGCTCAACTGCGAACGCTTTCBAACGGGGTACCGGGTTTGGATTTACGCCATTTGCAGTATTATAGCCATCATTATAAACBTTTTCATACTCTCCCTACTTGGAT 

CTGTGAGAGATGCCCGTAAAAGCCACTATGGCCAGTTAATCATCTACTACTTGCTGTCCATGATCGTTGGTTACTCCCTGTTGGTCTACTTGGCCTTGAAGAATCCCATGAA 

GCTCTCGCATGTGGCTTGCAGAAACATAGGATTTCTGGCCTACTTCTGCATCATGCTCTCCTTCGTCTTTCTGGCCATTTGCAGTCTTGACTTTCTGCTCAAATTTAAACAA 

AAGGCTGTCAGGAGCTCCGTCCGTCGATTGTCCCTTGCTCTGGCAGTTCTGGCGGTGATTGGTCTTCGGTTTTTGGTCTCCT-TAGCCCAGGACTCCAAGCTGCCCAAGCACT 

TCAAGCCAGGCATGGGAGAAGATTACTGCTGGTTCGATGTTCGCACCTGGGGTATTTTGATCTACTACTACGGACCCATTGCACTCCTCCTGATATTCAGCATTGTATGCTG 

TCTCAAGGCGTACTTTTCCATCTACGAACTGCCACCAGATACTCAGTACATATTGGGCACTCAACTGAAAATCGTCAAGACGCATTTCTATGCCTTTAGCGCTTACATAGTG 

GGCGTGTTTGCGGTCTGGATCCGAGAAATAGTCGTGTATATAATGGCCAGAGTGAGAGAGCATTTCTTCATCATTGACTTCTGGAGTGGAATTTGCATTCTGGGCCTGGCCA 

TAGCTGGTTTTATTCTGCTCCTTGGCAAGAATTTGCACGTCAAATCCTGGTGGGCCATTAATGTGGAATCAAGTCAGACAGATCTGAGTATTATTAATGCTCGCGTATACAA 

GTTCGATGAGAAAGGCGATCTAAAATCTTCAGACTCTCCCTACAAGCCGACAGTGACATCACTTTAA 

(SEQ ID NO: 164) 

Start ATG: 318 (Reverse strand: CAT) 

MVSPLIILLIIWLSVGAKSVEI ASINHPCAYAHTVNITDGLRMKDGSYSYAGVVVPPHLMAEYSFKVIDGVEYRAKKHLRGCVCLLKPCISFCCPENLVFDAKHWNCTMPHQ 

VRESTHVELTYANRTVDQVRIRDRFVVRTELGCRNKFVDKKHDNFWQWDLFENGTLRRDNRLWSTDEYCFSPLEHNPEQWELTPLNCERFQTGYRVWIYAICSIIAI IINIF 

ILSLLGSVRDARKSHYGQLIIYYLLSMIVGYSLLVYLALKNPMKLSHVACRNIGFLAYFCIMLSFVFLAICSLDF1LKFKQKAVRSSVRRLSLALAVLAVIGLRFLVSLAQD 

SKLPKHFKPGMGEDYCWFDVRTWGILIYYYGPIALLLIFSIVCCLKAYFSIYELPPDTQYILGTQLKIVKTHFYAFSAYIVGVFAVWIREIWYIMARVREHFFIIDFWSGI 

CILGLAIAGFILLLGKNLHVKSWWAINVESSQTDL5IINARVYKFDEKGDLKSSDSPYKPTVTSL* 

(SEQ ID NO: 165) 

Name: mth-lixe 9 

Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384809 

CGAGCCATTTCCAGACTTATCTTGAGATAGATATAAAAGTTAAGAATATTAGATTAGATTATTTCCATCCATTTTCTATAATTTATTGCCTTTGCGACCGTAAAGAAAATCC 
RATTTTTAACCGACAATCCAARTGGCCCAAACGAATTTACCACAGCTTTAGTTTTTAAATATTTAGGTGCATTTCCAGGCTATACAAAGTTTGTCTACATGTAGGTGTGTGA 
CCTGGCTTGATATCGTTTACGGTCTTCAAGAATATTCGAAAAGATTCGTTGATTTTTGACAAACAGATCCGCTGAGGAAAAAACGACGTTCATTTTATCTTTGGCTTTATTA 
AGCTTGAAATACAATTATAATTCTTGTATTGTGTTTTACATTAGTTCAGAGATAATGCAGATATGGAGAATGGTACATTTATGCAAAGAGGGTATACAATCAACACCAATCT 
CGTTTTTAACAAGGCACATCTGTATGTTACGTATTTTTGCTGGATGTTTACCTTGGACTTAGGCTAATGGAGAATTACAAATTTGATCTCTCAGCTGGGTTCGGAGAACCTT 
ACAATTTTGCTTGCATAAACAGTAGATATGGCTCTGCATAAATTTCTTATACAACTAATCAGGTTGTTCGTGGTCACATGGCATTAAAAAATGAAAGAGAATTGGGCGGGGT 
GGGGTGGGTAAGGGTTTTCGTGGTAGTTCTGGGGGATAGGGACTTTCTCTTATAGTAGTGTCTTTCATAAAGTTCTATCCGTATGTGGATGCTACTTCGCGAATTTATAGAT 
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ATCGACTTTAAATGTTGATTACAGAATAATCTCATGATAGGAATATGTAGTCCTAAAATATGAATAAGTGATCTTTATCTACAGAAAGGAACGCAGTGTTCGCAATAATGCG 

AGGATCCGATCGAAATGAATATAAATGCGATTAGTCCTTCCTGATCTGCACGCATCCCGAGGTATTCAATGCTCTGCCTATGGCGGCAACTGCGAGTGCTGTGATCAGCGGA 

AGATGGTGGTTTTCTGAGGATCCACGCTCGAGCTGATCACTAGCGGCTTCTCCACCGATGGCTTCTCGTGCAGCGACAGATTGGCCACGCTGCTCGACGTGGTCTTGGTGGA 

GTACTGGCTCTGACGTTGGTTGCTGCCATCGCGAACACTCGAGCATCTGTGTTTGTGTGAAGTGGTGGAGTGGTTATGACGTCCATTGCGTGGGCAGCGTGGAATAGACAAT 

GGCGGTAAATCTGATTAGAATTGGTTCGTTTGAGTGCTTGGATTTTAAGTTAGGGAAAAGCAGACACATTAAGCGATCCTACAGGATATAAACAGGTTACCTTCACCAAGAC 

ACAAAAGTGGATGGTCATTATGATTAGTTTTCGAGGGGTTAGAGGTTATCAGGGTTATCTAATGGTTTCTTCCAACTGGGGTTAACTACAATGTCAACTACTTCTCCATTTC 

CAGAGGATTACCTGTGACCATTTTGATTAGATGGCTATCTTCGGGGTCGGACTCGGGTCGGTGGCATTGCTACCGGCCTTGTCAACAGCCTTAAGGTGCAGCTTTCTTGGCA 

GCACATTCGCGTCAGAAGTGATGGAAGTCGAGGTAGTGGTCGAGCTCAGGCCATTCGAGCGCCTTTTCTCCAGGAGTCTTCAATGAATAGCGATCGATGATGATCAAATTGA 

TGTTGAGACACATGCACATATGGACACACATGGTGGATGCAAACATAAGTTTGTGTTTAGATCAGGGCAATTAGGAATGGTACTTCTGTTTAGTAGCTACAAAGACGCGACC 

GGACTGAGTCACCGAGCTAGGCGGCTGGAGATTCAGTGCTGTGTTGGCTGTCAGTGTTCGGTTTAGAGTTAGTCTGCAGGCAGTGTCTGTTGGTGTTGAGCTTGCTTTTAAG 

CAATATCGAAAGCAGTCTGACAACGAAAATAAGTACAACATATAAGGTTTAAAATAATCACAGATTGCGCGCTGTCAGAACTCGAAGTGCATATGTATTCCCATAATCTTCA 

ATGGGTCTTCCATCTTAGTGCAAGAGATTTAAGGAACGTGGAATTAGACAGGAATCATATAACCCGAAATTTTCAATATTCTTGATTTAAGCGCGGGTTAGCAAAGGCCACA 

ACAAACGAGCAACCGAGCTAAAGGAGACCTTCTTACTGAAATCGACGAAATCGATTTAGACTTCGTGTGAATGAATACCACTGGTGGACCCATTGGCAAATGCGTTGCGGAT 

CACCACCAGCTTGCGGATCTGGTTGGAGGCGTCCATAAACCGCTGCTTCATCGAGAACGAAGTCGGTCCCATCGTTACGCTCTCAGTCTGCCGCAGACTGCGCACACGTATC 

GTTCTGAGAATAGTCGGGGTTACAGATATGATGGTGGACAGAGTCAGAGACACGACACACAAGAGGGAGTCACGAGTCGTGTGAATGGACAAACGCCCAGTGGGCGGGACAG 

GACATAACGTGGAGTTAAGAATAGAGCAACACTTACCTGTTTGTGATCAAGTGCTTGACCTTCTTCTTCATTACAAACAACATGAAAATGAGAAAGCCCTGCATCGCATTGG 

CCAGATCCGAGATATAAAAGAGCTTAGACCATCCCTTGTCGCTGCCCACCAAGTACGAGATAAGCTCCGTGAGCCAGGTGATGCCCATTATGAGAAAGAGGCGTAAGAACAA 

GCCGAATCTATAGTTGGACCAAGCACGATAAAAACGTCTTTAGGGGAAGGTGGATAAGAAATACTGAAAGGATGACACTCACTTGTCCTTCTCGGTTCGTAGGTTCTTGGTG 

CTGTTCTCACTCGCAATTATCCGGGCCATCTCGCGCTGCACGCCGTGAATCTTAATCGCCGTCATTATGAACATGATGGTGTTGGCAACAACGATCGCCAGAATGGGACCGT 

AGAAATAGATCATGGCTGCCCAGTTGCTCATATCCAGCCAGCAGTATACACCATCACCAATACCTGGCTTTAGGTTCGCGGGCAAATTGGTAAGTTGCTGGGCGCAATAGGT 

GAAGGCTAGGAAAACTAGGGCAATACCCCAAGAGTACAGCGAATAAAACAGGAAGCGTTTCTTCTCCTGGAATCGGTTAATTCCTCTCGTCCCTCGGAAATTGTGCCACAAG 

TCAAAGCTAATGACACTTAGCCACATGTAGGCACCCATAAAAAAGAAATATGCTGTGTATCCAAACACTTTGCATGTAACACCAGTGGCATCAACTTGATAGAGTTGCACGT 

AGCACAGCGAGGAGTAACCCACGGAGAGCCCTATTAAATAGCAGACCAAGCTCTTTCCGTGCTGGTTGCGGAGCTCGGGAATCAATAGGTAAACGGCTATGGTAAGCATCAT 

GAAGGGAATGGAGAACATCATGGCTAGTGCAAGGAATTTCGGTCGGATTAATAAAATGCTGTTTACAAGAAGTAAAGGTGATCGTTTCACTTACCATACGAATTGATAATCT 

TTACCGTGCTATGATCGTCCTGCATGTCACAATTGGCCGGATGTATGGTGTAGAACATGTCCGATTCGTTGACGTATGTCGGAACCATGCAGAACTCGTTCTTCCTGATTAG 

CAGTTTATCGTCCACACGGAGCATGGATCCGTTCTCAAAGAGTAGATAATCATCCCAATTGTTGGTCTCTGGCTGCAGAGAAAACATCTTGGGGCAGGGTCTGAAGCTTTGG 

ATAGCAAACTGCGTAAACATGTTGACCAATATCGTAGACTCGTTGCGGAAGGTTATATTGAGCATAGGCGGATCTGGCCACGTTCGGTTGTCCGAGTGGTCTATTATGCATC 

CGTCAACTTCAGAGTTAAAGACCTGCCGCCACGGGCAGCAGATATTCAGGCAGGACTTGAACTTGCAGACGCATCCCCTAACGTGTTCCCGAACCTCTATGCGCTCCGTTAG 

CGAGCTGTGTATGTAGTCGTACGTCCCGACCAATTCTGCGGGAACTATTGTGCCGTAGTAATCATAGCTTCCGTTGGGAAAGAGCCTATGGCCCGTCAGATTCACTGTATCG 

TAGAAATTACAGGGCACATTTGGATCGTCTATGGTGTGGTAGTGGTCTCTCTCGGCCATAACGTAAGTGCCGAAGGGAATCCCCGGAATCAACCGGAACACGACCAAGCACA 

GGACGCCCAGCAATGTAACTCCGCAGTATAGGGATCCTCCAGGCTGATGAATTTTCTTCGGCATAATGAAGGTTCTGAAATTCGTAAATCTTTATTAAATATTATTACAAAT 

ATTTGTTATTATTCTTTAACGTTTATTCTTTTAACGTTTAAATATTCAGTGGAATATAGTTATTACAGTTAGTCATCAGTCGCAACACGCAGCAATTAAGCCTTAGCAAGTG 

AAGTCACTAATTAAGACCTATTTGCCGATTAGTTAGTCGTTTAATGTTTATCTAAGTGAGTTCACTAATTTGGGCAATAGAGCAACATCTAGGGACCCATTAGCGGGGACGA 

GTGGTGAATGATCTATGCGGACAACAGGTTGTCGAAAAGAATTAAGGAGGCGAAACAGGATAGCGTCACAAGCAATTAGCAGCCAACGTGTCCCATGGGGAGGAACCAGGGG 

CGGACTACCCACGACCTGAGCGCCATTCTCCGCGGGACATCAATCATAAGGTCAACCACGCAGGGAGTGCAGACTGAGCAAACATTTATGGATTCGTGTTAAATTGATAGTA 

ACTGCCATTGTCTGTCAATAGAGGTGACAGTCTCCCCGAACTCCACTCCAAACCAATCCCGATCTGCCCACTTTTCAAGCGCTGCTAACGAGCCCGTCCATAAATTCATTAC 

CAGTGCGAAAAGTTATACCGCTCGCTATCTTATCTTGTATCATTAGCCTTTGGGGACCAACTAATTCATTGATCCGCTGATTGTTTGCCTTTGGGGGAGAAGCAGAAGGGTA 

ACCTGCGGCATTGTCAGGAGTGTGGGCCACTAAATTAGTCCGAATTTATGGGATGCGAACGGGTGGAGAAGGACAAACACATTAAACTCCGCACAGTTGGCTAATGGGGAAA 

ATGAATACTGGCAGTGGGTGGGGAAGGGAGAGCCAGAGCACTTATCTTGCGGTGGGAAATTAGCATAATTAATCGCTGAGCACATGCACTTGGGAAATTTCAGCGGACCAAA 

TTCAATCGTATATATCTGCGCACTTTAAATGTGCGAATGAGTGCTGCGGATGTTTG 

ISEQ ID NO: 166] 

Exon: 4208.. 3455 ' 
Exon: 3344.. 2771 
Exon: 2695.. 2501 
Exon: 2355.. 2207 
Exon: 1242.. 1001 

Start ATG : 4208 (Reverse strand: CAT) 
Transcript No. : CT33415 

ATGCCGAAGAAAATTCATCAGCCTGGAGGATCCCTATACTGCGGAGTTACATTGCTGGGCGTCCTGTGCTTGGTCGTGTTCCGGTTGATTCCGGGGATTCCCTTCGGCACTT 

ACGTTATGGCCGAGAGAGACCACTACCACACCATAGACGATCCAAATGTGCCCTGTAATTTCTACGATACAGTGAATCTGACGGGCCATAGGCTCTTTCCCAACGGAAGCTA 

TGATTACTACGGCACAATAGTTCCCGCAGAATTGGTCGGGACGTACGACTACATACACAGCTCGCTAACGGAGCGCATAGAGGTTCGGGAACACGTTAGGGGATGCGTCTGC 

AAGTTCAAGTCCTGCCTGAATATCTGCTGCCCGTGGCGGCAGGTCTTTAACTCTGAAGTTGACGGATGCATAATAGACCACTCGGACAACCGAACGTGGCCAGATCCGCCTA 

TGCTCAATATAACCTTCCGCAACGAGTCTACGATATTGGTCAACATGTTTACGCAGTTTGCTATCCAAAGCTTCAGACCCTGCCCCAAGATGTTTTCTCTGCAGCCAGAGAC 

CAACAATTGGGATGATTATCTACTCTTTGAGAACGGATCCATGCTCCGTGTGGACGATAAACTGCTAATCAGGAAGAACGAGTTCTGCATGGTTCCGACATACGTCAACGAA 

TCGGACATGTTCTACACCATACATCCGGCCAATTGTGACATGCAGGACGATCATAGCACGGTAAAGATTATCAATTCGTATGCCGTTTACCTATTGATTCCCGAGCTCCGCA 

ACCAGCACGGAAAGAGCTTGGTCTGCTATTTAATAGGGCTCTCCGTGGGTTACTCCTCGCTGTGCTACGTGCAACTCTATCAAGTTGATGCCACTGGTGTTACATGCAAAGT 

GTTTGGATACACAGCATATTTCTTTTTTATGGGTGCCTACATGTGGCTAAGTGTCATTAGCTTTGACTTGTGGCACAATTTCCGAGGGACGAGAGGAATTAACCGATTCCAG 

GAGAAGAAACGCTTCCTGTTTTATTCGCTGTACTCTTGGGGTATTGCCCTAGTTTTCCTAGCCTTCACCTATTGCGCCCAGCAACTTACCAATTTGCCCGCGAACCTAAAGC 

CAGGTATTGGTGATGGTGTATACTGCTGGCTGGATATGAGCAACTGGGCAGCCATGATCTATTTCTACGGTCCCATTCTGGCGATCGTTGTTGCCAACACCATCATGTTCAT 

AATGACGGCGATTAAGATTCACGGCGTGCAGCGCGAGATGGCCCGGATAATTGCGAGTGAGAACAGCACCAAGAACCTACGAACCGAGAAGGACAAATTCGGCTTGTTCTTA 

CGCCTCTTTCTCATAATGGGCATCACCTGGCTCACGGAGCTTATCTCGTACTTCGTGGGCAGCGACAAGGGATGGTCTAAGCTCTTTTATATCTCGGATCTGGCCAATGCGA 

TGCAGGGCTTTCTCATTTTCATGTTGTTTGTAATGAAGAAGAAGGTCAAGCACTTGATCACAAACAGAACGATACGTGTGCGCAGTCTGCGGCAGACTGAGAGCGTAACGAT 

GGGACCGACTTCGTTCTCGATGAAGCAGCGGTTTATGGACGCCTCCAACCAGATCCGCAAGCTGGTGGTGATCCGCAACGCATTTGCCAATGGGTCCACCAGTGATTTACCG 

CCATTGTCTATTCCACGCTGCCCACGCAATGGACGTCATAACCACTCCACCACTTCACACAAACACAGATGCTCGAGTGTTCGCGATGGCAGCAACCAACGTCAGAGCCAGT 

ACTCCACCAAGACCACGTCGAGCAGCGTGGCCAATCTGTCGCTGCACGAGAAGCCATCGGTGGAGAAGCCGCTAGTGATCAGCTCGAGCGTGGATCCTCAGAAAACCACCAT 

CTTCCGCTGA 

(SEQ ID NO: 167) 

Start ATG: 1 (Reverse strand: CAT) 

MPKKIHQPGGSLYCGVTLLGVLCLVVFRLIPGIPFGTYVMAERDHYHTIDDPNVPCNFYDTVNLTGHRLFPNGSYDYYGTIVPAELVGTYDYIHSSLTERI EVREHVRGCVC 
KFKSCLNICCPWRQVFNSEVDGCIIDHSDNRTWPDPPMLNITFRNESTILVNMFTQFAIQSFRPCPKMFSLQPETNNWDDYLLFENGSMLRVDDKLLIRKNEFCMVPTYVNE 
SDMFYTIHPANCDMQDDHSTVKIINSYAVYLLIPELRNQHGKSLVCYLIGLSVGYSSLCYVQLYQVDATGVTCKVFGYTAYFFFMGAYMWLSVISFDLWHNFRGTRGINRFQ 
EKKRFLFYSLYSWGIALVFLAFTYCAQQLTNLPANLKPGIGDGVYCWLDMSNWAAMIYFYGPILAIVVANTIMFIMTAIKIHGVQREMARIIASENSTKNLRTEKDKFGLFL 
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RLFLIMGITWLTELISVFVGSDKGWSKLFYISDLANflMQGFLIFMLFVMKKKVKHLITNRTIRVRSLRQTESVTMGPTSFSMKQRFMDflSNQIRKLVVIRNAFANGSTSDLP 

PLSIPRCPRNGRHNHSTTSHKHRCSSVRDGSNQRQSQYSTKTTSSSVANLSLHEKPSVEKPLVISSSVDPQKTTIFR* 

(SEQ ID NO: 168) 

Name: mth-like 10 

Classification; G_protein_linked_receptor 



Celera Sequence No. : 142000013384577 

CCAGACATGCTTCCTCGATGGGGCTTAACGTCAGCGGAGCAGCGATGTCAGTGGCAGCCATCATCTGAACCGAATGACAGTGGAGGTCGCTAGAAAAGGGGTTCCATTTCCA 

CTAGTTTATACTACCGAACTGATACTTAAATCAAGGCAGTTAAAAATTTAATATGAAACCGGAAATGGCTTCAACATCTCGTTAACGCTCTGCCCAATTCTACAGTCATCCG 

ATTTTAATGAAGCAAAATTTACTGCCATGCCATGCAAGAGATATGAATCACCGCATGGGAATAGCCCACAAAGGACCCTTACCCCATGCACTCTATCATGAACACCGCACAC 

ACTTTTAATCTGCGACTAATTGCTGCTCCGTATTGAATTGATTTCGGTTGCCCCAAAGTGCCACATTCCAAATCTCCGTTGATTGGGGAAATCCCCGAAAAACATTCGAACC 

ATTCGTCCCTGGCGAGCACAGAAACAGCGTCTCTGATTCTGGCAGGAATTGGAGGGCATAGGGCGGACCTTGTGGCGGTTTGTCGAGTGCACAAAGTGCAAGGTTGCGTGGC 

CATTCATTTCGCAGCCGATGGGTGGAAGTGGGCAGCGGTGGGTGGCGCCCGGAAGGAAGAGCATCAGTAATCTCTAAGTGCGCCTTGTTCGCATTCGTACTCGTGCCAGCAT 

TGCATAAGTGTCAGTGTCAACGGGGCGGGGATGTGGTTGTGCTTGGTGGTTATGTGGGTGTTTGTGTGGTAGTAGTGTCCTTGTTGGCAGACAAAAAGACTGATGATGATGG 

CATTTTGTTACTTTGCTACCTTCCTTTGTCGCCCGTCGCACGGCAGCTCCTTTAGTATGAAATGAGGCGACCAGGCAAGCGCCTCGTCATCGAAGATCACCCAAACAGAGTA 

CGGACCAGGACACGGCAACTCGCACAGGCACAGGAACAGGTGCAGGCACAACAACTGGCAGGAGCATCGGTGTCGGCAAGGGATACTGGCGCGGGATCGAAAGCATGAACAG 

GGACAATCTGCAGCAGTGGTGGGAGAACTCGTATCGGCGCCAACATCCAGAGCCCACCGATGACCTCGGCCTGGACTCCGCGGAGCTGCATCTCGCCCTCCAGGAGCCCAAC 

CAGCTGCCCGCGGACTACGACTATGGCAACTTTAGTCTAGGCAATCCGTATGACGTGGACTCAGAGCACTCCATTTCCCCGCTGACGCTGCTCCTGCTCGCTGTTAGCTATG 

GCCTGGTGGTTTTCGGCGGCGTCGTGGGCAACTCCACGCTCGTTCTGACCCTCTGCTCCGCCTCTTCGGTGCGTTTGCGTAACCCGCTGCTGCTGGCCGTCTGCATTGCCGA 

TCTCCTGGTCACCGGCATCTCCGCTCCGGTCACGCTCCTCAATCTCGCGATGAACCGGAGGACGCGATCGCTGCCCCTTGTGCTGTGCAAGGTCATACACTACGTACAGGTG 

AGTGGGTCAATCAAACAAGCTTAAGTTAGCCAAGTAACTAAGTTAACCCCACGCTATTCAGGTCATGCCCGTTTCGGCCAGCACTATTTCGTTTTTCATGCTCTCCCTTGAC 

CGCTACGCCACAGTAAAGCATCCGCGATTGGCGCAGCTTCGCCAGCGGCGCTACTTGCACGTCTCCCTGGCGCTGCTATCCTGGCTCGCCTCTGCCGCCATCAGCACTCCCT 

TTCTGTTTGCCTACAAAATCATCGCCAAGTCGATGGTGGTCAAGGGCGGTGGGGCGGCGAACACCACGCCAAACCCGGTCAGCATCAGCTGCACCTCCGACCTGGGCGCCAA 

CGCCATGTTCATGTCCTTCATCATCTTCCACACCATCGCCGTGTTCGTGTTGCCTGGCATTGGTGTGCTACTCAACCACTACGGCGTCCGACGCAAGCTCTGCGCCCTGTCG 

CTAACGGCTCGAGCTGCCCATGGAGAGCTGCCGCTGCCCATTCCCATCCTGCGACGCCAGACCCACATGGTCATTGTGACGGGCTGCCCCAACGCCCAGCAAGCGGCCTGTG 

GTGGGGGCACCACGGCGGACGACACCTCGAACGGGAACGGCACGGGCACCGGTGGCGGTCCCATGGCCGTCAGTCCGGGCGACATACAGCTTCACACACTGCAGCCGCGCCA 

GCCAGGCTCTGCCGGATCGGCCCTCGAACCGGGCTCCTACCGCTCTAGCAATCCCATATCACCCAGGTAAGTCATAAATATTGTGAAGTTATCAGCAAAATCGTTGTAAACA 

GGATGTATGCTTGATGTAGTCAAAAGCTATTAGTTTATGTTAATCATACGCCCCGTTGACTTTGGTCAGAATATATTTCATCGAATGGCAGTACGTCATTTGGTATAAGTTA 

ACTAACTTGCAAACCCACTTAGGGCCATGAGGGAGATTCGCGCCCACTCGCAGCGCCAGCGCATCAACCGAGCAGGCAGGGGACCAGCGACACCCGGCATCCCACTGCCCCA 

AACCTCCACGCTGCGGTCGCGAAGGCACCTGGCTAACATGCTCATCGCCTCGGCCGTGATCTTCATTGCCTGCTGGGCACCGCACGTCTTCTGCATCTTCTACAAGAACTTC 

GGCAACAATCAGCAGTGCAGCCAGACCTCTGTTTACTTCAGCCTTCTGCTAGGTAAGTCTGAATTGGCAATCATTCCCAATCCGAATCCGAGTCCGAATTGGAACTGACTAG 

TTGATCGCGTTGCAGGCTACTTCTACTCGGCCATCAGTCCGGTGATCTACTGGGCGCTCAACCACAATTCGCTTCGACAATCTCCCTGCGCGCCCATCATCCGCCTGCGCTC 

CATGCAGAACTTCCTGAGGTCGCGCTTCCGGACGCACACCGCCCCGCCGCCTCCCTCGTCCACCAACGAGGCGGCACTGGGGGCATTCAACCCCAAGCTGATCAAGCTAACA 

CCGAAACAGTATAGAGCTCAGGCCTCTTCGCATTATCTCTACTAGTGTGATACTAAGCTAAACTTGATATACATATATACACATTTATATACACGCATATATAGTATATTTT 

TTACCTACGACCCCACCCCCCCCCTGACCCGTCGATGCCCTCTTAGTACCCTCTAGTATCGAACTCCGAACCCGAACTTTGACCCAACCGAACGAAACTATCTAGATGTTTT 

TCAAAAGCTCTACGCAACAATTGCATAATACGAAAGATACATGCTACATATACACATATATAGAGTACATACTACCTTGCATAGCAGTGTTAGTCGACGCTTATTTACACGA 

TGTTGTGCGCCTGAAGCTATGCAATAAGGACACCGGAGTCCTGCCGCAACTCTTGCCTATAACGAGGATAGCACTGGTAACGAAAGCAAGAAAAAAAAAAGGAAAAAAAATA 

CAAAAATAACGAAACCCTTCTAGTGACTTGTTCGAATTGTTGTTTAGTCTTTTAAGCTGAAATATTGTACTATGTTCCTTCTCATAGATAAGCAATAAGTGAACGCTTTAAG 

CCAAGTTAAACAAACGGAAAACGAATCAAACTAAACTAAAACTCGATCTAAAACTTTTAAACGATTTAAACGTAAACTCGAAAGCATACCTAACGAAACATTAATAATTATA 

TACATACGCGACTACATACAAATAACAACTAACAAGTGAAAACTAAAAACAAAAAATGAATGTAAATGTGAGCAAAAAAACAAAAGGATAACATTTAGTATTGTATACATAT 

TCTTCGACTTAATTCTTTATACACTTTTTCTAACCATTTTTAATATCGTTCAGTAAACTGAACACGGTGTATAGGGCAATTCAAGCTGCTGGATATATTCGCGTTTCAGATG 

CCAGAGCCACCCAAAGGATCGGACTCTGGATCGGGTTTCATTTCACTCCATTCAGTGGCGACCGTTTCCTAATTTGCCAAACTTTTGTACATGCTCCGCTGCGAGCGACTCT 

CTCCTTACTCCACCTGGTTTTTCGTTCAATCCTTGGA 

(SEQ ID NO: 169) 

Exon: 1001.. 1453 
Exon: 1518 . .2194 
Exon: 2375.. 2628 
Exon: 2704.. 2957 
Start ATG: 1001 

Transcript No. : CT33551 

ATGAACAGGGACAATCTGCAGCAGTGGTGGGAGAACTCGTATCGGCGCCAACATCCAGAGCCCACCGATGACCTCGGCCTGGACTCCGCGGAGCTGCATCTCGCCCTCCAGG 

AGCCCAACCAGCTGCCCGCGGACTACGACTATGGCAACTTTAGTCTAGGCAATCCGTATGACGTGGACTCAGAGCACTCCATTTCCCCGCTGACGCTGCTCCTGCTCGCTGT 

TAGCTATGGCCTGGTGGTTTTCGGCGGCGTCGTGGGCAACTCCACGCTCGTTCTGACCCTCTGCTCCGCCTCTTCGGTGCGTTTGCGTAACCCGCTGCTGCTGGCCGTCTGC 

ATTGCCGATCTCCTGGTCACCGGCATCTCCGCTCCGGTCACGCTCCTCAATCTCGCGATGAACCGGAGGACGCGATCGCTGCCCCTTGTGCTGTGCAAGGTCATACACTACG 

TACAGGTCATGCCCGTTTCGGCCAGCACTATTTCGTTTTTCATGCTCTCCCTTGACCGCTACGCCACAGTAAAGCATCCGCGATTGGCGCAGCTTCGCCAGCGGCGCTACTT 

GCACGTCTCCCTGGCGCTGCTATCCTGGCTCGCCTCTGCCGCCATCAGCACTCCCTTTCTGTTTGCCTACAAAATCATCGCCAAGTCGATGGTGGTCAAGGGCGGTGGGGCG 

GCGAACACCACGCCAAACCCGGTCAGCATCAGCTGCACCTCCGACCTGGGCGCCAACGCCATGTTCATGTCCTTCATCATCTTCCACACCATCGCCGTGTTCGTGTTGCCTG 

GCATTGGTGTGCTACTCAACCACTACGGCGTCCGACGCAAGCTCTGCGCCCTGTCGCTAACGGCTCGAGCTGCCCATGGAGAGCTGCCGCTGCCCATTCCCATCCTGCGACG 

CCAGACCCACATGGTCATTGTGACGGGCTGCCCCAACGCCCAGCAAGCGGCCTGTGGTGGGGGCACCACGGCGGACGACACCTCGAACGGGAACGGCACGGGCACCGGTGGC 

GGTCCCATGGCCGTCAGTCCGGGCGACATACAGCTTCACACACTGCAGCCGCGCCAGCCAGGCTCTGCCGGATCGGCCCTCGAACCGGGCTCCTACCGCTCTAGCAATCCCA 

TATCACCCAGGGCCATGAGGGAGATTCGCGCCCACTCGCAGCGCCAGCGCATCAACCGAGCAGGCAGGGGACCAGCGACACCCGGCATCCCACTGCCCCAAACCTCCACGCT 

GCGGTCGCGAAGGCACCTGGCTAACATGCTCATCGCCTCGGCCGTGATCTTCATTGCCTGCTGGGCACCGCACGTCTTCTGCATCTTCTACAAGAACTTCGGCAACAATCAG 

CAGTGCAGCCAGACCTCTGTTTACTTCAGCCTTCTGCTAGGCTACTTCTACTCGGCCATCAGTCCGGTGATCTACTGGGCGCTCAACCACAATTCGCTTCGACAATCTCCCT 

GCGCGCCCATCATCCGCCTGCGCTCCATGCAGAACTTCCTGAGGTCGCGCTTCCGGACGCACACCGCCCCGCCGCCTCCCTCGTCCACCAACGAGGCGGCACTGGGGGCATT 

CAACCCCAAGCTGATCAAGCTAACACCGAAACAGTATAGAGCTCAGGCCTCTTCGCATTATCTCTACTAG 

(SEQ ID NO: 170) 



MNRDNLQQWWENSVRRQHPEPTDDLGLDSAELHLALQEPNQLPADYDYGNFSLGNPYDVDSEHSISPLTLLLLAVSYGLVVFGGVVGNSTLVLTLCSASSVRLRNPLLLAVC 
I ADLLVTGISAPVTLLNLAMNRRTRSLPLVLCKVIHYVQVMPVSASTISFFMLSLDRYATVKHPRLAQLRQRRYLHVSLALLSWLASAAISTPFLFAYKI IAKSMVVKGGGA 
ANTTPNPVSISCTSDLGANAMFMSFI I FHTI AVFVLPGIGVLLNHYGVRRKLCALSLTARAAHGELPLPI PI LRRQTHMVIVTGCPNAQQAACGGGTTADDTSNGNGTGTGG 
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GPMAVSPGDIQLHTLQPRQPGSAGSALEPGSYRSSNPISPRAMREIRAHSQRQRINRAGRGPATPGIPLPQTSTLRSRRHLANMLIASAVI FIACWAPHVFCIFYKNFGNNQ 
QCSQTSVYFSLLLGYFYSAISPVIYWALNHNSLRQSPCAPIIRLRSMQNFLRSRFRTHTAPPPPSSTNEAALGAFNPKLIKLTPKQYRAQASSHYLY* 
(SEQ ID NO: 171) 

Name: GASTRIN /CHOLECYSTOKIN IN RECEPTOR like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384577 

GAATGATAACTAATTCAATAAAACTACAGATTTCAATAACTAAAGCATACATGTATATGTATCTATGTATCTATGACTATGTATCTCGAATCCTCCGCTGAAACTCTAAACT 
ATTCCCATGCAGTTTCTTTTCCTTCGTATCGAGTTTCAGAGCACGGCTTGTAGTGTTGCACGTTTTTCGTATTTTTTGGCAGCTCTGGCTGGTTTTTTTCATTTTCCTTTTT 
TTTTTGTACTCCCATTTCCTGTCACCCTGCCCGGTGTAATTAAAAAAGTTTTTCTCGACA1TTGCTTGCCATGTGAGGCTCCAGGAACAAATGCGTTACATTGTGGACCATC 
CGATTGTAAAATAAATTTTAAGTGCAATACGAAGTGCAAACTCAGCCGTAACAATAAGATTTTTGCAATAATATTTAACTTTTAATTGTAACTTTGGTAAAGTCAATGTTTT 
CTTTTTCCAAGTTGGCCAATAAAAAGAGCATAAAGCAAAGCGAAAACTTTAGTGCGTACTTTTATTACAACATTAACTTCATGCTATATATTTTATTTAAATATTTTGGCTT 
TTCTATCCATTGTTTTTCAGTTAATATCTTAAACAAACAATTTTTGATGCACTACAAATGCTGTTGATTTTAATAAACGATTTATAGCTTACATGGATTATTAGCTATGTAC 
ATATAATTACGAATATTTTTAATAAAGGGCATACTTAATAAATTGAAATTCTGGAAAACACTCATTTATTTAACACTAGCAAATTAGCGCACAAATAATGTGCAAAGTTCGT 
TAATCTTTAAAGTGTATTTGTGACCACAGCACAATTAAACGGTGGTCCGAAATTCTATATCTTCTAATTCCGCATTGTTTTTTTTTTTTTGTGAATTTTTCAGGTAACCTCC 
TTGTTATTTTGGTGGTCACATTATCACGACGACTGCGCTCAATAACCAACTTTTTCCTCGCCAACTTGGCGTTTGCCGACTTCTGTGTTGGACTCTTTTGTGTGATGCAGAA 
TCTGTCCATCTATCTGATAGAAAGGTGAGTGACAATTGGCAAAAACAACATTATTTCAGTATTAAATTAACGTTTCATCATGCACGAGCGGAGCTGCAAAATTCTCGGAAAG 
CATATACGAAGAATACGTGAACTAACTTCACTGTAAAGTTGCCAATAGTTTTTAATGTCCGTTTTCATACGCAAAAAATTTAAGTGAAGCATTTAAGTGAAGCATTTATAGA 
AAAGTTTTTAGTTATAGTAGGTTAAAAATTAACACAAAAGTTAAAGTAGCGTTTCAGCTAGTACAGATAGTAGAATGTTATGTGGTCGATTAAATTTATTTATTTATCAGCT 
GCAGAATGTGGCTGAAACGGTATCTGAGTGCATACGTATGTATGCATGTGCCTCGAAAACAAAGCCCCAAAAAAAAGTTTCAACTTTCAACCGTGAAACATTTACGCCGGCT 
GTCACATAAAGGAAACTAAAACTCCGCAGGCGTCACTTTCACTTCAGGCTTTTCAGGCTTTTATCCCCACGCTCCGCTTTCACTCCAATCATGTTTATTTGCATTGTGTTTC 
CTTAACTTTCCCAAAAACACGTCGCAGTGACGTGTTCACTTAAAATAAAGCGTGGAGTGCTGCTTGAAAATTCAAACAAAAAGAAGCCTTTTTCAGATTTGCAGGGCAAATT 
AAAAAAGTATTTTGACAAATGAGTTTGGCTTACAGTTTCTATAAGTTATTTAATTTAAAATTTTAGTCGCAGTGTCAGAAATAAATAATGCTTCATTAATTTAAATTTGTGA 
AATGCATGTGTTTAATATGTGTTTAATTAAAAGCTATGTTCTATACATTTAGAAAAAAAAGTAATTACCTATCTCATTAGAATTCTTGAAATGACCATTTATTTATGCACTT 
AATAATAAACAGACATAATGATTTATTTGCATTCCCGATGTTAAATCCTGAAATTGGTTTGCATATTTTCCATTTCATTTTCTAATATGCTTTTTGTGGCCAAAGAACTAAA 
TTATTGAGATGACCCATCTGTGTTTGGCATAATTTTAGCTAAATTATGCGCACAACTTGACGCCCACCAGAAATCATATGCAAAATGAGATGAGATTACGAAAAATCATAAT 
TTATGCATTTATAATTTATATGTGCGGCATACGGCTGCCAGTGACAATTTAATTTAATGATTATAAACATCAAGTCGTTTTAAGAAAGTTTTTAGTTCATTATGTTAAGGAA 
TGATTTGTCCCTTTTAAAGAGAGAAATTGAATCACCAGGCAAATGGGGAATCTTTTTAAGAAACGCATTATCCAAAATCTTGGCGACAACCTTCAGGATTATTATGTTTATT 
ATAGTATTATTGGGCACATTTAAACGCATGAGTGCGAAATTACGTGGCATTGCTAATAACGAGGCACAAATGCGGTTTTCGAATGTAATTTCAGCCGAAATAGTAATAAAAA 
GGAAAAAGCTTTGCGCCTCATAAAATACTAAAGACACCTTAATGTCGCATTAGTTTTGGCGGGCGTGTCATAACATTTTATGTGGCTTGTGCTAAATTTGGCTCTGCCACCA 
ATATTGCCAAGTATATGCTCTTTAAAGAGCGAAACTTTTGCCCATAAATAATTATTTATGTCCTTTCAGCTGGGTCTTTGGCGAGTTCCTCTGCCGCATGTACCAGTTTGTC 
CACTCCCTGAGCTACACGGCCTCCATTTTCATCCTGGTTGTCATCTGTATGGAGCGCTACTTTGCCATCGTACACCCCATTACCTGCAAACAGATCCTAACTGCGGCTCGTC 
TGAGGGTAAGTTCTAACCATAAACCTCATATTCACGGCCAGTAATAGGCAAGTATGCATTTTAAATGAGGTCTATGTGTATGTGCCCTGCTTTTTTCAATTCCCATTTTGAT 
GGGTCAAAAGCGGAGGAAATCTGTTTATGTATTGTATTTTGAGACATTCTCGCAACTTTAATAACTTCATTTGAATACAATTTGCAAGCTAATGAAACTTATTAGACCAAGC 
CCGGAGACTCATGCAAATTAACCGGCAGAACAGAACCGACTCGAAAGGCTTAATAAGCCGTGGGCATAACTTTGTGGCCGGGAAAACTTTTCCGCACTCGAAGGGTGAAGTG 
GAGGGAGCGCAGCCAAAGGGTTGAACCACATAGCCCGGTTGTTTCATCACAGCCAGACAAAAAATACGTTTCGGTAATGACGATTGCACTAATTAAACTACAAGAGACATAA 
AGACAACGAAATGGGCTATCCATTAACATGCAAATTCAGTGTGTTCATTCGCAATTTAACAAATTGTTTGCGTGTTTGGCAGCCGAGCAGACAAAACAAAACGTCAATGCGT 
GTTTTTTTTTTTGCAGATGGTCATTGTCACAGTTTGGATTACATCGGCAGTTTACTCCACGCCCAAGTTTGTTTTCAGCAAGACCATTAAGAACATCCACACGCAGGACGGG 
CAGGAGGAGGAGATATGTGTCCTGGACCGTGAGATGTTCAACTCCAAGTTGCTGGACATGATTAACTTTGTGCTTCTCTACGTTATGCCGCTCTTGGTGATGACGGTAAGTG 
TAACAAATGATACGTAGCTTGAAGTTAATAATTATGTTTAACCAATCCTTGGATAGGATGTTCATAAATTCATTTTCAATCCAAAAGTGTTTCCTAGTCACGTTACCTAGCA 
TTGAGCAGACTATAATTAATAATTAAATGTATTTTTTTGAATTGCTTATTAAAGTGATTAATTATATCAATAACAACTATCTCTCTAATGAGATAGAATATAAATAATTTTG 
GGTACATAATAAATAATCTATGAAATGATTTTTCTTCGTCTGTTAAAAAAGTGATTAATGGAAGTGTAGATTTCCCTTTATTTTTTTAACGACTCCATTTTTCAATTTCTTC 
TGAGAAATTTGATAAGAATTGAGAAGCTTTTTGCATAAAATCGCCCCTTGTTCCGCAATGAAATTTGAATAATGAATTTAACGAACCCATTTGCATTGCAGGTGCTGTACAG 
CAAAATCGCCATCGCCTTGTGGCGCAGTTCGCGCGGCCTCACGCCGCATGTGGTGCAGCATCAGCATCAGCAGCCGCAGCAGCCATCCTGCCAGGACATTGGCATGGGTATG 
CACAACAGCATGTACCATCACCATCCGCATCATCATCACCACCACCACCAGCATCACCAGCTGCAATCGGCGGCTTCGTCGGCAGGAGTGGTGGGAGTGGGTTTGGGTGGTG 
GTGGTGGGGGCGGTCCTGGTCCGTCACTCGCTTCCGGCGGCAGCAGCACCACGTCCTTGTCCCGCAAACAGAGCAGTAAATATGAGAAGCGCGGCGTCAGCATCACGGAGAG 
CCAGGTGAGCTGCCAGGAGAACACTGCATCAGAAAAATCGCCGGTTGAGCACGGAGGAGTAGCCTTAAAAGCCTAACAATAGCAATAAACCGAAAGTGGAGTCGAACCTTTG 
TTCCCTAATTGGCCAGAACGGCTTTAACTTCGATTGAGTCCGCACGTGCTGTGTGGTAAATAATGTTTTTTGAAAATCACATCGCTATCGGAGCACTAATTTATTAGATACA 
CCATTTGTCGGCCTGGCCAAGTTGGCAGTCGTAAAAACTTTACCCGTAGCTGTTACTCCCGCACTTTCGCCGGGCCAAGTGGCACGGTGCTCGGAGCAGGTGTCAATTCACA 
TTAAGGCCATCGCAGGACCATGCCACAAGCTCGGCCACCGATGTGACACCTGGAGTGTGGAGTGGTGGAGCGAAGTGACCACCGGCACAGGGCGAAACTGCAAGTGCGAGGC 
CAAAGTTATGGCAAAAGTGTTTTTTTTTTTCCATTATCGGTGGCCGGACAGCAGGTTCAATTGGTCGGGTGCACCATGGCGTATTTCTTATTTCTTATTTCGAGCTCGACAG 
CAGCTTTGCCACACAAACTGGCACACACATCATTGTTCTTGATCAGCATCAGCAGCTCACTTTGTCAATAAAAGCCGGCGGATAAAAATCGAGAATTTGTGACACGATGACA 
TTCGGAAAAGTTGTGCAACTGCCGCAGATTGAAAATTGGTAAACACCAAAGGCAAGGCCCTTGAATTTTTTAATATTCCTCGAAGTATCAAATGTCAGGGGAGCACTGGTTC 
GATTTCGGTATGTTTTCCGTTTTAAAAGTCACAAAAATGGAAGTTTGGTAGGAAAAACAATGGCGGGAACACCTTTATAAAACTTTTACATAACTTTAGTTTACGCTGTTCC 
TTAACTTATGACCCCGATATACACAGATAACAACCTAACTATCTTTCTGTTGAACTCTTTTCTCTCTCTCTACCTTTCGTTTGATCAATAAAACTCTGTGCTCTCCCCCTCT 
ACTAACTCCCCAATTATAACAACAACCACACACAATCCCACACCACAAATTAGCTCGATAATTGCAAGGTAGGTTGTAATCGAATATCAAAATCGATATGATAATGTTCAAC 
AACTGTTGCGGCCAGCGACCTAATGCCATTACACCTGTTAATCCGCACAGGTGTCCCTGGAGGCCGATCGTCCCATTGTCAGTGCCTGCCGCAAGACGAGCTTCTACCACCA 
CGGCCACGCCCATCACCAGCGGGCGGGCAACGCCTCAGTGGGCGGTGGCAGCGGCGGAGCCGGGGCCGGAGCCACCCACATGTCCCACTCGTCCAGCAACGTCCTACGCGCC 
CGGC GCGGCGTTGTCCGCATGCTGATAATATTCGTGCTGACATTCGCCCTCTGCAATCTGCCGTATCATGCCCGCAAAATGTGGCAGTACTGGTGAGTAGTCCCTTCGTCCA 
CCCACCACCCATAATCACAATCCAATGCGAATCGGAATCCCAATCCCTATCCCTATCCCTTACCGCAGGCCATTCATCAGCCCACACGGACTGAGTGTAAACAATATTTTAT 
GGTATATTGCACCGGCAATCGGCGGGAATTTGCATAGTCCTCCGTTGGACGGGAAATTAACATCTGCGAGTGGGCAGATAAGCTCTAGATGGCTTCCTGCTGGAATCGAGGC 
TAGTAAACAATTAAGTCATCAATGGACAGAGCTGGAGCTGCAGCCAGCTTTGCCGCTGGCCTAAATGATTTGAAAGCGGTCTTCCGTTTTCCAATCCCACTCCCTTCCCAAA 
GACGCCAGTTTCGCTTTCTTCTCGCCTTTCAAAAGCGGTCGCCAAGAAAAGTCCGTTTTAGCCAGCCCTTTGGCTATCAGTTCCGCTTCCGGTGGAGCACTTTCGAATAGGT 
GGACGGACCGCTTAGTTTGCTTTGAATTCGGGCAATTTGGTTTTTCCGGCTGGCGGGTGTGGTGGGCGTATAACTTGATAAGTGAGTGGGCGGTGGAAGTGTTTTTGTTTGC 
TAAATCGAGGAGGAGTTAATTGCTCCCTGGGTAGAATTAAAAACGTGCACTTAAGCTGCTGACTGCTGTAAATTGGAAATAAAATAAAAGTCACAGAAATGTCCTTTAAGCG 
AAAGTCGAAAGACAAACGATTGATTGAAGATGGGAAAAGTCATCAAGTGTGAAAATCTGGAGACATTGGCAATTTTGGCACGTAAGAGAAACGTTTTTATTTTGAACTTTTC 
ATACAGATTCTACTAAACCAAACGTTTTTGAGTCAATTGTAAATTGTAAAGCATTTTACATGCCGATGTGCAATAAATCCAATTTAATGTTCATAAGTTCATGTTTTTGGTG 
AATAACTACTGAAATAAATATCGATCTCTAGTTCTCATTAGTGCAAATATTCAGTGTAATTTTGTAAGAAAATACTTCGCTGCAGATTTGCAACGTTGCACTTTTGGCAAAC 
GCAACGAAAACGAAAATGTTTTTGAATTTTTGTTCGAGCACTCAATTTGCACAATTGTAAAACGAATGGCGGACAAATAAATATAAATAATATCTGGCCAAATACAAGAGAA 
ATGGATTTCCACATCGCCACTTTTCCCGCAGTGCCATAATTGAGATTTATGTCTAGCCCGGAATGGGCAGCGAGACTTGTTTACTTGTTTGAACTTTCAGCTTTTATTTATT 
TGAAGAATTGGTTGCCCCCAGGGATATTGGTATTGTTAGGTGGAATTGAAAGTGTT 
(SEQ ID NO: 172) 

Exon: 1001.. 1032 
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Exon: 2646.. 2805 

Exon: 3377.-3577 

Exon: 4022.. 4372 

Exon: 4614.. 4820 

Exon: 5539.. 5804 

Exon: 5893.. 6112 

Start ATG: 1001 

Transcript No. : CT33559 

ATGCAGAATCTGTCCATCTATCTGATAGAAAGCTGGGTCTTTGGCGAGTTCCTCTGCCGCATGTACCAGTTTGTCCACTCCCTGAGCTACACGGCCTCCATTTTCATCCTGG 

TTGTCATCTGTATGGAGCGCTACTTTGCCATCGTACACCCCATTACCTGCAAACAGATCCTAACTGCGGCTCGTCTGAGGATGGTCATTGTCACAGTTTGGATTACATCGGC 

AGTTTACTCCACGCCCAAGTTTGTTTTCAGCAAGACCATTAAGAACATCCACACGCAGGACGGGCAGGAGGAGGAGATATGTGTCCTGGACCGTGAGATGTTCAACTCCAAG 

TTGCTGGACATGATTAACTTTGTGCTTCTCTACGTTATGCCGCTCTTGGTGATGACGGTGCTGTACAGCAAAATCGCCATCGCCTTGTGGCGCAGTTCGCGCGGCCTCACGC 

CGCATGTGGTGCAGCATCAGCATCAGCAGCCGCAGCAGCCATCCTGCCAGGACATTGGCATGGGTATGCACAACAGCATGTACCATCACCATCCGCATCATCATCACCACCA 

CCACCAGCATCACCAGCTGCAATCGGCGGCTTCGTCGGCAGGAGTGGTGGGAGTGGGTTTGGGTGGTGGTGGTGGGGGCGGTCCTGGTCCGTCACTCGCTTCCGGCGGCAGC 

AGCACCACGTCCTTGTCCCGCAAACAGAGCAGTAAATATGAGAAGCGCGGCGTCAGCATCACGGAGAGCCAGTTGGCAGTCGTAAAAACTTTACCCGTAGCTGTTACTCCCG 

CACTTTCGCCGGGCCAAGTGGCACGGTGCTCGGAGCAGGTGTCAATTCACATTAAGGCCATCGCAGGACCATGCCACAAGCTCGGCCACCGATGTGACACCTGGAGTGTGGA 

GTGGTGGAGCGAAGTGACCACCGGCACAGGGCGAAACTGCAAGTGCGAGGCCAAAGTGTCCCTGGAGGCCGATCGTCCCATTGTCAGTGCCTGCCGCAAGACGAGCTTCTAC 

CACCACGGCCACGCCCATCACCAGCGGGCGGGCAACGCCTCAGTGGGCGGTGGCAGCGGCGGAGCCGGGGCCGGAGCCACCCACATGTCCCACTCGTCCAGCAACGTCCTAC 

GCGCCCGGCGCGGCGTTGTCCGCATGCTGATAATATTCGTGCTGACATTCGCCCTCTGCAATCTGCCGTATCATGCCCGCAAAATGTGGCAGTACTGGCCATTCATCAGCCC 

ACACGGACTGAGTGTAAACAATATTTTATGGTATATTGCACCGGCAATCGGCGGGAATTTGCATAGTCCTCCGTTGGACGGGAAATTAACATCTGCGAGTGGGCAGATAAGC 

TCTAGATGGCTTCCTGCTGGAATCGAGGCTAGTAAACAATTAAGTCATCAATGGACAGAGCTGGAGCTGCAGCCAGCTTTGCCGCTGGCCTAA 

(SEQ ID NO: 173) 



MQNLSIVLIESWVFGEFLCRMYQFVHSLSYTASIFILVVICMERYFAIVHPITCKQILTAARIRMVIVTVW1TSAVYSTPKFVFSKTIKNIHTQDGQEEEICVLDREMFNSK 

LLDMINFVLLYVMPLLVMTVLYSKIAIALWRSSRGLTPHVVQHQHQQPQQPSCQDIGMGMHNSMYHHHPHHHHHHHQHHQLQSAASSAGWGVGLGGGGGGGPGPSLASGGS 

STTSLSRKQSSKYEKRGVSITESQLAWKTLPVAVTPALSPGQVARCSEQVSIHIKAIAGPCHKLGHRCDTWSVEWWSEVTTGTGRNCKCEAKVSLEADRPIVSACRKTSFY 

HHGHAHHQRAGNASVGGGSGGAGAGATHMSHSSSNVLRARRGVVRMLIIFVLTFALCNLPYHARKMWQYWPFISPHGLSVNNILWYIAPAIGGNLHSPPLDGKLTSASGQIS 

SRWLPAGIEASKQLSHQWTELELQPALPLA* 

(SEQ ID NO: . 174) 

Name: Neuropeptide receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013385217 

CCTTCGACTTGAGCGGACAGTTGAGGTTCTCGAACTAATTGCAAAATTGTCAATGATTTATTCAGTTGGCAATTAAGGTGCCAACTAACGGTGTGGCTGGCACGGCTATATC 
GATTTGAAAATATTCAATAATCCCGGAGCCTTGATAAGCTCATCAAACATGCACATGGCCATCAGCAGGGGGGGGCAAGTTTGTATTTAGAGCCAAGTGTGTTTGGTGAAGG 
GGGGTTAGCTGTATATTTCAGGGTTTCCCCTTTTTTTGTCGAGCGGTTTGGGGGGGGGTTTTGCGAGGGGACACAAAAGTACAAGTTGCTATGGATCAAGCTCAAGGCGTGG 
CAGTTGGGCATTAGCTCTATGGTGGAGCACGGCAACCAAGCGAACAAAGGTCAACTCGACAAGTGAGTTAAATGACAAGGACCTCACCTTGCCTTCCTTGGGCTTCGGCTTT 
GGCTTTGGCCCGTGCGTGTTTGCATGTITGGCTCGATGAGGCGACGCGACCCCAAAAATGAAACTCAACTTTGGCTTTGACTGCGGCTCGGGATTGTGGCGTGCAAGCATTA 
TGGCCTTTCGCTTTCAAAACAATGCCCCCATCCTGTTTCCCGGTTGGCCTAGTGGCACTTGCTCCTTGTTGTTGGTTTTTCGCATTGCCCTTGGCCAATTGCGAGTGAATTT 
TGATGTTTTGACTGGCCATTCCTGGGGCCATTTGACAACTCTATAAAACCGGTGTCCTTAAGCCCGCCCTCGTCCTTGCGCCCAACCCCCAATTACCGTCCAACTACCACCC 
ACCTACACCCCGCTACACCCACCCACAAAGTAAACGTAATTGCCCGGAGAGTTTCGCGACTTTCGACTTGGCCAGCAAACTGAAAAAGCGAAACTTCAAAAAACCCGACTTG 
ACTTGGCAACTGGAAAAGTGGAAAAGCGGAAGCACGAGCTTTCCCACCGAAAAGCTCTCGTGCTCTAAGCTTTCGTCCTGGCGAGGCGAAAACGTAAATGTCAGTCATGGAG 
TTCCAGTCGTCGGATACATGTGGAACACAACGAACACAGCAGGAGCCTCTCGCGTCTCATCCCCCGAATACAATGCGAAAAAGTCTGTACAAAAAATAGACAAATAAGGAGC 
AGGAGCGAAAACACCACTCTGTGCATAAGATATAACTCCCGCAGAACTGAATACCTGCAAAAAGAAATACGCGTTAAATGGAATTACGGTGCGGGAATTGATAAAGTGAAGT 
GTGTTACAGCTAGAAAGCAAGTGAAATAAAACGAATACATTAATAATTCATGCCCCTCATAGTCAGAAGCTTACTAGTTTGATAAGGAATTCAACGTAAATCTGATATAGTA 
ATGTAGAAAAGTTGTACAAAGTAATATACATAGCATATAGCATATAGCACAGTTACACAGAACACACCGCAAACTCAAACAAAGTTGACAAAAAAAAAGCTCACGCCCCTCA 
CATCTCAGTTTTTTCTCGAATTATTTGTGAAAGTGCAACAAACATATATAAAAAATAAAAAAGAAAAGGGAGGAGAGAGCACAGCAAAAAAAGGCAGTGACAGCAGGACCTG 
CAAAGGAATTGCCATCATTTGGGGATTACGGAATTATCGGGATCCGTTCTCATTAAAATGGATTTAAGGCAATGCCGATGTGCTGAGCTGCATTTGTGCATGGGATTATCGA 
TTGTAATGCGCAGTCATCAAATTATGTTTGAACAACGCCGCCGCGACAGCGATAATAGCAACAATAACAACAACAACAACAACCAAATCAACAACAAACACTGATTTGGCCC 
CTGACCCCCTTTGGCAGCACCCCCTCCATCCAACACCCCCTACGACCCTGCCCAAGCAAAAGGAGCAGCGAAAAAGAAGAAGCGTCTTATGAAACAATTGAAGCACCGGCCA 
AAGGAGCAAAGGAGCAGACAAAGACAGAGACAAAAACATCGGCAAAGGATAAAACGGAGGCCGAGAAGAAGAGGAAGCAGTAGTAGCAAAGCAGGAAACAACAACAACAACA 
ACCGCAACATCGCATTTTTAAACGGCAATACGCAAAGTAAGTGAAGTGTAAAGTGAGCGAGATGGCAATAGAAAGGTGTAGAGTGCCGCGAAGAGAGGCGGAGAGAATACAC 
TTGCAAAAATGTGAGCAGCTAATTCCCAGGGATGTTAAAAAAAATTAAACCCTCATGGTTCAAAGATTCTAGGAGATGGACAAGTGATGAATGGTCAGTGGAAATATATGTA 
TGTTTTAAGCCAAGTGAAATCAAAATGATTTGAGAAGTACAAATAAAAGAAAGAAAGGAAAGAATGATATATGATATTTATTGGTTGTAGTACAGGAAAATAATAGTTAAAA 
TGATATCCCGGATTGATGGTCATTACAATGATCGTACCACAAAACTCTTTCTAACTCAAATTAAATTGTTAAAATTATTTTTCAATACAAATTTGATATATTTTTGTTTTTT 
CGCCGACAAGATGCATAGATCTTTCATAGAATCTGAAATCTATATCTTTTTTATATTGCTTTTGAGAAGAGTAAACCGATCATCATCTACGATCACTCCTTTTTGTTCTGTG 
TAGGTATGAAGAGAGCAGGTGGGAAGGCAGTGAAGTGAAGTGAAGCATTTGTCGCATTTGTGCAGAATTCCCGACGTAATTAATGTTTAAAGTCTGTGTGTGTGTATGTGTG 
CGAAGGTTTGTAAACTAGGTCACTAGGAAGTTAGCCCCTTGACGACCATCCCCTCTCATCAGCGCCTCTTCAGCGGCAACTCTACGTCAAAGGTGTCAACTCGCCAAATCTG 
CAACCACACCTTAGTTAAGTGAGCCACCCCGTAGCCACCAAAATTATCCACAGCACCACCACCTTTTATTCGTCCTTAACACACCGCCATGATTAACGCGCCTCAGTGCGAT 
TAACTGGCAAACAAATTTGGAGTGACGCTGCGGCGATGTACATACATACATATATGTATGCATGTTTGTGCTAAACCGACAGCATTTCCATTCAATAACACACGCGGCTCGT 
ATGCAAACACTGTAAAAAGGTTTTATGCTAAAAAAGACGCCGTATTTGTGCATGAAAAACTGTCATTCATCATCCAACCAACACAAATAACCGGTAATGGCATCGCCACATT 
ATGGCTTTGTATTTTTTACATCCTCAAACACAAAAAAACTTTAGTTGATTTCTGACTTGCATCGCATTCATCACCCACTAAAGTCGAGCAATCAATCCTCAAAATACGTATA 
CGCACTGTTAGTTCAGTTGATTAATAGGCTAATTATGAAGGAAACTGAAAGGAAACACTAAATCAATAAATCCTGGCAAATTAGCATCAGTATATATTTGTGTTAGTGCAAA 
CATTTAAGTTTTCTATCAATTTTAAATACAAATATGTTCAGAATTAAGATATATTTCAAAGGGAACTATGCTTTGTTTAATTAAGATTTAAGGTTTTATAACTATAAAAGCC 
TTCAAAATGTGCCATCTATTTTATATTCACTTACAAGCAGATTTCAATCAAGACCACTCTTTAATATGATATATTATGTCTAGAACAACCGAGAAGGCACTCTTAAATTAAA 
TATTTTAGTACCGTAATAAATTATGAAACGTGCTGACGGTAAAATATATAATACAATTCATACTTTAAGTTATATTTTTTTTTTGCCCTATATGGTTTTACGTATCGGTGTG 
TATGTGAGTACCTCGCACACGCAGGACATAGAGATAAAAAGGCTTCTCCCACATAAAAGTTCTATTCTGTTTTTTTTTTTTTATAGAAAAATATTGCTGGCAGGAGTTTAGA 
ACATTGCCAAAGGGGATGCGGACAGATGAAGGGGCGGGGGCGTGGCGGGTAAAAGCCATAAAACTCAGGCGCCGAAGCGCAAAATGCCCTTGCCCTGAACAAAAAGAAGAAA 
TTGAACAGCAACAACCAACAACAAAAACAAAGAAGAACAAGAGCAAATAAAATACAAACTGCTTATTTATTTATTTGTTTCGACAAATCTCAATTGAAACAAAGCTTAAAAG 
CTTCGAAATCAAAACTTAAATAGAAACCACACAGACATCCATCCAGACGAATAAAACGAAAACGATAGAGCGCGAGACAAAGAGACAATTACTATTAAATCCAAGTTCAATT 
CGGTCCGCAAACAATTGAATTTGTTGCAGATTTGATTTGGCTTCATGTGGATTGCGATAACTATTTTCCTGGGATTAGTATGGGATTGGATTCATTGGCCCGACCGCAGTTT 
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GCTTCCTAATGAGATTTGCTTAATTAAAATGAAATTCAATAACTGGAATTCCAATGAATTTAAATGTTAGAGAAAAACTCTAATTCGGACTTTTGTGGCTAGCCATTATTTG 
CAGCAAGGGGGACATAAAAAGCATTTGGCTTTCGTTTGGATTAGTCAGCAAGGTCAAATCTCGAAACTGACCTAAGCTAATGCAAAATGACTTTTCCTGTCAATCACAATCT 
ATATTTAATTCAAGCGAAATTCTCTGCCACATTTCTTGCGCTATAAATCGCAGTACCCATTATAGCATATCAACAACCAAATTAAGACGAAAGTACAATAAAATGCAACTAA 
TTTCAGTTTTAATAATGGATTTGCTGGCGGCTTATTTTTGTTTCAAATTTGGTTATTTTCTTTAGAAATGTGCGCATACATTAAGACGAACACAAAAACCAGCTTTGGCTCA 
ATCTCTTAAACCTTTAAAACGTTATATGTACGCACATATTTATGTACGAACAAATGAAGGAAAATCTCAAGCCTAATTCCGCATTAAGCTGCAAGGAAACACATAACCCAAA 
AGTATGCAACATAAATAAAATTTAAATCAGCACGTTTATGAAGTGGATTATTTGAACTGAAATGCTTATTCAACGTTATTAGATGAGATTTATGGCCAGGAGAGCATTTGAA 
TTGGCAGATTATTCGGGCCAACCAAGTAATGGCCACTATATCAAACAAATCAGTTATCCTTCAATTGCAAAATACACAGTGCAGACAAATAGTTTAAACTGTCTTAGTGGTA 
ATCAAAATCAACTTTAATTATCAACTAATGTATGTGTATTTTACGAAACAAAAGTGCATTATGCATGAACGAATGATAGCAATATTTTTCAATATAAAAGCTTACGATTGTT 
TTCCTTTTGTTATCAATTATCTATCACACTCCACAGACATACTCCTAGATCATAATAATAAGTTATATAAATATAGCATAGATTTTGAAAGGTTAAAAGGTTTTCTCCCCAC 
CACTCCCTCTAATAATATTGTATATAATAAAATATATATACAAATGTATTTCATAATTTCATTGCCATCCAATTCCGCATAGTATCTGGTATACCCCATCTGCAGAAAGACA 
CCGAAGTGTATAAATTGCTTACGTTTGCCCATGTAGTCATAAAAATTGCTTTGGCACATAAAACGAAGCTTGACGCTCGCTTTTGGTTTTGGTTTTGGTTTTGGCCTGGTCT 
AGTCTGGTTTGGTTTCATTTGGCCCCTTACCGTTCTTGTTTCCTTTGGTTTCCATCGCTGGTGGGCTTGTTTTTTTGCGAGAGTCCTGGCCGTGTCCTTCTTATATTTCTAT 
TTCTATTTGTATTTGTATTTGTATTTGTATTCCCCGATTTGTATTTTGTTCGCTCGCCAATTCGGTAATCACATTTCAAGCGTTTTATTCGTCCTGATTTCATACGATTGCT 
GATTCAAGTTTCTGTTTGTGGTCCTTGCCTTGCAGGATATGACAAGTGATGGTGCTGTTACGTTTTGGATATTTTTGCTTTGTTTAATCGCCTCGCCGCACCTGCAAGGGGG 
CGTGGCCGGGAGGCCCGATGAACTGCACATCGGCGGCATCTTTCCGATAGCCGGCAAAGGAGGATGGCAGGGCGGCCAGGCGTGTATGCCTGCCGCAAGACTGGCGTTGGAT 
GATGTCAACAAGCAGCCAAATCTGCTGCCGGGCTTCAAGCTCATCCTGCACAGCAACGACAGTGAGGTGAGTAGTCAGTACACTGAAAAAAAAACGATATATACCTAAGGAT 
ATTTTGAATGTAATGCGATGCAATAGTCCTACAGTCCAAGTTCGATACGAAAATAGTTTTTTATGTTTCCTCTTAAGCTGGCTTTAATAAGAATAAACATTTATTGAAGCTT 
TTGAACTTTCTTATAATTTTCATAGGTAACTAATATATACAATATATTACATTAAATACCCAATAAGTATCCAATAAACTTCGAAGCAACACAGAACTTGTTTGCAACACGA 
TTAATATCACTGCAAATAATATCCTGACCAAAAACCCCTCCTGATGCACTGAGTTCTGTTTAAGCCTTCTGTTCACGAAGCATTTTCGCTCAGTGCACTAGTAAGTTGGAAA 
GCGATAAGCCCTACTTATATGCAGCCCCTGCTGCCTCCTCTGCTGCCATGCCACATACCCTTGGGTGCCACTTATTTTGTTTCGTTTTTGTTTCTGCTCACTGATGAGTAGT 
GGAGTGCTGGCTGAGCGCTGGCATTCATTAGCATATTTCTCTAGGGGATCAGGCTGGGGCTGGGGCTGGAGCTGGGCACTGTTAACAAAGGACAAACAGTTTTATTATGCCG 
CATTAAAGGGTATTGCTGCGCGTGTGTTCCACAAATTGAATTCGCCGCACAATCCGCCGCATGTCCAACAATATTTGGCAATGCCTCGGCTTAATGACTGCACATAAATTGC 
GATTGAGATTCGACTGGAGCACGTTCGAAACGTTACCAACAGCAATTATATTGGCTGAGTTCGGTTAATCGGGCTTTTATGTGCGTGATTCGTAATTGATAAAGGGTATTTA 
GCATTAAGAAAAATGTGTGTATATCTCGAACAAGGTCGCCCAATAAATGTCGCCCGGTGTTCATTAAATCATAAATGTATTTACATTTATTCGTGTAGTGCAACGCTGTAAG 
TTTAATTATTTCAATGCGAAAATCTCTCATTTCCCAGAAAGTTTCTATGAGATCAATAAGTAAAAGTTTTTGCTATAAACTTCGAACTTGGTTTTTTGCAATCAAAAGTTTT 
ACGCATTTTATCTTTAATGCCAGCAGCAGCACGAACTGAACTTCTGCACAAAGCTTTTCTTGAATAAATCGGTATAATCAAAGTGAACACAGTATTGATCATATTTTTTTTA 
TTGATCCTTATATAAAATTGTGAAATTTTTGCATTTTGTAGCTAACTTAAAGAGCGCGGAATATGTTTTTAGCTGGCATATTTGTTCTAAATGGAAAAAATAGAGCATGGGC 
ACGCAATAAATCAACATTTCAATAGTATCCAGTTGTCTGGATGCTGTAACTTCTGTTGGCTTTTAGAGAACCAGCTATTAAACAAATACCCGCCCTCCAAACGGTCATGCAT 
CATAATAAATGGCCAGTAACCGATGTAGTTTCCGTTTTCCAGTTCCCAACCAAGTTGCGCGCTATCAATGCCCCTTGGCGCGTATCTCTAATACAATACATATAAAACCAAA 
ACCAAAACTATCTTTATACATTTTCATATTTCGCACATATTTTTTGTTTTTTTGTGAATTTCGCATTTTAAGGGTCTGCGGTACAGTTGTTGCGGTTTCCACTACTTCGGCG 
GCTGTTTTGGGCCTGCCTATTTTTAAGCGCGAACCTCATTAGTTTTAAGTGAGCGAAAGCTTAACTACTTGTTTTGAGTTTTGAATGAGGTCACTTTGCCCCGGCCGAGCCG 
TAGCTCATCATTTGAGTTATTTTTATTCAAGGCTGCCAGGAGGGACAGTCACTACTACTACATACATATATCCTACGAACATATACGAACATATACACGCCCATTTACGAGC 
CAGATACGGACATATAGATAGGAACAGAGCCGGAGAGGTAAACCAAGTCCTATCCCTGGTCATATCTGTTCCTATCTGGCGTGGTCTGGTCTGGTCTGGTCTGGTTAGTTCA 
GTTCAGT'ICAGTTCGCTTCGCTTCGCTGGACTGAGCCTGAGAGCCAGGCGCGTTATGTTTGTGGCTTAGGCAAACAGCTTCCGTTAAATGCTGCATAATTAGTTGCCATCTC 
CAGGTCCCACAGCCACCTTCTCCTCCGGCACATCCACATCCACACTTACTCCCACAACGCAACGCACCACCCCATCCCGCCCACATCCACGTCACTTGGCAAGCGACAAGGG 
GCAAAGGTCGCACGCAATGTTTGCCTTTGCCGCTGCCTTTTGCTTGACCTACTTTGATGTGTGAAATGTCCCAAAGAATTCTGCACAAAAGCACTGAGAGTTGAACAAAAAA 
ATGTATAAACAAGGCAGGTTGGTTATCCTGTAATAAGCGAAATCCCTAGTCGGCTTAAGTCCAAGATCACACAGAACAGGTGAATACTTTTGTGGCGAATTTCGAATCGAAC 
TTTGATTTTTTAATGGCCAAATATAGGTGCTGAAAAATCGTATTATTGCATTTGCAAAGCTGCCACTTTCTATCGGCTAAGTGCCTTCATGCGAACCACTGAAATGGCTTAA 
ACCGGTTGTCAAGCTTTAGGAATAAGCCGGAAATTGATTTAACAGCGTTGGGATAAAAATATGTGCTCCTAAAAGGTTTTTGCTTTTTCTTTTTTAATTAAAGTTTAATTAA 
ATCACAACAAAGCATTCAATTATTTAAACTCACATTTTCAATTGCAATACAGAAAAGGTAAATATAATGGCTAAGGTAATCAAAAATTATATAAATAGGCATCTACAGAATC 
TGCGAGTTTTAATTTAAATTGAATATTGTTTTGTACAATGTAATAATATTCCTAAATGTGCTTTGTATTATAGAAGAACATTCTACTAATAAATCATTTAGCTATTTTTTGG 
TTAATATTAAAAGTACTATAGTACTATGTACTATCAGTTTTATGCTGCTTAGTTGTTCTATAAAATCAGTATGCCTCTGCCCACGAAAAGGGAAACTATTAAATTAGAGCGG 
ACATCGTGGCATGCCTCGGTACCCAAAGCCAAAGCACACACCCAGATCTGAATAGATGGAGTAGTACGAACAGGATAATACTTATGCATAAACAGGGACTGGACATGAGAAG 
GAAATGAAAGGAAAAAAAAAACCAATTAGAATGGTTGTTTACACGCAAACCAAGTAAGAAATATGAAAATTGTCGTCAGAATGGGGTTTGGGCCAAAGTAAAGGAAACGCCG 
AACAGGGAAACGAGTTTGCCATGTAAATGAATGAATGCGAGAATGAATAACTGAATGAATGAATCATTTATGTGGCCAATGTGGATGGATACACGTATGCCCACGTATGTAT 
GTATGTATATTCACAACTTCGGCACAAGGGAAAGGGACACACCTTCGTGAATTATTCATTGTCATGAATGAATCTTGTTTAATTGATAAATGCTATGACATGGGGATTAATG 
AAAAGCATAATGGAAAAATATCAACCAGGTGGCACATAAATTAGGCAGGACTTGGAATATGGAAATTTAACTAGCTACATAAATATTGCTACAAATTCGTATAATAAATCAC 
GCTAAATGCAACAATTAACATGCATAGTTTAAAATTTGATTGATTATAATTTTCAATTTTCTCTGCCAGCAAGTTTTTAATTAAGCAGCTTAATTTCGTAGTTTGCTATGTC 
AAATCCAGCATTCCATCCAGTTGATTAATTTTCATTTTCAATTATATTGGCGAAAATTGGACGAGATGACGAAAATTCCTTCTGGGCCAGTTTTGTGCATTAAATTCAATTT 
AAAAGCCATGCCAATGTCATAAACGCCCCGCATGAATCACAAAGCCAATTAATCATATCAATAATTATCTTCCAATGAAGACAGACACATAAATAACAGGAAATTCGCTATT 
AATAACTCCTGAAGGGTCGTTGCTTAGCATAATCATAATCAATACATCGCGTTGGCCATCAAACGAAACCTACATATCCGCCCATGTTTATGAATATTTATCGAGCACCGAC 
GGATGAGAGTATAAGTATTCTTGGCAACTTGGCTAATTCCGCTTTCCCCATCCCCATCCCACCATTTTCATTCACAGTGTGAGCCCGGTTTGGGCGCCAGCGTGATGTACAA 
TCTGCTCTATAATAAACCGCAAAAGCTGATGCTGTTGGCAGGATGCAGCACGGTCTGCACCACTGTAGCCGAGGCTGCCAAAATGTGGAATCTAATTGTGGTAAGTGAAATT 
TGCATAGCGCCAGAATGGCTGGAATATATACATACATACTTCGAAATACTTTACCTACATGCCACTGCCCCTCCACAGCTCTGCTACGGGGCCTCGAGTCCGGCTCTTTCGG 
ATCGCAAACGATTCCCCACTCTATTCCGCACCCATCCATCGGCCACGGTGCACAATCCAACGCGCATCAAGCTGATGAAGAAATTCGGCTGGTCCCGGGTGGCCATTCTGCA 
GCAGGCGGAGGAGGTCTTTATATCGGTAGGTAATCGGTGCTAACTAATTACATTTCCATATTAACTTCGAAAACTTTTATGAACAGACCGTAGAGGATCTCGAGAATCGATG 
CATGGAGGCTGGCGTTGAAATCGTAACTAGACAATCATTTCTATCCGATCCAACAGACGCCGTGCGCAATTTGCGACGCCAGGATGCACGCATCATTGTGGGACTCTTCTAT 
GTGGTGGCCGCCAGGAGGGTGCTCTGCGAAATGTACAAACAGCAGCTATATGGCCGAGCTCATGTGTGGTTCTTTATTGGTAAATACAAATTATACTACTTAGATACCGATA 
TCTCATATGCATAGATACTTTCACAGAGTCTTCTTGTAACTAAACTACTTTTTTTTTAGGCTGGTACGAGGACAACTGGTACGAGGTGAATCTGAAAGCAGAGGGCATCACC 
TGCACTGTTGAACAGATGCGAATAGCTGCCGAAGGACATCTGACAACGGAAGCGCTCATGTGGAATCAGAACAATCAGACAACTATATCCGGAATGACTGCAGAGGAATTTC 
GGTTAGTACTACTACTCTATCCATTGCTAAAGTTTTTGCTTCAATGTGCCTTCTTTTTGCGCAGACATCGACTGAATCAGGCGCTAATCGAGGAGGGTTACGACATTAACCA 
CGATCGCTATCCGGAGGGATATCAGGAGGCGCCACTCGCCTACGATGCAGTGTGGAGTGTGGCTTTGGCTTTCAACAAGACCATGGAACGATTGACAACCGGGAAGAAATCT 
CTGAGGGATTTTACCTATACGGACAAGGAGATTGCCGATGAAATCTACGCTGCCATGAACTCCACACAATTTCTGGGTGTATCGGTCAGTTTGTGAAACTAATTTCATAATA 
CTTAGTTAGTTACTTATTTTTTTATTTGCAGGGTGTGGTGGCATTCAGTTCTCAGGGCGATCGTATTGCTCTTACACAGATCGAAGAGATGATAGACGGCAAGTACGAGAAG 
TTGGGTTACTACGATACTCAGTTGGATAACCTATCCTGGTTGAATACTGAACAGTGGATTGGTGGCAAGGTAAGCAAATCAGTTAGCGTTGAAAAAACAATCATATAATCAA 
AATGTCGGACGATGATAAAAATCAAAAGAAGACAAAGCATGTAACGACTGAACCGAATAAGAACGGAAATCCGTCCAAAGACGACGTTAATACAAGAGGATCATTAACAAGC 
TTAAGTGAGAAAAATGATGAATCCAAAAATCCATCATCATCTGCAAGTCTTCAGCAAGCTCCTAAGGTAGCACCAAAACCGAAAAAGTTATCAATTTCGGATGCTGGTAAAG 
ACACAGTAACCCAAAAAGTCAAAGAAAATGAAGAGCCAGGATTTTCTAAACAATTTGAAAAAGAATCTATTGGTGTGCGAGCAAATAGAAACAGCAGTGCAACAAAAAAGGA 
AAATGAAAAGCTATTAGTAAAGACCGTGCCCGGAAAATCTCTTATAAAAGAATCTAACGATGAAAATGTGGAACCATCTAGACGCACAAAAAGTCAGCCAGTTGGAAAAAAA 
GTTTACGAGGAATCTTCAACGAGGCGAGTTCGTGAGCCATTTGATTCTTTTGATAGGGAAAAGTACCTGTCTGATATGATCGATTATGATCGATCGTCCGAGGACGTGCCGG 
AAAAAAAACTATCTTGGGAACCAGACAGTACACTTCGACGTCGTTTTGTCAGCAATAATGAATATCATTCTAGTCTTGAAGGTGAGGAGGAAGAGATCGATTTAGATTCAGT 
TGGATCGTCAACAATGCGAGGTAGCTATCGAATGCCACAGAATGCCGAGGATGAACGACCTGTTATCCTAGCTGAAATAATCAATATGGGAAAGTTGAAGGAGCTCAAGGAG 
CAACGCGAATCAACTAACGTATTTCGTAACAATCGTAGAGATGATAAAACCATTGATGAAGTGAAGGGTTCCAAGATCACAGGAACCAGCAAACAACATGTTGAAGAAGTTA 
AATATAATGAAACAGCAAGTAATGAAGATGATGACAATGCACAGGTAAAACATATAGGTACAATCACAAAAATCTTTAGCATTGCGCAAAGAATGAAAGAGAAGAAAAGCTC 
CAAAGAAAGTGAAGAAGACGAGAATAAGGACGAGAATAAAGATGTAAAAAAAGATGAAAATAAAGACGATGAACAGAAACTTCCCCCGGAACCAGAGATCGAATTGGAGGCC 
AAAAAAGCTTGGACTTTTCCGATCGCAGAGACATCACCTGATATCCCTGGAGAAAATCAGATCCTGCAAGAATACCAAAAGGTTAAAATTGGCAAAAGAAACAGGAAATACA 
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GATGTATAGGCATCAATACGGATATAAGCAGAAAATCTAAGATTAAAACATTTATCAGCAflGTCTGCAGGTGATGftCTTAGTAGTATACCATGATGGTGGAAGATTGCGftGA 

TATAGGACAAACCACTGAAAATCTTAAGAATCAACGAAAGAATCATAMCACAAAACCGAACCAGATGATATACCCGTTGATATAGGACACAGCGATGATCGAGTAAGGGAA 

ATTGGAGTGAATACTAAAAAGTTACCAAAAATCATAATACCACCAATTGCCGAGATGCATGTGCACAAAAATGGCAAATTAAGGGACATTGGAACCAGCACAGATAAGCCAT 

TTTGGCCAATCGATGACGGAACTGATGTAATCTACATGCATCCCATTAAGACTGATAGGAAGAAATTAAACAAACTGATTGTGGATCCACCGCCGGATAATGGACCCTACAA 

GATGCCAACCAAAGAAGATAGACGAACGTACTACAAAGGTTGTGAATATCATTTTCCTGGCCGAACGGAATGGCGCAGGCTTTTCTTCAACAAAATCCATGGAAAATACAflG 

CTACGCCGTCCAAGTCATTGGCTCTACACACTAGTTTTTTCCGTTTTATATATTTTGTTTGTCATTATCTTTTCCATGGCCTGGTTCGATTTCATCAAAGATGACGCTTCCA 

GAAAGGTACCCATGATAAAGATGGCACAACCATTTATTAGTTTCACTCCCATCGGACCACGAACTAATCCCAAGGCGGTATCCTTTGATCCTAGAAACAGCACAGAAGTCAT 

GGAAAAGTACGCTGGTATAATGGCTCTTCTGGAAAAATACGGCGACTATGGACACAATCCGCGTTTTGGAACGTGTACTGCGAATGAAAAGTTTGGCTATCCAAGCGGCGAA 

CCGTGTGTTTTTCTTAAAGTTAATCGGATTATTGGATTTAAGACCGAACCCTACATCAATTCCGATGAGCTTGTCAAAGCCAAGATCGATGAGGTTGAATTTACGGCCTTAA 

AAAGGTTATTGGAAAATACAACAACAGAGGAGGGTCATTTGAATCGCACTTGGATAACATGCCGTTCAGATAAAGATAAGAATGTTTTGATTGAATTTCATCCAGAGCCGGC 

GATAAGAACGGAATACACCGATATTGAAGAGAAAATCGAGTATATTGCGAACGAGGGTAAAAAGTCATTTTTCGGCCCAAACGATGTGAATCGAATCGTGGCTCTTAAGATC 

AAAAATCTCAAAGCAAATGAACGTGTCCATATAAATTGCAAAATGTGGGCTCAGAATATACATCATAGGAAAGAGGGTTATGGCCAAGTTTCGTTTTTTGTGTTATTAGCTA 

CAAATGAAAACCGCGAGAGGGTGGAAAAATTACTCCGAACCGACTACTCATTATAAAATACATAAATATAACTTTCTAATAAAATGTGTTATATGATATATGTATGTATTAT 

TTATAACATCTATAATCCAGGTTCCTCAAGATCGCACAATTGTCACCCATGTTCTACGCACCGTGTCCTTGCCATTATTTGTGTGCATGTGCACAATATCCAGTTGTGGCAT 

ATTCGTTGCCTTCGCCTTGATCATCTTTAATATATGGAATAAGCATAGAAGGTAACTATTTAATTGGTAGTGCTCGTGCAGAACTAACTAAGGGTTATATAATTTCTAGAGT 

AATACAATCCTCGCATCCCGTTTGCAATACGATCATGTTATTTGGTGTCATCATCTGTCTAATATCTGTCATCTTACTGGGCATCGACGGACGCTTTGTCAGCCCCGAGGAA 

TATCCAAAGGTGCGAGTTACTTACCTTAAAAGTTCTCTTCATTTTTATTAAAGTTAATTTGATATAAAATTATCCATTAGATATGTCAAGCGCGGGCTTGGTTACTATCCAC 

CGGTTTTACACTAGCATACGGTGCTATGTTCAGCAAGGTCTGGCGTGTGCATCGTTTTACAACAAAAGCAAAAACTGACCCAAAGGTGGGTTTAAATTAAATTAACTACAAC 

TATATATTAGTTGCTAATCATTTTGTGGAATTGCAGAAAAAAGTGGAACCTTGGAAGCTATACACCATGGTTTCGGGGCTATTATCAATAGATTTAGTGATATTACTCTCAT 

GGCAGATCTTTGATCCGCTGCAGCGTTATCTCGAAACATTCCCACTCGAAGATCCAGTATCTACTACTGATGATATTAAAATACGTCCAGAGCTTGAGCATTGTGAAAGTCA 

ACGCAACTCCATGTGGTTGGGTATGTTGTGATAACGATCCATTATATTGTATATAATATATTTATATTATTTTGCAGGTCTTGTATACGGCTTCAAGGGGCTAATCCTGGTG 

TTTGGCCTCTTTTTGGCGTACGAGACGCGCTCCATTAAAGTGAAACAGATCAACGATTCGCGTTATGTGGGCATGAGCATCTATAACGTGGTCGTCCTTTGCCTGATAACAG 

CTCCGGTGGGCATGGTCATTGCATCGCAACAGGACGCGTCCTTTGCCTTCGTTGCTCTAGCTGTGATATTCTGTTGTTTCCTAAGCATGCTGCTGATATTTGTGCCAAAGGT 

CAGCCACTACACTGAACTAAAAGAAGTAGCACATTTGTAATGATCTCTATTCGCACAGGTCATTGAGGTTATACGTCATCCCAAGGATAAGGCCGAATCGAAATACAATCCC 

GATTCAGCCATATCGAAAGAGGACGAAGAACGCTATCAGAAACTTGTTACCGAAAACGAGGAATTGCAACGATTAAIAACACAGGTATGCTGAATTATGGTAGAT.TATCTIA 

ATAAAATCGTATTTTATTTAATTTTGTTTAAAGATTAAATTCAACTGTTCGTTGAACGTTAATCTAAAAGTATTTCGGTCTTATCATAATTTGAACAAAAGTTAAGGATTGG 

AATTTCCAGCGGCCCCAGTGAATTCCATTCAGCATGCCGGTTTCACCTTCTCTAAAATAAAGTCCATTTAGCGAGCTGCAATATGAAGTATTATCAACCTCTTATAAATCAA 

ATTTTAATTTGATAATTAATTTACCTTGACAAACATGAGTGAAACCACCAACCACCGCCATGCTCGGCTGCACAATTTTTACTAGATTCATCATTATCACGATCAAAAGTTG 

AGAACCTTTTATTTATATGGTATCTTAGGGAGTCCCCAGCTGTTCCAGAATATTTTCCCACCCGTTCCAGTTTGTATAATTCAGTTTCACTGTCCACTTGAAAGTCATCAAA 

GTGAGCGTATACTGTAGCGCCAGGACCATGCTTCAATTGAATGAAAAGTTCGTGTGGTTGTTCTCTAGTCATTAGGTAAAGCTTCTGAAGACCGATGAAAAATTCGCCATTC 

GGATCGCCAAATCCATCTTTGTAACTAAACCACGATTGGTTAAAATTAACACTTCCATCTAGTCGTCTTTGAATGACAATCCAATCTCTTGCCGTCGTTTTGCACTGGGTCA 

CTTGAAAAGGTTCTTCTTCTGGCAACATTAGCTGGTGAATTCCATTTGGACTGCCACTAGGACACGTGTGTGGTGAACTCTGGCCCACTTCGAACAGCAAAGTCCATAAAAA 

TAGTACAAAAAAGCACGATTTCATTGTACTCGACATGCGTACTGTTAGAAGATCGGTGGCTTACTGAAGTATCTACCATTTAAAAGGGGAACATTGGGACGCAGTTTTGTGG 

TTTTGCTACCTTTGACCGCATTGCTTGATATGCATATATTCTAXTTAGGTCGGCATTTAAGCAAATGGATTCCAAAAAAGTGAATGGAGACTATTTAAAAGATAGAACTACA 

TTTTTATTGTTTAAATGTTTGTTTGTTTTGCTTATATTTTTATTGTTTTATAAACATTGATTTATTAATTTTCCGCTTGCAGAAGGAGGAAAAGATTCGAGTCCTGCGACAG 

CGTCTGGTGGAGCGGGGCGACGCCAAGGGCACAGAACTGAATGGTGCAACAGGTGTCGCCTCCGCCGCCGTTGCAACAACTTCGCAGCCCGCTTCCCTCATCAACTCATCAG 

CACATGCCACGCCCGCAGCCACACTCGCAATCACACAAGGTGAGTAGAGTACCTGGTGCATGGAACTATGCATCCCAAATGCCCCTTAGCCAGTTGCATGTGGTAATCGATA 

CTAGTTACGTTGGTTTAGCTTGACGTTCTTAATCGTTATGTTCTCGATTTTAACTAGAAATCTTAGCTCTATTCGTCGGCATTGACACAATTGATTAGGAACAACAATATAA 

ATGGAGTTCAAAGGTTATTTGCTTATGCTAACGCTATGTAAACTCGATTTTCGCACAAAAAACCAAAGGCAATTTATCAGTTTCTGGAATGTCTATTGCTTTGTGTGCATAC 

ATATGTATCTTAAGCACTTTTGCAGTTTTTGAGCTTAITTATATCTTCAGCACTTTAGCACTTTTTTAATAAATATCACTGCTTCTGCACCCTTTTTTCACATTTTAGCAAG 

ATTATTCAAAACATACTAATACGTAATATATTTATCTTACAGATACTCATGAGCATCGTACATGAAAAAACTTGTCAATTTCCTTTGCGTGATAGGTGAATTATACAAAAGA 

TATGAAATGCAAATAATGATGCTAACGGAAGGATGGATTTAAACAAACACGTACTTAAAGAACTTTGTATCAACTTTAAGCTGAGCTAAATAAGTGAATTGGTAGCAGCAAT 

TTTGTAGAATAATGAAATATGTATTGTACCTTAAACGTGTTTAGCATTAATATTGAAAACGAAGTAATTTTAACTTAAAAGAAAACAAAACAAAAATTCGACACTAGCATCT 

AAACTAAGCGAATTGTTAGGCTTTAAAGTTGCAACACAGAAACCAAAAAAAATTCTAGCAAATCAAGAGAATTCATTACGAAAAGATCGTGTTAAAGTTAGTAATTTTTAAA 

TGCAGGTAGAGAAGACAAATTGTTGATTTTGTGTTTAGTAAATACTGTTTTTTTAGTTAAACGAATTATCCTTTCGAACCTCATTTGCTCAGCGCTGTCTCCTAGAGCTCTC 

TAGTTAATTATATCTGCGTAGATGAAAATTCCCCAAAAG 

(SEQ ID NO: 175) 

Exon: 1001.. 1090 

Exon: 1165.. 1208 

Exon: 1810.. 2052 

Exon: 3783.. 3856 

Exon: 5748.. 6002 

Exon: 10046.. 10180 

Exon: 10271.. 10441 

Exon: 10503. .10719 

Exon: 10812. .11284 

Exon: 11344. .11493 

Exon: 11565.. 14044 

Exon: 14446. .14569 

Exon: 14902. .15028 

Exon: 15086.. 15342 

Exon: 15403. .15540 

Exon: 16659.. 16847 

Start ATG : 1026 

Transcript No. : CT35221 

TCATGGAGTTCCAGTCGTCGGATACATGTGGAACACAACGAACACAGCAGGAGCCTCTCGCGTCTCATCCCCCGAATACAATGCGAAAAAAACTGAATACCTGCAAAAAGAA 
ATACGCGTTAAATGGAATTACGCACCCCCTCCATCCAACACCCCCTACGACCCTGCCCAAGCAAAAGGAGCAGCGAAAAAGAAGAAGCGTCTTATGAAACAATTGAAGCACC 
GGCCAAAGGAGCAAAGGAGCAGACAAAGACAGAGACAAAAACATCGGCAAAGGATAAAACGGAGGCCGAGAAGAAGAGGAAGCAGTAGTAGCAAAGCAGGAAACAACAACAA 
CAACAACCGCAACATCGCATTTTTAAACGGCAATACGCAAAAAAAATATTGCTGGCAGGAGTTTAGAACATTGCCAAAGGGGATGCGGACAGATGAAGGGGCGGGGGCGTGG 
CGGGATATGACAAGTGATGGTGCTGTTACGTTTTGGATATTTTTGCTTTGTTTAATCGCCTCGCCGCACCTGCAAGGGGGCGTGGCCGGGAGGCCCGATGAACTGCACATCG 
GCGGCATCTTTCCGATAGCCGGCAAAGGAGGATGGCAGGGCGGCCAGGCGTGTATGCCTGCCGCAAGACTGGCGTTGGATGATGTCAACAAGCAGCCAAATCTGCTGCCGGG 
CTTCAAGCTCATCCTGCACAGCAACGACAGTGAGTGTGAGCCCGGTTTGGGCGCCAGCGTGATGTACAATCTGCTCTATAATAAACCGCAAAAGCTGATGCTGTTGGCAGGA 
TGCAGCACGGTCTGCACCACTGTAGCCGAGGCTGCCAAAATGTGGAATCTAATTGTGCTCTGCTACGGGGCCTCGAGTCCGGCTCTTTCGGATCGCAAACGATTCCCCACTC 
TATTCCGCACCCATCCATCGGCCACGGTGCACAATCCAACGCGCATCAAGCTGATGAAGAAATTCGGCTGGTCCCGGGTGGCCATTCTGCAGCAGGCGGAGGAGGTCTTTAT 
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ATCGACCGTAGAGGATCTCGAGAATCGATGCATGGAGGCTGGCGTTGAAATCGTAACTAGfiCAATCATTTCTATCCGATCCAACAGflCGCCGTGCGCAATTTGCGACGCCflG 

GATGCACGCATCATTGTGGGACTCTTCTATGTGGTGGCCGCCAGGAGGGTGCTCTGCGAAATGTACAAACAGCAGCTATATGGCCGAGCTCATGTGTGGTTCTTTATTGGCT 

GGTACGAGGACAACTGGTACGAGGTGAATCTGAAAGCAGAGGGCATCACCTGCACTGTTGAACAGATGCGAATAGCTGCCGAAGGACATCTGACAACGGAAGCGCTCATGTG 

GAATCAGAACAATCAGACAACTATATCCGGAATGACTGCAGAGGAATTTCGGTTAGTACTBCTACTCTATCCATTGCTAAAGTTTTTGCTTCAATGTGCCTTCTTTTTGCGC 

AGACATCGACTGAATCAGGCGCTAATCGAGGAGGGTTACGACATTAACCACGATCGCTATCCGGAGGGATATCAGGAGGCGCCACTCGCCTACGATGCAGTGTGGAGTGTGG 

CTTTGGCTTTCAACAAGACCATGGAACGATTGACAACCGGGAAGAAATCTCTGAGGGATTTTACCTATACGGACAAGGAGATTGCCGATGAAATCTACGCTGCCATGAACTC 

CACACAATTTCTGGGTGTATCGGGTGTGGTGGCATTCAGTTCTCAGGGCGATCGTATTGCTCTTACACAGATCGAACAGATGATAGACGGCAAGTACGAGAAGTTGGGTTAC 

TACGATACTCAGTTGGATAACCTATCCTGGTTGAATACTGAACAGTGGATTGGTGGCAAGAAGACAAAGCATGTAACGACTGAACCGAATAAGAACGGAAATCCGTCCAAAG 

ACGACGTTAATACAAGAGGATCATTAACAAGCTTAAGTGAGAAAAATGATGAATCCAAAAATCCATCATCATCTGCAAGTCTTCAGCAAGCTCCTAAGGTAGCACCAAAACC 

GAAAAAGTTATCAATTTCGGATGCTGGTAAAGACACAGTAACCCAAAAAGTCAAAGAAAATGAAGAGCCAGGATTTTCTAAACAATTTGAAAAAGAATCTATTGGTGTGCGA 

GCAAATAGAAACAGCAGTGCAACAAAAAAGGAAAATGAAAAGCTATTAGTAAAGACCGTGCCCGGAAAATCTCTTATAAAAGAATCTAACGATGAAAATGTGGAACCATCTA 

GACGCACAAAAAGTCAGCCAGTTGGAAAAAAAGTTTACGAGGAATCTTCAACGAGGCGAGTTCGTGAGCCATTTGATTCTTTTGATAGGGAAAAGTACCTGTCTGATATGAT 

CGATTATGATCGATCGTCCGAGGACGTGCCGGAAAAAAAACTATCTTGGGAACCAGACAGTACACTTCGACGTCGTTTTGTCAGCAATAATGAATATCATTCTAGTCTTGAA 

GGTGAGGAGGAAGAGATCGATTTAGATTCAGTTGGATCGTCAACAATGCGAGGTAGCTATCGAATGCCACAGAATGCCGAGGATGAACGACCTGTTATCCTAGCTGAAATAA 

TCAATATGGGAAAGTTGAAGGAGCTCAAGGAGCAACGCGAATCAACTAACGTATTTCGTAACAATCGTAGAGATGATAAAACCATTGATGAAGTGAAGGGTTCCAAGATCAC 

AGGAACCAGCAAACAACATGTTGAAGAAGTTAAATATAATGAAACAGCAAGTAATGAAGATGATGACAATGCACAGGTAAAACATATAGGTACAATCACAAAAATCTTTAGC 

ATTGCGCAAAGAATGAAAGAGAAGAAAAGCTCCAAAGAAAGTGAAGAAGACGAGAATAAGGACGAGAATAAAGATGTAAAAAAAGATGAAAATAAAGACGATGAACAGAAAC 

TTCCCCCGGAACCAGAGATCGAATTGGAGGCCAAAAAAGCTTGGACTTTTCCGATCGCAGAGACATCACCTGATATCCCTGGAGAAAATCAGATCCTGCAAGAATACCAAAA 

GGTTAAAATTGGCAAAAGAAACAGGAAATACAGATGTATAGGCATCAATACGGATATAAGCAGAAAATCTAAGATTAAAACATTTATCAGCAAGTCTGCAGGTGATGACTTA 

GTAGTATACCATGATGGTGGAAGATTGCGAGATATAGGACAAACCACTGAAAATCTTAAGAATCAACGAAAGAATCATAAACACAAAACCGAACCAGATGATATACCCGTTG 

ATATAGGACACAGCGATGATCGAGTAAGGGAAATTGGAGTGAATACTAAAAAGTTACCAAAAATCATAATACCACCAATTGCCGAGATGCATGTGCACAAAAATGGCAAATT 

AAGGGACATTGGAACCAGCACAGATAAGCCATTTTGGCCAATCGATGACGGAACTGATGTAATCTACATGCATCCCATTAAGACTGATAGGAAGAAATTAAACAAACTGATT 

GTGGATCCACCGCCGGATAATGGACCCTACAAGATGCCAACCAAAGAAGATAGACGAACGTACTACAAAGGTTGTGAATATCATTTTCCTGGCCGAACGGAATGGCGCAGGC 

TTTTCTTCAACAAAATCCATGGAAAATACAAGCTACGCCGTCCAAGTCATTGGCTCTACACACTAGTTTTTTCCGTTTTATATATTTTGTTTGTCATTATCTTTTCCATGGC 

CTGGTTCGATTTCATCAAAGATGACGCTTCCAGAAAGGTACCCATGATAAAGATGGCACAACCATTTATTAGTTTCACTCCCATCGGACCACGAACTAATCCCAAGGCGGTA 

TCCTTTGATCCTAGAAACAGCACAGAAGTCATGGAAAAGTACGCTGGTATAATGGCTCTTCTGGAAAAATACGGCGACTATGGACACAATCCGCGTTTTGGAACGTGTACTG 

CGAATGAAAAGTTTGGCTATCCAAGCGGCGAACCGTGTGTTTTTCTTAAAGTTAATCGGATTATTGGATTTAAGACCGAACCCTACATCAATTCCGATGAGCTTGTCAAAGC 

CAAGATCGATGAGGTTGAATTTACGGCCTTAAAAAGGTTATTGGAAAATACAACAACAGAGGAGGGTCATTTGAATCCCACTTGGATAACATGCCGTTCAGATAAAGATAAG 

AATGTTTTGATTGAATTTCATCCAGAGCCGGCGATAAGAACGGAATACACCGATATTGAAGAGAAAATCGAGTATATTGCGAACGAGGGTAAAAAGTCATTTTTCGGCCCAA 

ACGATGTGAATCGAATCGTGGCTCTTAAGATCAAAAATCTCAAAGCAAATGAACGTGTCCATATAAATTGCAAAATAGTAATACAATCCTCGCATCCCGTTTGCAATACGAT 

CATGTTATTTGGTGTCATCATCTGTCTAATATCTGTCATCTTACTGGGCATCGACGGACGCTTTGTCAGCCCCGAGGAATATCCAAAGATCTTTGATCCGCTGCAGCGTTAT 

CTCGAAACATTCCCACTCGAAGATCCAGTATCTACTACTGATGATATTAAAATACGTCCAGAGCTTGAGCATTGTGAAAGTCAACGCAACTCCATGTGGTTGGGTCTTGTAT 

ACGGCTTCAAGGGGCTAATCCTGGTGTTTGGCCTCTTTTTGGCGTACGAGACGCGCTCCATTAAAGTGAAACAGATCAACGATTCGCGTTATGTGGGCATGAGCATCTATAA 

CGTGGTCGTCCTTTGCCTGATAACAGCTCCGGTGGGCATGGTCATTGCATCGCAACAGGACGCGTCCTTTGCCTTCGTTGCTCTAGCTGTGATATTCTGTTGTTTCCTAAGC 

ATGCTGCTGATATTTGTGCCAAAGGTCATTGAGGTTATACGTCATCCCAAGGATAAGGCCGAATCGAAATACAATCCCGATTCAGCCATATCGAAAGAGGACGAAGAACGCT 

ATCAGAAACTTGTTACCGAAAACGAGGAATTGCAACGATTAATAACACAGAAGGAGGAAAAGATTCGAGTCCTGCGACAGCGTCTGGTGGAGCGGGGCGACGCCAAGGGCAC 

AGAACTGAATGGTGCAACAGGTGTCGCCTCCGCCGCCGTTGCAACAACTTCGCAGCCCGCTTCCCTCATCAACTCATCAGCACATGCCACGCCCGCAGCCACACTCGCAATC 

ACACAAGGTGAGTAG 

(SEQ ID NO: 176) 



MWNTTNTAGASRVSSPEYNAKKTEVLQKEIRVKWNYAPPPSNTPYDPAQAKGAAKKKKRLMKQLKHRPKEORSRQRQRQKHRQRIKRRPRRRGSSSSKAGNNNNNNRNIAFL 

NGNTQKKYCWQEFRTLPKGMRTDEGAGAWRDMTSDGAVTFWIFLLCLIASPHLQGGVAGRPDELHIGGIFPIAGKGGWQGGQACMPAARLALDDVNKQPNLLPGFKLILHSN 

DSECEPGLGASVMYNLLTOKPQKLMLLAGCSTVCTTVAEAAKMWNLIVLCVGASSPALSDRKRFPTLFRTHPSATVHNPTRIKLMKKFGWSRVAILQQAEEVFISTVEDLEN 

RCMEAGVEI VTRQSFLSDPTDAVRNLRRQDARIIVGLFYVVAARRVLCEMYKQQLYGRAHVWFFIGWYEDNWYEVNLKAEGITCTVEQMRIAAEGHLTTEALMWNQNNQTTI 

SGMTAEEFRLVLLLYPLLKFLLQCAFFLRRHRLNQALIEEGYDINHDRYPEGYQEAPLAYDAVWSVALAFNKTMERLTTGKKSLRDFTYTDKEIADEIYAAMNSTQFLGVSG 

VVAFSSQGDRI ALTQI EQMIDGKYEKLGYYDTQLDNLSWLNTEQWIGGKKTKHVTTEPNKNGNPSKDDVNTRGSLTSLSEKNDESKNPSSSASLQQAPKVAPKPKKLSISDA 

GKDTVTQKVKENEEPGFSKQFEKESIGVRANRNSSATKKENEKLLVKTVPGKSI.IKESNDENVEPSRRTKSQPVGKKVYEESSTRRVREPFDSFDREKYLSDMIDYDRSSED 

VPEKKLSWEPDSTLRRHFVSNNEYHSSLEGEEEEIDLDSVGSSTMRGSYRMPQNAEDERPVILAEIINMGKLKELKEQRESTNVFRNNRRDDKTIDEVKGSKITGTSKQHVE 

EVKYNETASNEDDDNAQVKHIGTITKIFSIAQRMKEKKSSKESEEDENKDENKDVKKDENKDDEQKLPPEPEIELEAKKAWTFPIAETSPDIPGENQI LQEYQKVKIGKRNR 

KYRCIGINTDISRKSKIKTFISKSAGDDLVVYHDGGRLRDIGQTTENLKNQRKNHKHKTEPDDIPVDIGHSDDRVREIGVNTKKLPKIIIPPIAEMHVHKNGKLRDIGTSTD 

KPFWPIDDGTDVIYMHPIKTDRKKLNKLIVDPPPDNGPYKMPTKEDRRTYYKGCEYHFPGRTEWRRLFFNKIHGKYKLRRPSHWLYTLVFSVLYILFVI IFSMAWFDFIKDD 

ASRKVPMIKMAQPFISFTPIGPRTNPKAVSFDPRNSTEVMEKYAGIMALLEKYGDYGHNPRFGTCTANEKFGYPSGEPCVFLKVNRIIGFKTEPYINSDELVKAKIDEVEFT 

ALKRLLENTTTEEGHLNRTWITCRSDKDKNVLIEFHPEPAIRTEYTDIEEKIEYIANEGKKSFFGPNDVNRIVALKIKNLKANERVHINCKIVIQSSHPVCNTIMLFGVIIC 

LIS VI LLGIDGRFVSPEEYPKIFDPLQRYLETFPLEDPVSTTDDIKIRPELEHCESQRNSMWLGLVYGFKGLILVFGLFLAYETRSIKVKQINDSRYVGMSIYNVVVLCLIT 

APVGMVIASQQDASFAFVALAVIFCCFLSMLLIFVPKVIEVIRHPKDKAESKYNPDSAISKEDEERYQKLVTENEELQRLITQKEEKIRVLRQRLVERGDAKGTELNGATGV 

ASAAVATTSQPASLINSSAHATPAATLAITQGE* 

(SEQ ID NO: 177) 

Name: GABA B receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384569 

TGTGGGTGGTTCGCACGCCCGCCATTTCCGGCGCGGCCTCGGATAACGTCCTCGATTACGCCCTGCCGCCCAACACGCCCACTCCGAAGGACTACGCGCACAAGTTCCTGCC 
CGAGGGCAAGGATGTGGTACCCGGAAGCTCTTACATACCGCTGCGGTTGCTGGTGATAAAGCGCTGAAGCTGATATATGAATCTTGGGCGGCGGCTTGCCTTTTGTTGTTCG 
AAAACAAGTTCCGGCGGAATGCCGATACCCGACACTTAATGCCCAACCAGTAGTTGGCTAATTGGCTTTAAGCTTACTGGTTCCTAGCGCTAGTCTAGCAATAGTTTAGTTG 
TTTATGTTAAGGCCCAAGTTCGCTAATCTAAATTAAGCCACCACATCCCGGGGTCCGTGACTAAGACTTCCGACCTTATCGCATCCAATAAATCCAAAACTAAGACTCAGCC 
ACATGCCGTCGTAATCTATGAAATACATTCACCAAAAACTAAGTTGTCATCTCATTTACATAAATTAGAAGTCATAAATTTGAAGTGAGCGGGGAACTACTTGGCTCGAAAC 
GGTTAGAAGCTGAATTCAAAACCGGTTAACGTTTATCTGTCAGCTGTAATCAAAAGCTTAAAGCTTCGCTTGTTTGGTTTAGTCTTTGACTAAGTTTGGAAAGCAGGACTAA 
AAGCTTTTATTTGCCGAAACTAAAGCTCGGAGGTATTTATTTATTCACAAGTAAAATGACAAAACTTGGAAGTTCTTTGGTAATGTAATGAACGCCAATATTTACATTCGGG 
TTTTATTGTATTTGAAAAGTCTGATGAATCATTATCTTGATGACTGGCCCCTAATTTACTTTTCATGTTCTACGCAAGGGGTATGATAACTGGTTGACATGGATACCAAGAT 
TAGAGCTTGCAGACTTTATAATCATCCAACAAACATTATGGTGCATTGTATACCATTTATGTTATGAATGAAAGTGGTTCAGATAAGTTACATTTTATTTTATCTCACACAT 
CTAAGTTTTTGTATTGCTGCACAGGAAAATAAATCAAGATTATCCACAATAGTTTAAAAGTCAATTTATAGGTACTATGTAGAATTTATTTACATTTTGTTGTACTAATGGT 
AACTTAACTTAGGTTTTTAAATACTAGTACTGTTTGTTGATGAATGGCTTAGGCCTTGCAAAATCGAGTCAGTGGGCATTTGTATAGTTTGCTGATTTCTCTTCTTGCTCGA 
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ATTGCAftCAGAGACCCMCCSGGCCCGCTGGTTCTCTTTGTTAAATACTATAAflGTAAflGGAATAAAACAAATCCCTGTAAflGTGGCTGTCAGGCflGAATAAATftGGCGAAG 
ATCCTTCCGAAATCGAAGTATGTACACAATCCAAAAATCCATGTTATTCCCAAAAGAAAGAAGAGCAACACAAACAGGCCGAGTTGCTTGAAGATAAGGTCACGTTTGAACA 
GGGCTGTATAGACACTCCAGGAGATGTATCCCACCAATATTGCATTGGCCACCGTGATAAGGCCAATAGGGAGTATGACGCCCflGACTTAAACCATATCCAGAAGGATAGCA 
GAGGATGGGATAGTCCATTGAGGAATTATTGGGTCTGTAGGAACCGGGCTCCAGAAAAACCACCAACAAGGTGGGTATAAGTGGCAATGTCCAGGCCGCCACAGCCGACATT 
AGTATATAATGTCGTGGGTGATTAACTCCAATGACTCTCACATATCTTTGATACTGCAGAAATCCAATGATAAACATCCAGCTGAAGACAACCAAGAGTAGATATTGCATCA 
CGGCTCCAACCAGAGTACATCGTTCCGACTCAGATTGTTCCAGCTGCTCCAGCAAATGGCTTTGACTCAGGATCAGGAAGAAGAGCAACTGAAGACCCAGGGCAGCACACAG 
ATTCAGCAATATCTTCGTGGAGGCTAATGTCCGGAAGCTCTTAAACACGGCTGCAGTTATAAAAATCATTAGGAGGCCCAACAGAGAAAGGGTCAAGCCGACATTGGTGATT 
ACATCTAAGGAGTGATCATCTTCATTGGTGTCCAGACCTGCCTGCATTTTGCTCACTCCCAGTAGAAATGTAAACTGGGTCAGATGATCCGCATGGCAGAGGATCACTGGGT 
GCGATGAGGCTTCCAAGCTGCCACCACCGCTGGTGGATATTCCATCGCTAAGCCAGGTCTCATAGTTCCTiGTAACCACATCCTCCCTCAAATTGAGTTTCATTCGCTTTTGC 
ATTACGCAAGAAAAATGGCAACTTTCCAGGTAGCTTGGTATCTTACGAAAAGATTTTAGATTAAAACATATATGCTATAATTTTCCCTCAACCCACTCACCATTCAATCCTG 
GTATTGTTATGGATATAACATTACTAGTTGGCCGTCGTTTGATTTCCTCTTCCGTCTCCACAAAGAGTGAATCTCGAGTATAGACCTTAAAAACTAAGTAAGAGGCTCCCTT 
TCTCTTCAGTTGGGTCCATAGTTTTTCGGGAATAAAGACCGCACTTTCTAGATTCTTCATGGCCCTTATATCTTCCTCATTATCCGATAGGTGAAGCATTTCGAAATGCGAC 
ATATTGCCACCAGAGAAGTTCACTATCCCACTCACATTTCTCACTTCTGGATCTACAAAGAATACTCTGAGTTCGCTCGTCTTTAGAACAATGACACCCATATCAAACAAGG 
GATGGAATGTGAACTCAGTGCTGTTTGCAAAAATAGTGGTGTTTAACTTTAAAATTGTTGGACTAAGCTCATCCAAATAATCCTCGAAGTTGCTAAGGAGTGCATTGGTTGC 
ATTAGCGTCTGCTGAAACACGAAGGGTTTCATTATCTGTAGACATAACTTTTTGGCATATCTTCATCAAGTCCCCTACAACTGCAGCATTTCTTTCCGGTTTTTTGACCAGC 
GAATCGAATATCTCCGAAATACTGAAGACATCGGCGGCGGCCAGTTTCTCCTGAGTTTGAGTCAGGAGGTTACTCAGTTGGCTAATCATGTCCGGAGGGCTCTTCACAGCGA 
GAAGGTTTATACTCGAGCTGAGTTCCGGTGCCGACCTACAGGTCACGTGATCCAAGCTCTGCCAAACTCTATTCCCATTGCGAflTTTGGCACCTCCGTCGGATGGGAATTCC 
CCGTTCGTTCAGGCATAGATCCCTTAGATTCGCCTTTTGCCCAACTCTCGTTTTCTCCACGCGATTGATTCGCTTTTCCTTGCCTTCATCCGTTTCGTATTCCAGTTCAAAG 
TCCTCGGCCTTGCAGAATCCTCGGTTGCAGTCCTTGTAGGGATTGTGCGACAGCTGACCATCTGCATCCACGCAGAGGAGGAAGCGCAAGTTCTTCAGGTGGCAGTAGTCCA 
GACACCGGGCCACATTCGGCTGATCCAGGGCGTACAGGGGGGGATTTGTACCTCCAGATACGAGTATTATATAGCCAGCAATCCACCAGAGAAACAGCAGCATACTGCTATC 
TATTTTTAACTAATTTAAGCGCAGGCTGCGATGTCGCACTCCCGAAGCTCCGCGATCGACTGACGAGGCGGAGATAAAATGCGAGTGGCGATAAAACGAGTTTTCCTCTCCT 
CATCCCTGTAAATAGGGCCTGCAAGAATTGCCACTTTCCTCACTGTTCTAGCTGGGTATTTGAATTGCTGGTCTACTTCGAGCAGAGCTCAAGGTCAACTGGTTATCTGGCC 
GACGATAAGAATTGCAGCATCTTCAAAGGTCGAACGGTGCGTTCTTTTCAAAATATCTTTGAATATAATAAGTGAAAGTCTCGAAAATAACACGAACCTCTTTAATAATGGG 
TATTATTTATACACAGTAGTTCTGTTTACTTTGATCACATTTTTTATCAAGGGAACTTCCTAATTAAAAACAAAAAAAAAAAAACGAATAGCCTGATAAAGAGCTAAATGGT 
AATTATCAAAATTATATATACCATACACCCAACGATAATGATAATTTGTTTATCGCAGTAAAGTGAGATTCTCTGAATTGATAAAGGTTAATTGCCGTTTTTCAACACCATT 
GCATGTATCGATAAAACTAATAGGAATTACAAATAAATTAAAAATCTGAATTTTCTTTTTTGCTCTGGGTAACAATGGCTGCCATTTCTGAATGTTTCTGTTATTCTCTCTG 
ATTTTAAAAGTGCAACGATACTTAACACGAACTTTGAGTTGCTAGTCGCCATAAGCATATGAATGCATCAATCGGTAGCTAGGGGCTGTCAACAACTACAATATGTATGATA 
AGCAGCCACCTTTAGATACTTTCCTTACTAACACCTAGAAGGTAGAGATAGAGGTATTCAAATATTTGCATTCAGATATAAAATAATGTTCTCTTGTTCAAGCTGTGAAATC 
TTTATCTGTAGAGATAAAGTGAGCATTTCAATTGAAGAACCAATTTGTATATTTTTTTAGATTTCACCGAACGCATGAATAGAATTTGTCTGCTA 
(SEQ ID NO: 178) 

Exon: 3463. .2341 
Exon: 2281. .1491 
Exon: 1024. .1001 

Start ATG: 3463 (Reverse strand: CAT) 
Transcript No. : CT35672 

ATGCTGCTGTTTCTCTGGTGGATTGCTGGCTATATAATACTCGTATCTGGAGGTACAAATCCCCCCCTGTACGCCCTGGATCAGCCGAATGTGGCCCGGTGTCTGGACTACT 

GCCACCTGAAGAACTTGCGCTTCCTCCTCTGCGTGGATGCAGATGGTCAGCTGTCGCACAATCCCTACAAGGACTGCAACCGAGGATTCTGCAAGGCCGAGGACTTTGAACT 

GGAATACGAAACGGATGAAGGCAAGGAAAAGCGAATCAATCGCGTGGAGAAAACGAGAGTTGGGCAAAAGGCGAATCTAAGGGATCTATGCCTGAACGAACGGGGAATTCCC 

ATCCGACGGAGGTGCCAAATTCGCAATGGGAATAGAGTTTGGCAGAGCTTGGATCACGTGACCTGTAGGTCGGCACCGGAACTCAGCTCGAGTATAAACCTTCTCGCTGTGA 

AGAGCCCTCCGGACATGATTAGCCAACTGAGTAACCTCCTGACTCAAACTCAGGAGAAACTGGCCGCCGCCGATGTCTTCAGTATTTCGGAGATATTCGATTCGCTGGTCAA 

AAAACCGGAAAGAAATGCTGCAGTTGTAGGGGACTTGATGAAGATATGCCAAAAAGTTATGTCTACAGATAATGAAACCCTTCGTGTTTCAGCAGACGCTAATGCAACCAAT 

GCACTCCTTAGCAACTTCGAGGATTATTTGGATGAGCTTAGTCCAACAATTTTAAAGTTAAACACCACTATTTTTGCAAACAGCACTGAGTTCACATTCCATCCCTTGTTTG 

ATATGGGTGTCATTGTTCTAAAGACGAGCGAACTCAGAGTATTCTTTGTAGATCCAGAAGTGAGAAATGTGAGTGGGATAGTGAACTTCTCTGGTGGCAATATGTCGCATTT 

CGAAATGCTTCACCTATCGGATAATGAGGAAGATATAAGGGCCATGAAGAATCTAGAAAGTGCGGTCTTTATTCCCGAAAAACTATGGACCCAACTGAAGAGAAAGGGAGCC 

TCTTACTTAGTTTTTAAGGTCTATACTCGAGATTCACTCTTTGTGGAGACGGAAGAGGAAATCAAACGACGGCCAACTAGTAATGTTATATCCATAACAATACCAGGATTGA 

ATGATACCAAGCTACCTGGAAAGTTGCCATTTTTCTTGCGTAATGCAAAAGCGAATGAAACTCAATTTGAGGGAGGATGTGGTTACTGGAACTATGAGACCTGGCTTAGCGA 

TGGAATATCCACCAGCGGTGGTGGCAGCTTGGAAGCCTCATCGCACCCAGTGATCCTCTGCCATGCGGATCATCTGACCCAGTTTACATTTCTACTGGGAGTGAGCAAAATG 

CAGGCAGGTCTGGACACCAATGAAGATGATCACTCCTTAGATGTAATCACCAATGTCGGCTTGACCCTTTCTCTGTTGGGCCTCCTAATGATTTTTATAACTGCAGCCGTGT 

TTAAGAGCTTCCGGACATTAGCCTCCACGAAGATATTGCTGAATCTGTGTGCTGCCCTGGGTCTTCAGTTGCTCTTCTTCCTGATCCTGAGTCAAAGCCATTTGCTGGAGCA 

GCTGGAACAATCTGAGTCGGAACGATGTACTCTGGTTGGAGCCGTGATGCAATATCTACTCTTGGTTGTCTTCAGCTGGATGTTTATCATTGGATTTCTGCAGTATCAAAGA 

TATGTGAGAGTCATTGGAGTTAATCACCCACGACATTATATACTAATGTCGGCTGTGGCGGCCTGGACATTGCCACTTATACCCACCTTGTTGGTGGTTTTTCTGGAGCCCG 

GTTCCTACAGACCCAATAATTCCTCAATGGACTATCCCATCCTCTGCTATCCTTCTGGATATGGTTTAAGTCTGGGCGTCATACTCCCTATTGGCCTTATCACGGTGGCCAA 

TGCAATATTGCAATACAAAAACTTAGATGTGTGA 

(SEQ ID NO: 179) 

Start ATG: 1 (Reverse strand: CAT) 

MLLFLWWIAGYIILVSGGTNPPLYALDQPNVARCLDYCHLKNLRFLLCVDADGQLSHNPYKDCNRGFCKAEDFELEYETDEGKEKRINRVEKTRVGQKANLRDLCLNERGIP 

IRRRCQIRNGNRVWQSLDHVTCRSAPELSSSINLLAVKSPPDMISQLSNLLTQTQEKLAAADVFSISEIFDSLVKKPERNAAVVGDLMKICQKVMSTDNETLRVSADANATN 

ALLSNFEDYLDELSPTILKLNTTIFANSTEFTFHPLFDMGVIVLKTSELRVFFVDPEVRNVSGIVNFSGGNMSHFEMLHLSDNEEDIRAMKNLESAVFIPEKLWTQLKRKGA 

SYLVFKVYTRDSLFVETEEEIKRRPTSNVISITIPGLNDTKLPGKLPFFLRNAKANETQFEGGCGYWNYETWLSDGISTSGGGSLEASSHPVILCHADHLTQFTFLLGVSKM 

QAGLDTNEDDHSLDVITNVGLTLSLLGLLMIFITAAVFKSFRTLASTKILLNLCAALGLQLLFFLILSQSHLLEQLEQSESERCTLVGAVMQYLLLVVFSWMFIIGFLQYQR 

YVRVIGVNHPRHYILMSAVAAWTLPLI PTLLVVFLEPGSYRPNNSSMDYPILCYPSGYGLSLGVILPIGLITVANAILQYKNLDV* 

(SEQ ID NO: 180) 

Name: HE6 Receptor-like 2 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384028 

CAGCTAGTAGTGAATAATACTTGAGTTTTTGAGTTTGTTGGGCTGGGTTGAAAAAAAAAAATATGCACTTTAGCCTCTATTAAGCATTTTCCTATGGTTTTCACTCTCGTTC 
GGGAACTGTGTGTTTTGAAAATGTAACAATTTGCGCTTTCGTCCTCGTACATTGTGTGTATTTACAGAAAGTCTAGCCCATTTTTAGATTTTGTATGGGAATGAGTGGAAAT 
TACAAGTGTATAATATATAATACAAATACACTTTAGTTGGTAAGATTAGCAAGACGTAAGACGTACGATAGTTAATGAGGGGGGAATTGCTAGTTTTATGTTAATGCCTATA 
AGCTCAAATTTACAACTGATTATTCGCTGATCGAAAATGATCTCAAGATCTCTATCATTTGTTTAGCTTTAGTGTGGATAGTGTTGAAAGGTTGACTTGAAAGCAAATGGTT 
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agtaactaactgtcgaggaacttagaggtraacgraaacgcgtgttttaggaatgggtttgtttttcttagtgtatgtggccttgtttttrcagttatfltcgttflcctrcac 
aggtgggctctgttttaccgatctaaacgtttgaacaactttgatctatcagtgatcctaatcaatcgaaatcaatcgaactcaagactgacagaaatcaacgtaaactaaa 
gtaaaatcgtctaaaaatttatcaaaactttaTaaaacttcttggctgctttcaaggttcctcactttacgtttcattgaaaaataaagtaatatttgttattgtttgtgtt 
tggtgagtgtttgttgcatctccatttgcatcaatcattttcctgctggttttttttttctttctttgcgaaattaaagtaattcttttattgtttgtgttgttgcgctata 
taataactatttctcacatattatctacacataccaacgtgtatcgtaccgtttaatacctatatacaattctcaaatatccgccatgaaccagcaaggactccctacttcg 
aatgcgagaccacggagtgggagaccatcgagtgagagagcattgagtgggacacgccggagatttccgcataggtatccgtgggtccttcggcactgcacaccgaatcctc 
gggcgtcctggaaggttccgcctcggtgatctcgatgtcctgatgatgcttgttgctgcggtgtttggcacttagagcctgcgaatttgagatattttagtgggtcacatgg 
acaactggtggatatgtgttttctcgttgtagtttcgtgtgcaatctgttttcgataagacttatgtggtgctatctatgggacggacacatgcacacacacacacacacac 
gcacaagctggtggtttcgggttcattggggaactttgggtctaagtttatagacaaacgtgcgagaaattcttttgatttagtcgttttgttttggattttagtttacatt 
cacaacacagatattttagcaggttaggtttagaggtttatgctaaaagcaaccgaaaattcttaccttatccttgctgccctttttctacattgaacattggtggttattg 
ttttgtttggtttggttgtggtgggtttcgaatatattgtacatagtcatgacaagaacaatgatatttatagatgtattaatataagtggcaaacagtactgcgagttcac 
attggaggatcaagagaagtacagattcatagacggttagagagacgggggagagagaaagatctagggagcgtggggatgggcagcaaaaggtttggtttcggattgaaga 
ataacagcaatatcgatctgaggcctacagcccggttccccattctggccacgtcaatgtcatgtacagtgagcactatggctggtggtggactacgctgccgttgtggtga 
ccccgactcacctgcagggagaagcgacggtgacccgccttgcgtccgccacgtcgcttactgatgtgcggattgtcctgccgcagagtcttgttcttcattatctccaatt 
gggtatacagtctggaatcgaaaatgtatagaaaatagagataatagaacatatagatatgcttcgtaccttttgagttcggcgcgtatgtcctcgggactcatttcggata 
tggtcagctcaccaatgtccggatcgcccagacagagcccggcatatcctccgcccagccgctggccggcgtcgtgtgacgtacccttgaaagcatccacaggtaaacgtat 
tgatagatcatacgctagcgaacgaacctgcaagctatgaagatgcaacaaacaccgcaaaccacactcaggaagtcaaacggaaaatggagaacggaacggaaacggaaat 
ggaaacggaactcaaaggtaagagagagcaaaatcgcatggtatagatctgtgctgaagagttatgtatagttatatgtatatagtatgtgtatgggtggtctgtgcttatt 
ggtcgtgattgtatctttgggggtgctggaaatgaacgaaaagaaacaaatcaaattaaaaaagtaactcaaaaggtaaacaagatggacgatagctataagaagactactt 
agacagagaacttcagtttaagattattcagttgaggcacagaaagacagagaaagttacagtgaggcagagtcttggatacactagtttggaatttttttttctggtatgc 
atattggaaattttgaaaggaaaccgatttttgtcatttgtttgttagtagttagatgttgtgttttcttgttaaattcatgcaaaaatcaacactaaacatgcaaattaaa 
tggcgtgcttattgagttcaagttaaagtgcaatacattcaggcataaataataagagctaaaaacagtcgaacattctctgtttaaacgagacacacttgcagcaacttca 
tagcttgactaaagtttcaattcaatgttagcttattactaaaattatgttaattttctcggtgaaaaccagaacaagaactcaaaatataaccataaacaaatgaaatatt 
caaattgtattttaattagcaaaagcaaccaagcttttgagtggatgtccacattttaatacttcttgtgttttggctagttatttaacaaaaatcgtttcctcaaccaatg 
tcaataccaagtacgcatttcgatgactataatgacgaccacgcccacgacagacgaaaggcacaaaatgaacgggtgtgaatactctaagcatatgaattatattcgtttg 
gtgcccgactaggacaaaacacttcctaattacatcatatatctcaatgtgtagaacacaaaagcgtactcaaaccaaaaaaaaaaaaaaaaaaaccaaagcaaagtaagta 
aatcatatcgtttcgaactcaaatcggttacctgcttgtgctggtaccacaactttggcacaaatatcagacccaaggcgaaactattcgtcagctgggagcggatgaacag 
ggccagcaagatggcgctgggactcatttccggcaggtagacaaagcgcagaaagtagaagctggacgagaccaggaactccagcgtcagtgcggtgaccagaaattgtcgt 
tccttcaatgggacaaaaggaaattaccgattaattcatttgacttgacttaaattccccttctacttagttaatcatctaagccaatctcttgcgcttaattgataaagaa 
agatctgcaatcgatttgactgtacatcttacccggaactgggtgttggcattccgactggcgatggacaggtgcagtccaaagcacaggatgagcatctcgctggtctgcg 
tgaccagttcccacttgagcggatggcaggtgttcgtgtccgcctccctgaggctctccagctgcgcactttcgagaagatccaacgacgaggccgtaaaggcggccatgta 
gcagatgacggcaaagaccatggtgcccagatacttgagcaggtccacgtcgcgcagcacccagcgatgcgctttccgggtgcgaaagtccaccaggtggcggtacaacttc 
agtatgatggcgccgtagcaggtgatgaagcccagctcccgaagccagggctccagaaggcagcgctcggtggatgcgggaaagaaatggacggcaacctaaagatgaatat 
gtaaatacacacagaagttgaaggtgaccatcgccaggagcactcacagatgcataaagtaaaacaatgcccagcagtatcgtctccagcacagtccacataccagacgcaa 
tggcctgaaaatgatgaaaggattactcaaagtttcccaactttaggatatgtaaactaaataccttgcactttcgctgccggaagacaatcaccccgaggacaatgcagca 
caggatgcaggcgcccagtacgatggccaccaggaggcgcagacaggcgtccacattgagcacctcctcctcctgaaaggtcagacagacgccgttgctatcgcagttagtg 
catccgccgggacacggaatgcacgagtagttgtcgtagccctcggacagttccactcgatcgccacggaatccctggagcgtggaattgggcaggtagtaggattcccggc 
acagacaggtgtacacatcccgggtggcggcgggtttgttctcggtgagcaggcagaaggtcgtattccgatcgcaactacagagatatggtattatatatattatccagaa 
ctaatgtcacgaattatcgtataaagtttggccacaactcacccatgacggcgaccgaaaacttcctccaggccatcgttgcacacatcctcgtcggcggcaatgaaggcag 
ccgccacaactctgaaagcatttagcataaattacaaagttgtgccaaccagctgcacatcgatgcgactcacttgatcctgtgctcgctgaaggatatgcgaaatggccag 
agccagcgacgcgtcaacaggttgcagtcgcgaaatggagcgccccactcctgggtgccgtccgtgctgttccaggccgcactgtagtcgggtaaaataatacatttatggg 
caactttccaagattatacggcttatggtctattcaaattaagcgatgcacattatttcggcaagctacggtgcatattgaatttaattgtatttatcttttagagtaatta 
gtatttgcatttcaaagccataagataattgattttaatttgttgctaattaatgtcgcacacttttttgggacacaaccagtcttcgaaacttaccccaactcccgccaat 
aggaacccaaatatgccttattgtgcacctgatccggtggtatctgcaataagttaaatcgattagcccatggccccattatcgctgattagcgagtgcataccgtaacgtt 
atccctgaatttaacgaccaccgtgtgaatgctgggcgcggcaatggcgaatgccagcagattgtcgtcctcgttgagggcgtagctggcgattccattggcccggcgactc 
acctccgcacgtgtcggctcatcgatggcaatcggtgtgttggcatttgggaatttgcccgataaaaattcaataaactggcgtgcggcatcctcggcgatgtcggtgtagt 
cgtaaaactgttgaattcataaaaccagatcacaattaattgcgttgtgtcataaataatcatttcaatgtatatgaaatgggcatataaatatcatttaaatcgaattcga 
tttaaatacacaaagaaaaagttcaataaaattaacacagcataaatttacgctttgctaaactcaatttatttataccacactatataggaaacggaaaacgtgtgctttc 
taattgaagatattttcggaaattgcgaaaaaattacgtatatgccacgtatattggtcgccacgtgatttaagtgaccatgcggcacggccacttcccccgacttgggacc 
attaattattcacctggcgcacaggacacgtttcacagcaacaccttgcggcaagagtgcgaaaagttttgcttcgtttcgactgtgaaataataggatttaataccctcgc 
cgtgctgtgggactgcgggaagggtttttaatccaagttcgttcagctcctaaccgaccaaatgtttatacataatttacacgttacaagaaactctcgagatctacgcacc 
aaagtgtaggaaatggtagggccactttgatgacatcattatggctttgggaaacaaaagtaaccacagaagggagcagggagggggttgctggtagcggggaaaaaggaca 
cgttgtttgggcgtgagtcatgtagttgggctggcggcaccttttggcttttaaaacaaatatttcccattagctgggctttcggtgatgagcgaccacattgcgtatacgc 
caccgaagccaccaaataaattgccaacaagtaacatcgatatacaaacgtttgctggatgctcctctaccatgtgtgtgtgagtgtgtttcttcattttttttcctgccaa 
acgtgagccaaaatttacacatcgaggcaaaagcaatcagtataaggcagctggagaaatggcacgagacctgctcccagataaagtcattatgtagtgctcctagcactcg 
cgcgcagccataatcataaacatttggtatactcttgcatgctagagaaatattgtagttgggtaagtgttttcagatattaaactaagtaaggaaatatttctatttttac 
accattattcattgttttttcttcaggagttatctgaactttaagtaatcccaccacagttcttggaagcagtagaatgtcttcattaccgccaaatccttttgatatcctt 
tttcccaaatgtccgcaattacacacataccctcgcactagcgcccctcgttttttcagggtataaatgtatggggaaaacacttgtaaatagcggcagcagccac 

(SEQ ID NO: 181) 

Exon: 5938.. 5903 

Exon: 5607.. 5368 

Exon: 5307.. 5249 

Exon: 5009.. 4890 

Exon: 4827.. 4747 

Exon: 4669.. 4321 

Exon: 4260. .4192 

Exon: 4130.. 3729 

Exon: 3586.. 3392 

Exon: 2213. .2086 

Exon: 2027.. 1916 

Exon: 1543. .1523 

Exon: 1198.. 1001 

Start ATG: 5938 (Reverse strand: CAT) 
Transcript No. : CT35779 
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ATGGTCCCAAGTCGGGGGAAGTGGCCGTGCCGCATGTTTTACGACTACACCGACATCGCCGftGGATGCCGCACGCCAGTTTATTGAATTTTTATCGGGCAAATTCCCAAATG 
CCAACACACCGATTGCCATCGATGAGCCGACACGTGCGGAGGTGAGTCGCCGGGCCAATGGAATCGCCAGCTACGCCCTCAACGAGGACGACAATCTGCTGGCATTCGCCAT 
TGCCGCGCCCAGCATTCACACGGTGGTCGTTAAATTCAGGGATAACGTTACGATACCACCGGATCAGGTGCACAATAAGGCATATTTGGGTTCCTATTGGCGGGAGTTGGGT 
GCGGCCTGGAACAGCACGGACGGCACCCAGGAGTGGGGCGCTCCATTTCGCGACTGCAACCTGTTGACGCGTCGCTGGCTCTGGCCATTTCGCATATCCTTCAGCGAGCACA 
GGATCAAAGTTGTGGCGGCTGCCTTCATTGCCGCCGACGAGGATGTGTGCAACGATGGCCTGGAGGAAGTTTTCGGTCGCCGTCATGGTTGCGATCGGAATACGACCTTCTG 
CCTGCTCACCGAGAACAAACCCGCCGCCACCCGGGATGTGTACACCTGTCTGTGCCGGGAATCCTACTACCTGCCCAATTCCACGCTCCAGGGATTCCGTGGCGATCGAGTG 
GAACTGTCCGAGGGCTACGACAACTACTCGTGCATTCCGTGTCCCGGCGGATGCACTAACTGCGATAGCAACGGCGTCTGTCTGACCTTTCAGGAGGAGGAGGTGCTCAATG 
TGGACGCCTGTCTGCGCCTCCTGGTGGCCATCGTACTGGGCGCCTGCATCCTGTGCTGCATTGTCCTCGGGGTGATTGTCTTCCGGCAGCGAAAGTGCAAGGCCATTGCGTC 
TGGTATGTGGACTGTGCTGGAGACGATACTGCTGGGCATTGTTTTACTTTATGCATCTGTTGCCGTCCATTTCTTTCCCGCATCCACCGAGCGCTGCCTTCTGGAGCCCTGG 
CTTCGGGAGCTGGGCTTCATCACCTGCTACGGCGCCATCATACTGAAGTTGTACCGCCACCTGGTGGACTTTCGCACCCGGAAAGCGCATCGCTGGGTGCTGCGCGACGTGG 
ACCTGCTCAAGTATCTGGGCACCATGGTCTTTGCCGTCATCTGCTACATGGCCGCCTTTACGGCCTCGTCGTTGGATCTTCTCGAAAGTGCGCAGCTGGAGAGCCTCAGGGA 
GGCGGACACGAACACCTGCCATCCGCTCAAGTGGGAACTGGTCACGCAGACCAGCGAGATGCTCATCCTGTGCTTTGGACTGCACCTGTCCATCGCCAGTCGGAATGCCAAC 
ACCCAGTTCCGGGAACGACAATTTCTGGTCACCGCACTGACGCTGGAGTTCCTGGTCTCGTCCAGCTTCTACTTTCTGCGCTTTGTCTACCTGCCGGAAATGAGTCCCAGCG 
CCATCTTGCTGGCCCTGTTCATCCGCTCCCAGCTGACGAATAGTTTCGCCTTGGGTCTGATATTTGTGCCAAAGTTGTGGTACCAGCACAAGCAGGGTACGTCACACGACGC 
CGGCCAGCGGCTGGGCGGAGGATATGCCGGGCTCTGTCTGGGCGATCCGGACATTGGTGAGCTGACCATATCCGAAATGAGTCCCGAGGACATACGCGCCGAACTCAAAAGA 
CTGTATACCCAATTGGAGATAATGAAGAACAAGACTCTGCGGCAGGACAATCCGCACATCAGTAAGCGACGTGGCGGACGCAAGGCGGGTCACCGTCGCTTCTCCCTGCAGA 
AAAAGGGCAGCAAGGATAAGGCTCTAAGTGCCAAACACCGCAGCAACAAGCATCATCAGGACATCGAGATCACCGAGGCGGAACCTTCCAGGACGCCCGAGGATTCGGTGTG 
CAGTGCCGAAGGACCCACGGATACCTATGCGGAAATCTCCGGCGTGTCCCACTCAATGCTCTCTCACTCGATGGTCTCCCACTCCGTGGTCTCGCATTCGAAGTAG 
(SEQ ID NO: 182) 

Start ATG: 1 (Reverse strand: CAT) 

MVPSRGKWPCRMFYDYTDIAEDAARQFIEFLSGKFPNANTPIAIDEPTRAEVSRRANGIASYALNEDDNLLAFAIAAPSIHTVVVKFRDNVTI PPDQVHNKAYLGSYWRELG 

AAWNSTDGTQEWGAPFRDCNLLTRRWLWPFRISFSEHRIKVVAAAFIAADEDVCNDGLEEVFGRRHGCDRNTTFCLLTENKPAATRDVYTCLCRESYYLPNSTLQGFRGDRV 

ELSEGYDNYSCIPCPGGCTNCDSNGVCLTFQEEEVLNVDACLRLLVAIVLGACILCCIVLGVIVFRQRKCKAIASGMWTVLETILLGIVLLYASVAVHFFPASTERCLLEPW 

LRELGFITCYGAIILKLYRHLVDFRTRKAHRWVLRDVDLLKYLGTMVFAVICYMAAFTASSLDLLESAQLESLREADTNTCHPLKWELVTQTSEMLILCFGLHL3IASRNAN 

TQFRERQFLVTALTLEFLVSSSFYFLRFVYLPEMSPSAILLALFIRSQLTNSFALGLIFVPKLWYQHKQGTSHDAGQRLGGGYAGLCLGDPDIGELTISEMSPEDIRAELKR 

LYTQLEIMKNKTLRQDNPHISKRRGGRKAGHRRFSLQKKGSKDKALSAKHRSNKHHQDIEITEAEPSRTPEDSVCSAEGPTDTYAEISGVSHSMLSHSMVSHSWSHSK* 

(SEQ ID NO: 183) 

Name: Metabotropic Glutamate-li Ice RECEPTOR 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384249 

CTCGTGACCTTTGAACCCGACGCTGGCACTTTGAAACGGAAACGGAAACACACACTCACACACACGTAATTTGGTTAGTCTGATATCGATGCACTATATGTTGGCTAATTTA 

GACGGACTTTGGTTTTGACCAATCGCCAGGGTCCTACATATATGGTTTTGACTATATATTCTCGCTATGTTTAGTTGATTATACGATTTAGGGGTTTACTTTAGGTATTTTG 

AGCATTTTCAATTCATGCGATTGTCGTTGAAATAACAATTGGGTTTTAGCTTTGCTTTAGCTGCCGTGACTTTTGTTTGTATCTCTATAAATTTATCAATATTCTCCGATTC 

TAATTGCGTTGGGGGTGTGGTTGTTTATTCGCAGGACTTCTCACGTAGTTGTCTATAATTTGCGGTCTCTTAATTGGCACAGTTTTAGTTTTAGCTTTAGCTCGTCGATTTT 

GCGAATCCCTATGTACACGCATAGCTCAGTGAGTTGCAGTGATTGCCATTTTGAATAATTTATTTGTGGTTATTTCTTTTCGTGAGCAAAATGCTTTACGGCTTACAGATTT 

TCGATTGGCGTACGTACATACATATTACATATTCATATACATTTATCGAAATGAGTTTACAACCTGTGCTTAATTTAGTCGTACAAAAATACTCAGTATTTACAACAAGATA 

TAAACGACATAAAATCATAAATAAATAAGTTATCCTATACAAAGTCCCGATCTGTGCTCCAAGGAACCCGGCAAACTCGAATTGAGATTAAATGTTTTAAAAACGAAAACAA 

AAATGAAAAAATGTGAAGAAAATGAAATGGCACGCTCCACAGATTGAATAATGTCAACATTACGAGTAAATCAGCCGACGCATTTGCCGGCATATTAATTTTACAAATTAAT 

GGAACACTTTAATTGCATTAAATTGATTCCACCCATCCGGCTGGCATCGATTGATTCGATTCGCATCCTTACACCAGTCGCACCATTTGCACCATTTGCACCATTTACACAT 

GACTCGAGGCCGTCGAGGCGGCGGCATTGTTGTTCTTGTTCCATTTGTTGATGTACTCCATGGGTATTTCAATCTGGATCTCCTGCACATCGTCGCCCGCCAAGCAATCGAC 

ATTGGATTTGCCCTTGCCCGCATCCTTGCCCGTATCCTTTGCATCCTTCGACTCCCTGCCATCCTTGGCGGCAGCTGCCAATTGTCCGCCCAGGCGGGATCCGCCCGCCGCC 

AGCCGCTGCTGCTGCTGGGCGATGGGGGTCAGCAGCGCCTCATCCCTGTATTATAAGTTGTCCACCATCTGAGACTGGCGGCGATGCGAGGGCGTTGTCGAGTTAAGGTAGC 

TCTCCGAGTAGACGCTCTTCGCCCGCTGGTCGTAGCGCAGGCTGGAGCGCCGGATGTCGATGTCCAGAGGCAGGTGGCGATCCTTCCACCAATTGCAGAACAGGCAGAGCAG 

CGTGTTGCGGAACGCCTCCCGGAAGTCGCGGTTGAAGTACGCGTAGATCAGCGGGTTTAGCGTTGAGTTGAAGTACCCGATCCAGAAGAGGATTGAGACGACTATGTCCGGC 

ACTTGGCACTCCTCGCAGGTCATGGAGAGTGTGTACCTAAAATTAAAACAATATAATTATTAGCTTACACTCATTATCTTTAACTTAATTTAAGTCAATTTTAAATTAGTTG 

CACAGGTAATGCAATTTTATGTACTTTTTTTTTCCCGGCATTGCATAATATCTATTTTTTAATAATACAAAAGCAAATATAAAAGATCAAATGAATATTTTTGTTTCATAAA 

AGTTAAACCTCATAAACGATCTATATTTGAATACTTTGATCACCATTTGAAATCTTTCTGCATCACTTTTTGTCTGTGTAGCACATAAAGATGAGATATATATACCCATAAA 

TTCCAGATGCATGAGTGGGCCAGGGCTTTGCATAAATGAGATTATACTCTGGGTGGTGTGAAAAACAACAACCGCCGAAACAATGGCGCACGGGCCGTATGAGTAATATTTA 

ATTATGCATGCTTGACATTAGATAAATTATGCCTGGAATCCCATGTCAGCGGGATTGGCCTCCAACTCTCAACCAAACGTTCAACCTTCGACGGCTGGTTGGTACTCACCAC 

AGGAAGAAAGGCAGCCAGCAGAGGATGAAGGTGCCCATGATGATGCCCAGCGTGCGTGCGGCCTTGTGCTCCCGCTTCATTTTGATTAAGTGCTTGTCCTTAGTGGGGGTGT 

GCTCCTGCTCGACCTCGTGCAGCGTCAATGTTTTCGACGACCCGGATCCGCTCAGCGCCTCGCCTGCCGTAGGAGATAAGATGGAGATTTGATAAAAGTCCAGGCTGGAGAT 

GGGCCACTGAACTCAATGCACTCGCACTGCATCACTGTCCAGGATGCTTGTAGAACAAACTAGTGATCTTTCACGCTCACTACCTGTTTGCTATTCAGAAAGTAATAAGACT 

TTCTGTCCTGGTAAATTTATATTTTTGAAGTATTATTTCCACTTAAAACCCATTATATATATTCAAACAAACATTTCTTTACAGCCAAACTAGGTAAGCCTTGTTGTTTACA 

ACCCAACTTTGACTCCAAACTCACCTGATGGCTGCATGGATGGTCGGTGCATCAGCATGGCATTGCCGTGCCGCATCATCAGCTGCTTCTCCTGCCGATTGGCTTCCCGGAA 

GATGGCCAGGTAGGTGAATATCATAATGGTGCAGGGTATCCAGAACGATATGGAGCTCGAGATGACGGCGTAGTACTTGTTCACCACGAACGAGCACTGCGTCGGATTCTGG 

ATGACGAACTGCTGGTGCTGCGGCGTGGTGTACCAGCCGATGAAGATGGGCAAGAAGGAGAGCAGTGCCGGCGATATCCATGTGTTTAGCAGCATAATGCCGACCACGCGTT 

TCGTCATGCTAATCGGATACTTGAGGGGCTTAACAATAGCATAGTATCTGTGAAGTGGGCAAAACGCCAGGTTATTGGGCCCTCATATAGCTATAGTTCTGGCTGTGGCATG 

CAACTTACCTATCCACAGATATGCAGCATAAATGCAAAATACTCGCTGTTGAGAAGTAGACATCGAGGCTGTTCCACAAATCGCACAGGAAGGGGCTGAAGTTCCACCTGAA 

AGGGTCCAATGAATGGAGGCATTAGTATCGCGATGCTTACCCAATAACGAGGCAATAAACTTTGGCCTTTACTTTCTAACGTCATTGTATCTGATTATCGAGTCCATGAATT 

TGCAGTATATATTTTTGTTGACTCATTTCTGGTTTTTTGCCTCACCTAAAATGCCACGCGAAATACTTTTATGAAGTCATCGCTTTAAAACTGATTTCCTGGTTTTTAGCTG 

TGCAAAAAATTTAGTTTTTCACGATTAATTCCCTTTGTGACAACACAGTTATTTACGTACTCTGATTAAAATCATTTGATTGTGGGTAAAATTGCGCTAGTTCTAATATTTA 

CTACCATTTAAAATACAGATATATAGGCCATAAAAACTCACTAGATATGCGCAATAAGGTTGCTCTAAATTTAAACGGCATTTTACTGCCAATTCCTGAATGTAGGTCTATA 

AAAAACTCAAACTGACTAGACACACTCTCTTTTTCGCCTATTTTCGTTATCAGGCCGCCTGATTTATGTGTTTATCGATTTCATCTTAGTTTCGGTTGCCAGGTGACATTGC 

CCCTGAACTGCCATTCTGGAGGGCCAAATTAAAAGCTTAGATGCACTGTGAGACCAGAAAATAGGCCATAAGGCCCGAGGTGAAAAGTGGCAAGGAAAGCGGAGGCGACTCT 

GTTCGCTGCATGAAAAACTCATTAAAACTTGCGTCGTCGCTGTTCGCCTGTCGCTTCTGCCGTCGCCCTTGGTTGCCACTCCGCATTTTGGCCATTTCGGTGAAGGCGGATA 

CCCGGCCTTTTTGGGGCCAGGAGCCAGTGGAATTTATTATCGGGCGCCCCGTTTGGCATCAGTTTCAAAAACTTTTCGACTATCCAACGAGGCTGTTTTTGCTCGCTGGTTT 

TTGGTGCTTTACACTACATTGCCATTTGGGGCGGGCGCGCCATCGGTTTTGACACTTAAGCCATGAAATGGCAGTTCCTCGTGCTGCAATTATGCAACTGGCAACTGTGAAG 

GTAAAATGTCTAATGAATGCCAAGTGGCGGCGAGTTGGCTGTTGCCAGAACCAAACCGATGCCAATTTATATGCAACTCGCGCTTGGCCAAAGTTTTTC 

(SEQ ID NO: 184) 

Exon: 3243.. 3209 
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Exon: 3131.. 3033 
Exon: 2959.. 2601 
Exon: 2303.. 2126 
Exon: 1604.. 1298 
■Exon: 1246.. 1001 

Start ATG: 3243 (Reverse strand: CRT) 
Transcript No. : CT21650 

ATGGACTCGATAATCAGATACAATGACGTTAGAAAGTGGAACTTCAGCCCCTTCCTGTGCGflTTTGTGGAACAGCCTCGATGTCTACTTCTCAACflGCGAGTATTTTGCATT 

TATGCTGCATATCTGTGGATAGATACTATGCTATTGTTAAGCCCCTCAAGTATCCGATTAGCATGACGAAACGCGTGGTCGGCATTATGCTGCTAAACACATGGATATCGCC 

GGCACTGCTCTCCTTCTTGCCCATCTTCATCGGCTGGTACACCACGCCGCAGCACCAGCAGTTCGTCATCCAGAATCCGACGCAGTGCTCGTTCGTGGTGAACAAGTACTAC 

GCCGTCATCTCGAGCTCCATATCGTTCTGGATACCCTGCACCATTATGATATTCACCTACCTGGCCATCTTCCGGGAAGCCAATCGGCAGGAGAAGCAGCTGATGATGCGGC 

ACGGCAATGCCATGCTGATGCACCGACCATCCATGCAGCCATCAGGCGAGGCGCTGAGCGGATCCGGGTCGTCGAAAACATTGACGCTGCACGAGGTCGAGCAGGAGCACAC 

CCCCACTAAGGACAAGCACTTAATCAAAATGAAGCGGGAGCACAAGGCCGCACGCACGCTGGGCATCATCATGGGCACCTTCATCCTCTGCTGGCTGCCTTTCTTCCTGTGG 

TACACACTCTCCATGACCTGCGAGGAGTGCCAAGTGCCGGACATAGTCGTCTCAATCCTCTTCTGGATCGGGTACTTCAACTCAACGCTAAACCCGCTGATCTACGCGTACT 

TCAACCGCGACTTCCGGGAGGCGTTCCGCAACACGCTGCTCTGCCTGTTCTGCAATTGGTGGAAGGATCGCCACCTGCCTCTGGACATCGACATCCGGCGCTCCAGCCTGCG 

CTACGACCAGCGGGCGAAGAGCGTCTACTCGGAGAGCTACCTTAACTCGACAACGCCCTCGCATCGCCGCCAGTCTCAGATGCAGCAGCAGCGGCTGGCGGCGGGCGGATCC 

CGCCTGGGCGGACAATTGGCAGCTGCCGCCAAGGATGGCAGGGAGTCGAAGGATGCAAAGGATACGGGCAAGGATGCGGGCAAGGGCAAATCCAATGTCGATTGCTTGGCGG 

GCGACGATGTGCAGGAGATCCAGATTGAAATACCCATGGAGTACATCAACAAATGGAACAAGAACAACAATGCCGCCGCCTCGACGGCCTCGAGTCATGTGTAA 

(SEQ ID NO: 185) 

Start ATG: 1 (Reverse strand: CAT) 

MDSIIRYNDVRKWNFSPFLCDLWNSLDVYFSTASILHLCCISVDRYYAIVKPLKYPISMTKRVVGIMLLNTWISPALLSFLPIFIGWYTTPQHQQFVIQNPTQCSFVVNKYY 
AVISSSISFWIPCTIMIFTYLAI FREANRQEKQLMMRHGNAMLMHRPSMQPSGEALSGSGSSKTLTLHEVEQEHTPTKDKHLIKMKREHKAARTLGIIMGTFILCWLPFFLW 
YTLSMTCEECQVPDIVVSILFWIGYF^STLNPLIYAYFNRDFREAFP^TLLCLFCNWWKDRHLPLDIDIRRSSLRYDQRAKSVYSESYLNSTTPSHRRQSQMQQQRLAAGGS 
RLGGQLAAAAKDGRESKDAKDTGKDAGKGKSNVDCLAGDDVQEIQIEIPMEYINKWNKNNNAAASTASSHV* 
(SEQ ID NO: 186) 

Name: Beta 2 Adrenergic Receptor-like 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384249 

GTGAATCGTTTTTGCAGCGAATGGCGAAATTTCCTACAGCTTGCCAATTGTTTTGGTTGTTCGCAGTTCATTCTTTTTACTCATTGGTAAGATTGTCTTTCTGCATGAAAAT 
TCATAACGTCAGCTACACAATTCTCTTCCATTCAATATTTTGTCCATACATAATTTCGATTGCGTAATTTGTAATTTTCAATAACTCTGTTATGTAGATTTGGTTTGCTAGT 
TTTCTAGGTTACATATGTATGATACTTAGAGCTAGAGGTAACATTATGGGATTGCACTTTTTCCATGATTTAGGCTAGTTCTATCTGTAACAACATTAAAAAGTGTGTTTAA 
TATGAAATAAAAGCAGGGAGTTCATTCCGCTCAGGATTTGCAATCCTAATATACGATGATGACACATGTGAATAATCCAATTTTGGATCGTATTAATACTCCAAAAATCGAT 
GGAAAAAGTGTTTCGCCTTCGACAAGCCGAAGTGGTGTACATAATTCTTAGATGATTCATTTGACAACATTTCACTTGAAGTTGGAACTTTTCTACGCAAAGAGTCGCTCGA 
TTTCTAATTGGCTTTCGCTCTGCCATATATGTATAGTTATTACGTATATCTTCTGTTTAGTTGGACAAACAACACTTATATATGGATAGTATAGTTCCGGGGCAGCATTGTC 
CCTGCAGCCGCGGCTTGGCGGGCTTAGAACACAACTTTAATTTAAATTCCCTATCGTTGCTTAAACGAAATTGTACACGTATACATAAACACACATCGAGTATTGCTTAGCT 
TCAAGGATTATGCTAGCTATATCCTGGCACAAATGCCACTGGTATGGCCGTTGCTCTTGTTACTTCTCTTCACTTCAGCTGGTGTTTTCCCTCCTTGGGCACATATACATGT 
GCGACGATTCACTCAAGTATTAAATATCACTTATAGGGGCATTGTTCCGCATCCACGAGGGTGGACGAAGCAGGAATGTGGCCGGGATGCAGACGCTATCCGGTCTAGACGT 
AGTAGGCGTTGTACGGATTTCGTTTCTCCAGACAGGGCAGCACACACTCCAGCGTATTGCGGAATGCCTCGCGAAAATCGCGATTAAAGTAGGCGTATATAAGCGGATTTAG 
CGTGGAGTTGAAGTAACCGATCCAGAATAACACCACCACGAGCACATCGGGACATGGGCAGGCCGGACCGCAGAGCGATGTGATGACATACCTGGAAGGAGGAGTGAGAATC 
ACATGAATATGAGTCACCAGAAAAAGAAGAATAAGAAGGCCAACAATGGATTACGGATGGCGGTACCACATCCGTACGGCCTATCTACCGTCACTCACCACAGAAAGAAGGG 
CAGCCAGCAGAGCAGAAAGACGCCCATGATGATGCCCAGGGTGCGGGCGGCCTTGTGTTCGGCCTTCCATCCTTTGGCCGGTCGCACGGAACCGCCTGTGGGGCAGATAGAG 
AGTTTAGAAGGGATATGAAAGCTGGCAATTGGCTGATTAAGTTCAATTCGAATCGATTCGTTTCGATCCGCCAGTCTGTTTTTTTTCTTTTGTTGGGTCATAGCAATCGACA 
GTTTTCTGGTTCATTTCTGTCTGTCTGCGAAAGCACTCGAAAAATGGGTTTCTTTTTAAGTAGTCTATAATATGCTTAATGTGGTATTTGTTGTCTACCTGTTGGATGTGTT 
TGTTTTATGGTTTATTGCTTCTAGGATGAAATTGTTTATACAGCTAATACTGGAAAGTATAGGTGTTTGCTTAAGCTTTAATCGGCGATACCCTTTTATGTTAACTATGTAT 
GTAAATTAATTGAACTTCAATCAATTAAAATACAGAAAATTTCAAACAAACAAATTTTAAAGAGGTAACTAAACTAATCTTAAAAATTATTAAATATTTTCTCTGTGTGTAA 
ATTGCCAAGAAATGCGGTTTATGTTCGTTTTATTGTGAGAGAAAGATTATGCCGGCATGTTGCGTATACGCCATGTGTGTGGTATCCTTGTTTAACTTTGACATTAGTAATA 
CTTTTGCTGTTATATGTTGTTGGCTATCGATGGCATTTCAAACCCCACACCGCCCCGCTCCCTGCGTCCACCTTGCAAGTGACTGTCATTAAAGTGCTGGAAAGGCCAACAT 
GTGTGCAGTATCTTGGTGCTCGGGCGTGTGGGTGTGCGGGAAAAGCAATAAAAACGAGGCAAACAATTGAAAATATTTACACAGGACACTTGTTATCACAAAGACATACAAT 
CCCCCGCATCCATCCAGTCCAGTGCGCCCGCCCCAAACATAAACATAAACATCACTGGCGGCCAGGAGGAAAAACAAATCAGACAGACAACAGTGAGCAACAGGAAGCAACG 
GACCATCGCCCTTGTTCACCCATTGCCCATCGCCCATTGCCCATGTTCCCCCATCCAGTTTGCCACCTGGATCCCTTGATGGACGCCACTCGGTGATGGCCACAGCCAGGTG 
AAATCATAATGCAAAGGCTGTGTGGAGGGAGGCGTAATAAAGCAAAGCAAACGCACGTGCAAACGGAAGCAATTATGTACAGCATGAGAGCAGCGAAAGGTGAAGGATTCGG 
ATACGGATTCGGAGGCAGATTAGGTGGAGGTGGAGACAGCATGAGGTTTCGATGGTGTAACACTCCGGATTCAGTTGCCTAAGCTGGTTAGCTTGGCATCCGGATAAGGAGG 
TGCCAAAAAACTGGCACTTCTGTGGGTATTTGCCACTAAAGCTGGGGGCTACAAAAATTTGAGGTAGTTCTTTTCGGTTTCATTTCACGGTTGATTTAATGATACTTAGGTT 
AACCACCGTTGAAATGATTGGGATTGCAGAAATTCAGGGATATCTATTCTAGTATGATTTTTGGTAAGAACGTTAACTAGTTTCAGGTGTATTCAAAAGCTTTCAATTTTAA 
ACATAATAGGTTTAAACATATACGGAACAGTAATATTGATTGTTTTTTGACTGACAGGATATTTCATATGAAATAAATAGATTTTTGATATGTTTGGTCTAATGCAGTGGCA 
TAAAATACGAAGTGAATGGGACTTGATTTGATTAAGATTTTCTGATGATTATATATGTATTGATATAGAGAAACTATAACTATTGGTGTTGTGAAATTAGTGAATTCTGAGT 
GTGAAATGTATTATAATATAATGCACAAGTTCTGGACTGTGGGAAATCGAATGATTCAGATGGATAATGGGTAGCAATGAGATAGGTGCTGAAGCTGAGACAAGCAGAAAGA 
CAGAGATAGAGAGAGAAGCGGGAGAGAAAGAGCGAAACGTAGAGACTGGGGGAGGAGAGGAGGGGAGATACGACAAGACTCACCGGTTGCACCACAGTGACTGCATCCGGTT 
TTGAAGCGGCGCTTCCGGTTCTTGCTCTGCTGCCAACATGCTGTGACCGCCGCCGTTCGCTTGTGCCAGAATCGAGTGGAACTGGACGAGGAGGCGGGCCTCTGGCCAAGAT 
CTGGAGTTTGCTCGTGAATACTACGTTTACTACTAGTGGTTACTGTGGTGTCGATTGTGCTGTCGGCGTTGGCGGTTGCGGTTGCGTTTGCGGTGGTGCCGGTGGTGCTGCT 
GATGTCGGCGCCGCAGAGCGGACACTTGGGCAGCGGACTGGGCACCGCGGCGAAGGTGATGCCGCGGGGCCGCTGCTCAATGATCTCCGCCTCCTCCTCATCCACAAAGATG 
CAGATGCCGCCGCGTCGGATCAGCGAGGATTTATTGGAAAATTCGTGCTCTGCTTGCAGTTGATTGTTGCATTTTCGTACGATTTCGATTGTCAATTTGGCCATGGCATTTT 
TGGGGTTTTATTGGTTGGATTTGGTTTTGGAGTAGAAGTAGTTTGTTTGTTTGGTTGGACATAATTTGGTGATTTCGATTTACATATATACACATACAACACACAACGAAAT 
GGCGTGGAAGAGAGATAGATATAAAGAGATAGAGACAAGTAGTAAACAATGTACACAGGGTATTTCACAGGATACGAAAGGACTTCAACTTCTGAGATCACATGCAGCCATG 
CACCGGCAAACGCAAAGGATCGGGATCGAAAAGGGAATGGATTCAGATTCATTATCGTTGGCTCCCTGTCTCTGGAAGTAGGTCGTCCCTTTCTGTCTTCGTTTGCAGGTGC 
AACTAACTAAATGCTATACATTATATTATATATATATAGTAAGCTAGTTAACTAAAGTTTCATTCGAAGCATTTGAGTATTACTTTTCCAATGAGGGTTTATAGGTTCTATA 
CATTTTAAAACACTGTATGATTTTTCAGAAATTCGCACCGCTTTTGAAAGGAACTTAAATACACAATTCACAGTTTTCTTCAACGTAACAACTATATTTAAATATGGTAAAA 
GTTCGTATCTTTATTTTAAGATCAAAATTCTAGTAAGACCAGATTCCTTATGTTAAACCCGAATTCAGGCCAATTACTCCTCAAAAAAACAAGGACAAAAGAATGTGCGAAA 
AAGTTAAGTCAAACATTGAAACACTTGAACTTTTCGTGGAAAACTGACAAAAACAGAACGAATTTCTCTGGGGCAGA'f ATAAATTGTCTTAAAAGAGCTACTTAAATCAAGC 
ACGAGCTATACAGTCTACACAATGACCAAATGGATGACTACTCACTTGCGGAACTGGTGGAAACGCGGCCCATCGGGATGTAGCTGCGACTGTCGCTGTTCTCCTGCGAGAA 
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GTTGATGACATTCACGTCGTAGGTCACCACGGACGCCTGTCTCTTCCGCACAGTTTGGATGCCATCGTAGCCTGCCAGGATGCTGCAGCGGCCTTGGCGGCCACCCACTCCT 
CCAACGCCACCAGCCCCTCCATTGCCGGAAACGCTCACGTCCGCATCGCCGGCGCTATCTTTGCGCTTGAACAGTGGCGACGTGACATCCCGGTGCTCATCGCCGCATCCCT 
CAACCGGCGACAGCGTGGAATCGAGTATGATCTCCGGCTTGGGCGATATGTTGGACTGTGAGGCCATTTGGGGATTTGGCTGCGGCACTGGAACGGGCATAGGCAGGCTGGT 
CACCACATCCAAGGACACAGTGGATGTGTCACTGGGCGTATAACCAGGTGCGGCGGCATGCGGCGGCGGCGATAGGAAGTACACATTGATGGACGCCGACTGCGCGTTGGAA 
GAGGCGGAAGAGTCCATGGCACCTCCGTTCGGCACGGAAATGGCCGGCGGAGCCGGTGTGTTCGGTGGGGACATCAGATAGTTGGCGCACACCTGGTCACTGAGCGTGCTCA 
GGCGCTGTGGAGCACCAGCCGGTCGGCCCAGCGGAAAGCTCAGCGAGCGAAAGATCTCGGCCAGTTCCGAGCTGCTGGCAATGAGCAGGCCACTCGGTTCTGGCAATTCTGG 
CGGCGATAACGATGGTGGCGGCGTGGCCGGCTGCTGGGGCGACTCCCCAAAGCAATCGGCCAGCTGCGCCAACTCGTCGTCGCTCCGAATCCTCTCGGGCGGTGTTTTCAGC 
TCACTGAAGTCAATCAGCTGTACCCCGTCGCTGAACTGCTGGATCACACTGGCTCCACTCTGCTCGCTGAGCACATTGATGTCCAGCAGGATGTCCGGCTTGTTGGGCGTGG 
AGTCCGTATCGCTGTGCTGCCGCTGGCGTCTCTCAAGTTTGCGTTCCTTGCCGCCACCGCATCCGTTGTAGCTGTGCGACAATTTCCGGTTCCGGGAGCGCCGGAAGAAGCT 
GGATCCCTCGGAGTTGGCCTGATGTCGCTTCTTTTCCAGCGTAAAGCACTCGGTGCTCGAGCGTTTCGTCTTCGACAGCAGGGACAGAATGTTGAAGCCCTTCTCCTTCCCG 
GGCACTGGCTGCTCCGGCAACGCCCGCGATTCACTAATGGTGCGCAGGAACTCCGAATCGCTGGAGGCGAATGCCTTACGCCCACAGGCCGCTGCGCCCGCCTTCAGCCTGA 
TGAAGTCCGTTTCACTGAGGGCAGCGTACTTTCCTCCGCCTGGGCCACTGATAATGGTGCCGCCCGTCTGGCGCTTAAAGTACAGATAGTTGTCCTCTATCAGGCGCTTCAG 
CTCCTTGTTGTCCTCGGCCAGCGGCAGGTGGGTGTCGTACAGGTCGTTGCCCTCGCTCTCGTCGCTGTTGAGACCACCAAAGACCCGGTACGCAGTGGACTCCGGTTCCGGT 
TCGCTGTTCCCACTGCCGCTGGTCAGGCTGAGCTGCTGTTGCGGGCCATTGGGATGAGAATGCTCCTTATCGAGCTCCTTCGAAGGGATCATCTTGCCCTCCGTCCATATCT 
GCGGATTGAACACTGGCGCCGGTGGCTGCAACTGATTGTGGGAGGGCTGCTGTTGCTGCAGTGCCATCATCGAGGGAGCACTGTCCGTGTGGGTGACCTTCTCGTACCGCTC 
CTGCTCCAAGTCCCTCAGATCGATGCTGCTGCGGCTGAGGCGGCGTGGCGGCGGCGATGGCACTCTCAGTTCATCGCACTCGTCCCTATCGTCGTCCTCGTCGGTGGCCGGC 
TGCAGCATGTCCTGTGGGCAAAGGACAATCACTCAAAGGGAAGTCATGGTGGCGCATCAATTACGCACAGCCACAAGGGAAAAGGAAAAACTCTCAAGCGGAGTGCTTCCCT 
CTAGCACCTAATCTAATCGCTCCACATCCGTTCATTGAGGATGGGCAAAAAGCTTTAGAGCAGGATGTCAATGTTGCCCGTTTGTATACATGAACTGCATTGCTTTAAAGTC 
TTTGTATATTTATTTAGGATACTATCTGGATGTATCAGCAACTTTACCAAATTTTACTCAATAAAAAATAAATAATGTTTTATTCAACTACTAAGAGATCTTCAGAATGGCA 
AGCAAATGCGAGTACTAAAGCATTTATCGGTTTAATAAAAATAAAACATTTTTAAGTATCTCTAGGTATTTTAGGCATCTTTGGATTAATGTGTATTTTTTCATTTTGAACA 
TCTTTGAAATGTTTATTGTTTATTGAATGATTTATTGAATTTATATCAATAAAATGCAGCAAAATACTCTTTTGGCTTTCAATAAATGACATCTGTATCTTTTCGGTTCAGT 
AACATTTAGTAACCTTCCAAACTCACCTCCAAGTTACTAAGCGCACTGTGCGTCTCCTCCCGGGCGGATGTGGCATTCAGATCACAGTCACTGGGATGCAGATAGCTCAGGC 
TGGTGGGCTGCTGGGTGTGGCCCATGTGGACGCTGTTAAGGAGGATGTTGGAGCTGGTGCGGCTGAGGGCCTTGCGTTGACGAATCGCCTCCCTGGAAAACCACAGAGCACA 
GATCAGCAATTAAGCAAACTCATCCACTACGACCAGCGACCATCGACCATCGACCATCCGGGTAAGAACTCACTTAAAGATGCGCCAGTACATCACCAGCATCACAATGCCG 
GGTATCCAGAAGCTCACCGAACTGGAGATGAGGGCGTAGGCCTTGTTGACCACAAACGAGCACTGGTCCGGATGCAGGGAAATCTCCCGCAGGTGCTCCTCCGTCGTGTACC 
AGCCCAGAAAGATGGGCGTGAAGGATATGAGGGCGGGCAGGATCCACACATTGGCGAGCATGAAGCAGACCGTTTTGTGTGTCATATTCAATGGATACTCCAGTGGACGCAC 
AATGGCGTAGTATCTGTTAAAAAGAAGAGGGGGTCATCTTTATTAATGGAAATGAATTGCGGACAGGTCCTCCGCACAAAGGTGCCAGGTGGCTTTCAGCCATTTCCTCCTT 
CTCCTCCTCCTCCTTTTGGCTTACTAATTGATTATCGCTCTTTGGTTGGCTGAATAATTGTGTTTCGGTACTCAGGTTGCCATGCACAGAAATCGATTTATTAACCTTTTAA 
AGATCATGTTGCATTTAAATGAAGGATAAATGCTCTTAACCGAATTCATTATCGGAATCCGTGAAAATAAGGTGGTGTATTCTTTTTAATTATCAATATATTAATAAATGCA 
ATTGAAATGAAATAATTATTAATCCACGTATATTGTACCTCATTATTTTATCGAAACTTAATGCTCCTTTTAAAGAGTAATGATTATGTTTGGAACATTACAGATTTATTAG 
TTTAGCCAAATTTCCGTTAGTGGCCATCATTAATTCTCTAATTTTGATAGGTTCTGGTTGAATTTCCCTTTGTTTCTTGGTATGTTTTGTGGCCCGTCGAAAGGTGAGTGAA 
ACACAAATTAATTAAAACAGACAAAAGGCCCATGACTCCAGTGTCCATCGACCATCCCTCCATTGTTGCTGGTCAGCCCGAAAATCGCAGCTGTTGGTGCCTTCCAAGTGGG 
GAATGGCGCCAATTAACGGCTTGACAGGATGGAGAAGGTTGTGGGAGTTGGGTGGAATATGGGTGGAATATGGGCAACGCCCACGTTTCGGCCAGCTGCAAATCATTTGGGC 
CGCATTCATCATACGGGCGGGCCTAATGTTTGCTTTTGATGTGCGCGCCTTTCATAGTAATTAATTGTTTCTTCTTGGCAATTCGCCGGGCTTTGGTATTGAAAGTGATTTG 
GTTGCCAGGCATCTGGGCAAGATTAATGGTATTTGGCCAAACTAAACGGCTTTCTTCAGTACCATAGCCCAGCACAATTGACTTTAAACCGCTTAATATATAGATAGAGAGC 
GBCATAAATGGGCACTCACCTGTCCACTGATATGCAGCACAGGTGCAATATGCTGGCCGTGGAAAAGTAAACATCCAGGCTGTTGTACACGTTGCACATGAACGGTCCGAAC 
ATCCACTTTCCGCCGGACAGTTCCACGGAGGCGTTGAATGTCATCGCACAGAGGGCCACCAGCATGTCGGCCATCGCCAGCGACACAACAAAGTAATTGGTTATCACCCTGC 
GAACGAAACAAAAGCCCAGGAATTTGTGAATATGGCGTGGTTCTGGGCGGGATTTGACCAGCTTCTCCCCACATCTAACCACTTGTCACAGCGCAAAACCACGCCAAACGTA 
ATGGCATATGGCATGCATCATATGTGCACATGCCGCTGTACTAAGCGCTTTGGTTGGCTCCTTCCGTTGGCCAAGCAGTGCACCACTGCTTTCCATTTCCTGTTCCGGTGTT 
CAGGAGTTTGGAGGCAATAGGTCACCGGTATGCAGCAATTTCCTTGGCTGTCATAAGTCACTCGACTCACCAGGCAGCTGCCACCTTCAGCTGCAAGCTACTGGCTTGTGAA 
GTTATCATCTTGTCTTCGGGGCGTATGTGGTTTAAGGAGCTGTTTATGCGAAGATGAAAGCACTTTTGTGCTTCCAAGTCATTTATTACATATTGCGACATCTATTACGTTG 
GAAAGTGTAGTTTTACAGGGGCTGACAACATTGTCTAAAACATGTTATTAACGTATATTGTAATATTGTAGTAATGAAGCTCAATTAAATCAAGATTATTAATCTGTGCATT 
TAGAAGGGCTGTTAATTTCGAAAGATCCTTTTATGTCAAGCCGTTTTTTCTTTTATGCACTGCTTTATTTTGTAATATCTCAACGCAATTCTATATGGCGAACGTTCTATCG 
TAAGGTAGTAAATGATAATCTGAGCACTTTAAGCTCCTTTAGCGGTATAATAAAGACTTTAATCAACATCTACCCTTGCTGCAGAAGGCATAAACGTTGCGCAACATTAAAC 
GGATGTCGTGGGTGTACGTGTGGGGGCGTGGCCGGGGGCGCTGGCGGTGGCAGAGCGCTCAATGGTATGCTACACTTATTTGCACTGCACTCTGGGGCGTCCTGTCATCGCG 
ACGCTGACATGGCAGCGATGGGCTTGGGATCGGCTTGTGTGCCGCATTTAACTTCGTTTTGTTCTTTATTATTCCACTTGTCGCTCATTTAAATTACATTTATCGCGATGGC 
GAAAAGGATTCGCCCTGCCGCGTCACTGTTTGTTTGCCATTTCCATTGTCCTGCGACCCTTGCGGCAATGACCCTCAATTAAACTCAGTTACAGGTGGTGAAAACAAAAGTA 
ATTGGATATAGACTTACCGCAGCTTTCGATTGCGCTGCACTGAAATGATGACCAATGCATTGCCGAGGACAGCGGCCAGGATAATTGACGAGAAGATGAATCCTTTGAGCAG 
CAGCAGCGATAGATCCAGCCAGCTCTCCTCCAGCTCCGCCTGGTTATCCACCGCAATGTCACTGCCATTGGCCGTTATATTTGGGCCGACTGTCGAGAGGAGCTCAAACGTG 
GAATCCGACGACAAGTAGGATGAAAAGGGCGAGGAGTAGGAGGCATCTGACGATGTGGCTGTTAATGATAGCGAAGTCAACGAGGCATCCACAAATTGCGATGTTATAGGCA 
AACTGCTCGTCGGCGTCGTCGTAGTTTTGGCTGCTGCTGTCGTGGAAATGTGGCCTGTTAATGTTGCAGGTTGCAGGTGGCTGGCGTTGGTGGCTGATGTTGCTGCTGCTTG 
TGATGTTGCTGCTCCTGCTGCGGCTGTTATGGGTAATGTCATTGTCGTGGCCAGAAGGTCAGCCACGTTAACCCCTGACATCTCGTCGATTTCGCAGCCCAATTGAGTCGCA 
GCTCATTCAAGTTGCTCCGCCAATGCTTTTTCGCTGTACTTGTCGTTTATCTTTTTCCCGGCTGCTGTGCGTAATTTAAAACTCGAGTAATTGCAATCTGCAATGCAAACAA 
GGCTGCCGAAGTTTGCCGTCATCCTGGTTGCCAATGTTGCTCCTCTGCGATTCCAGTATTTGCTCACAGCTCCGACTGAAGGTCTCTGCTCCATTTCCCAATCCTCCATAGG 
ACGCTTTTCCCGTTTGAGTTGCTGGAAATTGGCAAAAAGTCGAGACTCGCGCTTCGATTTGGCACTCAAGCGCGCAACAGTCGGCGACACGGGAACTGCAAATGAATAAATA 
AATAATTTATGAAATAATCGGCGAGTGTGAGGCTGTGAAACTGCATGGCAAAATTGGTAAAATGGGAATTCGAACGGTTGCGCCGCCCGGGGTGTTTTTATGTACATACGCA 
ATATTCAACAGGGGGCATAAACCCTATTGCCTATTTATGTGCCGCAGTAACACATAAATGGGATTCTTCTGTTTCTCCAGTGCCCCGTCAAATGTCCGTTCGTTAAAAGTGC 
GCCACTTCAACTTTGCTACACTAGATTAAAATAACAGAAATATATTGCGTATACGCAGTGTGTGCACTCGGCATGCATCTGCATTTCTATATCCCCAGCGAGACAGCGGTGC 
TGTCAAGTGCTGACACTTGTGCAGAGTGGAATGTGGAATATGGACAAGTTGTTGCTTTAAAAGTGAGATTAAAGCAAGATTATCAGTGGGAAAGTTGTTTCAGTAGTGCCTC 
ACGCCTGGGCGCAGTGAAAACTGCAAATCCTTTTTGTCCCGGTACTTGTAAGTAAAACAAAAGACTCTTCATAATCTGTGGTAATCTCGAATTTTTTCTTTTTTATTATTTT 
AAACTTTTTGCGACTCAATTTGGCCGTAAAAATGCTAACACATCATCAGGGAAAAGCTGAAAGCGTGCTTACACTTTACTTTCTCCCCCGCAACATTGAATTGAAGCTGAAT 
CCGTCCTTGGCACTCAGACAAATCGCTGGCTACACGTTTTCATTGCACTTATTCTGTGCACTCTAGATGCCCTGCCCTCATTCCAATCTTCAGAGTTTTGAATCTAATATTT 
TTAAGGATTTATGTACTTATACAGATAACCTTGCCGAATTGATTGCATTGCGTTGTCCTTGAGCTGTCGATTCGACTTGTTTACTCAGACCATCCGGTCACATCCTCCTGCT 
AGATTGGCAACTTAATGTTTTAATGCTCGGACTTAAACCACATCAGTGCGCTCTATCATCCTTGGCTTATACATTTCACAAGCCATT 
(SEQ ID NO: 187) 

Exon: 10847.. 10568 
Exon: 10497.. 9986 
Exon: 8844 . .8644 
Exon: 7629. .7354 
Exon: 7260. .7083 
Exon: 3656. .3332 
Exon: 1439.. 1331 
Exon: 1211.. 1001 

Start ATG: 104 97 (Reverse strand: CAT) 
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Transcript No. : CT21843 

TTCCCGTGTCGCCGACTGTTGCGCGCTTGAGTGCCAAATCGAAGCGCGAGTCTCGACTTTTTGCCAATTTCCAGCAACTCAAACGGGAAAAGCGTCCTATGGAGGATTGGGfl 

AATGGAGCAGAGACCTTCAGTCGGAGCTGTGAGCAAATACTGGAATCGCAGAGGAGCAACATTGGCAACCAGGATGACGGCAAACTTCGGCAGCCTTGTTTGCATTGCAGAT 

TGCAATTACTCGAGTTTrAAATTACGCACAGCAGCCGGGAAAAAGATAAACGACAAATGTCAGGGGTTAACGTGGCTGACCTTCTGGCCACGACAATGACATTACCCATAAC 

AGCCGCAGCAGGAGCAGCAACATCACAAGCAGCAGCAACATCAGCCACCAACGCCAGCCACCTGCAACCTGCAACATTAACAGGCCACATTTCCACGACAGCAGCAGCCAAA 

ACTACGACGACGCCGACGAGCAGTTTGCCTATAACATCGCAATTTGTGGATGCCTCGTTGACTTCGCTATCATTAACAGCCACATCGTCAGATGCCTCCTACTCCTCGCCCT 

TTTCATCCTACTTGTCGTCGGATTCCACGTTTGAGCTCCTCTCGACAGTCGGCCCAAATATAACGGCCAATGGCAGTGACATTGCGGTGGATAACCAGGCGGAGCTGGAGGA 

GAGCTGGCTGGATCTATCGCTGCTGCTGCTCAAAGGATTCATCTTCTCGTCAATTATCCTGGCCGCTGTCCTCGGCAATGCATTGGTCATCATTTCAGTGCAGCGCAATCGA 

AAGCTGCGGGTGATAACCAATTACTTTGTTGTGTCGCTGGCGATGGCCGACATGCTGGTGGCCCTCTGTGCGATGACATTCAACGCCTCCGTGGAACTGTCCGGCGGAAAGT 

GGATGTTCGGACCGTTCATGTGCAACGTGTACAACAGCCTGGATGTTTACTTTTCCACGGCCAGCATATTGCACCTGTGCTGCATATCAGTGGACAGATACTACGCCATTGT 

GCGTCCACTGGAGTATCCATTGAATATGACACACAAAACGGTCTGCTTCATGCTCGCCAATGTGTGGATCCTGCCCGCCCTCATATCCTTCACGCCCATCTTTCTGGGCTGG 

TACACGACGGAGGAGCACCTGCGGGAGATTTCCCTGCATCCGGACCAGTGCTCGTTTGTGGTCAACAAGGCCTACGCCCTCATCTCCAGTTCGGTGAGCTTCTGGATACCCG 

GCATTGTGATGCTGGTGATGTACTGGCGCATCTTTAAGGAGGCGATTCGTCAACGCAAGGCCCTCAGCCGCACCAGCTCCAACATCCTCCTTAACAGCGTCCACATGGGCCA 

CACCCAGCAGCCCACCAGCCTGAGCTATCTGCATCCCAGTGACTGTGATCTGAATGCCACATCCGCCCGGGAGGAGACGCACAGTGCGCTTAGTAACTTGGAGCGGCCCCGC 

GGCATCACCTTCGCCGCGGTGCCCAGTCCGCTGCCCAAGTGTCCGCTCTGCGGCGCCGACATCAGCAGCACCACCGGCACCACCGCAAACGCAACCGCAACCGCCAACGCCG 

ACAGCACAATCGACACCACAGTAACCACTAGTAGTAAACGTAGTATTCACGAGCAAACTCCAGATCTTGGCCAGAGGCCCGCCTCCTCGTCCAGTTCCACTCGATTCTGGCA 

CAAGCGAACGGCGGCGGTCACAGCATGTTGGCAGCAGAGCAAGAACCGGAAGCGCCGCTTCAAAACCGGATGCAGTCACTGTGGTGCAACCGGCGGTTCCGTGCGACCGGCC 

AAAGGATGGAAGGCCGAACACAAGGCCGCCCGCACCCTGGGCATCATCATGGGCGTCTTTCTGCTCTGCTGGCTGCCCTTCTTTCTGTGGTATGTCATCACATCGCTCTGCG 

GTCCGGCCTGCCCATGTCCCGATGTGCTCGTGGTGGTGTTATTCTGGATCGGTTACTTCAACTCCACGCTAAATCCGCTTATATACGCCTACTTTAATCGCGATTTTCGCGA 

GGCATTCCGCAATACGCTGGAGTGTGTGCTGCCCTGTCTGGAGAAACGAAATCCGTACAACGCCTACTACGTCTAG 

(SEQ ID NO: 188) 

Start ATG: 281 (Reverse strand: CAT) 

MSGVNVADLLATTMTLPITAAAGAATSQAAATSATNASHLQPATLTGHISTTAAAKTTTTPTSSLPITSQFVDASLTSLSLTATSSDASYSSPFSSYLSSDSTFELLSTVGP 

NITANGSDIAVDNQAELEESWLDLSLLLLKGFIFSSIILAAVLGNALVIISVQIWRKLRVITNYFWSLAMADMLVALCAMTFNASVELSGGKWMFGPFMCNVYNSLDVYFS 

TASILHLCCISVDRYYAIVRPLEYPLNMTHKTVCFMIANVWILPALISFTP1FLGWYTTEEHLREISLHPDQCSFVVNKAYALISSSVSFWIPGIVMLVMYWRIFKEAIRQR 

KALSRTSSNILLNSVHMGHTQQPTSLSYLHPSDCDLNATSAREETHSALSNLERPRGITFAAVPSPLPKCPLCGADISSTTGTTANATATANADSTIDTTVTTSSKRSIHEQ 

TPDLGQRPASSSSSTRFWHKRTAAVTACWQQSKNRKRRFKTGCSHCGATGGSVRPAKGWKAEHKAARTLGIIMGVFLLCWLPFFLWYVITSLCGPACPCPDVLVWLFWIGY 

FNSTLNPLIYAYFNRDFREAFRNTLECVLPCLEKRNPYNAYYV* 

(SEQ ID NO: 189) 

Name: Beta Adrenergic/ Octopamine Receptor-like 
Classification : G_proteln_linked_receptor 



Celera Sequence No. : 142000013384518 

GTTCCGTTACTTGGTCTTTGACTTCTTACCCCAGAATATGATTCTTGGCAGCAGGAGAAACCAGTACGACTTGTTTTGGTCGTATTATGAGTGAAATATTACCACTCATGTC 
ATTTTGAACGTCTTTAAAGCTCACACCCTTATGGACCCATTATAAGAATATGGCGTGGCAAAATTTCCCATTTTTATCGAGGCTGGAAAAACTTTTCCACACATGCAATCAT 
GTTTCGTTTCACTTATCATGGCTAAAACCTAGCAAATAGAAAAAGTTTTTTTTTATCCACAAACATATCTGTATTGAGTGCCAGCAATCTGTATTGCTTTAGCTTAAGATGT 
TTACTATTAATAGAGCCAAGTGGCAAATGACATACGGCTTACAGAGAAAATAATTTCCAATTCAATTACATTTACGTACATATATTCCTTCCTACAAGCAGACAGGATTATT 
GATCTAAATATAAATAAATATATAATAATTATTTTAATAATATAATATTAAAATGTAGATTAACAGAATATATTTTTTTTGTATGAAAGCCTCTAGGGTATTTGTGTGACTT 
TTTTCTCTCTGCATAAGTGAAGATGTTTACTATTAATAGAGCCAAGAGGCAAGAGAAAAGTGGCAGCAATCAAGTGCTTTTATTTCGGGACTGCATGTTAAAGGCGATTTCC 
TGATCGAAGACTTTGGATACCTGATGCCATCCGTCTGAATGTGACGTCAGCCTGGCTGGGGATTCAATTTAAATTCATCCTTGAGTCTAATGACTTTTAATGTTGTTTGCTT 
CACAGGCGAGTGCGCAATTTTTGGGCCTGTCGTCGACTGCTTATCGGCTATGTGCCACCGCAGCCGCAAAGGATACGACAGACAATATTAATAACGCATTGATACGAGTGGG 
CGGCCGATTTGCTTTTGTTTTTCTTTCCCAACTTGGCGGTGGAAAGGAGTGGTGTGGCGTGGCGTGGAGTGGATAGAGTCCACAATTAATCGACGACAGCTAGTATGTTGCA 
AGGCGTCGCCATCACCATAGCCAACGACAGCAACGATGATGGCATCAATCAATCATTCATGGCTCATGTGTCGCCCAGTCCCAATCAGAGTCCCTCCATTGGAGTGGGCATC 
GGTATCGCCTCATCCACAATGGCGAATCCCAGCGAGAGCCCCGAGATGTTGCTGCTGAAGAACGACAAATTCCTAACACATGTCGCCCATCTGCTGAACATAACGACCGAGA 
ATCTTTCAAATCTCCTGGGTTCCACGAACGGCACAAATGCGAGCACAATGGCAGCGGATTCGCCGGTCGACGAGTCCCTGACCCTGCGAACAGCCCTCACCGTTTGTTATGC 
ACTCATTTTTGTGGCCGGCGTGCTGGGTAATCTTATCACCTGCATTGTTATATCGCGAAATAACTTTATGCACACGGCCACCAATTTCTATTTGTTCAACCTGGCGGTTTCC 
GACTTAATACTGTTGGTCTCAGGTGAGATGGCATTTTTGAGGGACACCATTGCGTATGCGTAATGCGGCAAATATCCTGCAGGCATCCCCCAGGAGCTGTACAACCTTTGGT 
ATCCGGATATGTATCCCTTCACGGACGCCATGTGCATCATGGGCAGTGTGCTCTCGGAAATGGCCGCCAATGCAACGGTTCTCACCATCACAGCATTCACTGTGGAGCGATA 
TATAGCAATCTGTCATCCATTCCGGTGAGTTTTTAACATTTAAACGTTATATATATTTTAAATATTTATTTACATTATTAAACAAATGTATAATTTCTTCAGTGTTATTGCC 
TCGCTCTCATTAACAAAGTGGATCATGCATAATTAGGACAGAGAATATGAATAGATAAATTAAACTTGATAAGCGTCAGTCGGAAACATTATGATATGAAACAACAAAAGTA 
CCTCACAAATTATGGGCACAATAATTATTCATAACTAAACAGTCTGGCTGTGGAATGATAAAACAAGTTAGGCTTGACAACCAAATTCCCATCGTTTTTAAATATACCAAAT 
TCATTAGCTTTCATTTATAGACACTAATTAAAGTCAGGTCCATAAATGAACCAAGTTCATTACAGACTATGTAATTAAGTTCCAATAAATGCGTTGTAAAGATAGAAAGTTA 
GTTAGTCAATTAATTGCTTAGTACCAACATATTTGCATTTTTGGAATGGTACAACAAAACTTTTCCTTATACAGGTATCACCAGTTCCTTCCAAATGTTCTGACCATTTTGC 
GGTCAAAGAATACAACAGCTTCCTTAAGCGAAAGCGTCTGGTTTTCATTTACGATGTTGCATTTCCTTTAGTTCAGAAACTCTTGAATAGCTAGAAGAAATGAAACTTAAAA 
ACCTTACATTGTTGTTTTCAAACACAATTTCCGTTTCTTAGCATTTTTCACAAATTAATTGTTACGAATACAATCACTTGGAGTATGTCCTTAAGAATATGCAGACACAAAA 
GGGTGGTTTTCTATAAGAAGGGATTCCAAGTCCGATTACAAAGTTAAGGCAAAAAACTTTCATGTCTCTGAAGTTGGGATTAAATATTGTTGTCGAAAAGAGTAAAAGGTTT 
CTATGTTTGCCATTTAATAGTATTATAGTTATTTTGAAACGAAAGGATATTCGGACATGTGCTTAGCATGAAAAAGGTCTGAAACCCATTGCAGTCACTTTACCAAAAGAAG 
TTAAAAG'rGATGTCTATTAAATTTGGTATGAAACTGTTTGAATTCCTTCCCATTTACACCCACATTCAAAGTAAATATTTATGTTGCACTCAATATCCCAAACCTTAAAGAG 
AAAGTTAAAAAGCGTAAAAAGTGTAGGTGGTCACTGCGTGTACTTTACAACCGTAATATTTTCCATCTTATATCTGCCCACGTTTATCCGAAACATCGACATCATCTTGTCC 
TTAATACTTAACATATAATGTAGCCTAATCTTTCCGTCATTTGTAGGCAGCACACCATGTCAAAATTGTCGCGAGCCATTAAATTTATATTTGCCATTTGGCTGGCGGCCTT 
CCTTTTGGCCCTTCCGCAGGCCATGCAATTTTCGGTGGTCTACCAGAACGAAGGATACTCGTGCACGGTGGGTGGGACTGAAATGTGAAAACCAACTGACGCATCATCGAGT 
GTCCTTTTCGCTGTTTATTCCCTTAGATGGAAAACGACTTTTACGCCCATGTGTTCGCCGTTTCGGGATTTATTTTCTTCGGTGGACCCATGACGGCGATTTGTGTACTGTA 
CGTGCTGATCGGAGTGAAACTGAAGACGAGTCGACTGCTGCAATCGCTGCCGCGAAGGACTTTCGATGCGAATCGCGGCCTAAATGCCCAGGGACGAGTCATCAGAATGTTG 
GGTAGGTGAGTTAAGTTGGCGGCTTGATTGATTAAAGACAGCAATATGCCCCACCGCCAACCAGAAACAACAAGAACATCCACGGGTGTCAAAACTTTTTCCGCCGAAAGTC 
TTAAATTGAAAATTACTGTCAACGTATTTGCGTTGGCGAACATCAAACAGGGAATTCGTCATAGCCCCGCCCTCATTGCCTTCCGCCCACTTCATTTGCACTTGCGTCATGT 
GGGCTGAACGTGCTTCTGTTTTTTTCCCTCTTTTCCCTCAACGGTGTCGTTAATTTTTCGCAAACATCGGTGCGTATGTGTAAAGCTCTTAAATGGCCATCTAGCCTATGAA 
GCAAAGGTGAAGCCAGGTGTCACCTCAAAAGTAATCAAGCCGGGGCACTTAAATGAACTCAAGACTGCTGGGTGCACACAGCAAGCTTAAATATTTGTAAAAAGTGTTTCTT 
GAATCTATAGACATATGTCCGTTAACGTGCCACGTTATAAGCTGTTAAATTTTAAAGGAATTCAGCACACATCTCGTGTATTATTCACATATTTTATTTATATAATTAGAAA 
TTCTTATCAAACCATACAGTGAATAAATTTAAGTGTAAATCCAGGCCCAATTTTAGATCCCAATAAGCCGTTTTTTCTTACAGCCTACGAAACAACGTTTAATTTACTGTTG 
CTTTTAACTGAAACACCGACCATTGATTTCGAAATTTACCATTGGCCCTGGCCTGCCAAATGGAAATTGATGGACCAATTCAGATTAAGCCCAGAGCCTTAACCATAAATTA 
ATTTGGCCAGCAAGAGTGGAAAAAAAACACACCCGTTCTCAGGGCTTAGTTATTCATAGATAGTGGGATATTATTTATCGTTTGCTAAGAGGTAATACCTTGGCTTTTATCA 
AAATGTGATATTTTACCAAAGGCCCGTTTGCGCAAACTATTCGATTTCGGGGGATATAAAAGATTAAGACCACTTTCAGTGTGAGAGAGCCCCTTTTCCGGTGACCTAGTTG 
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TTCAGGGGTCTTAATACGTTGCTTATCCTTACTTGACCGTATTTGCflGTflGCTGTAGCCGTCGCCTTCTTCCTCTGCTGGGCTCCATTCCACGCCCflGCGACTGATGGCRGT 
TTACGGACTGAATCTGATTAATATAGGCATCAGTCGGGACGCCTTCAACGATTACTTCCGGATACTTGATTACACGTCCGGGGTGCTCTACTTTCTCTCCACCTGCATCAAT 
CCGCTACTGTACAACATCATGAGCCACAAGTTCCGCGAGGCCTTCAAGGTAAGCCCATGGGATCAACCCTTCTCAGTTAACCACTTCATAATAGTAATCTATTATTAATTGG 
GTCCTTCGCATGTTTGTGAGGACGGTCACAAATGCAGATGCTTTTAAAGTTAAATTTAAGCTTTAATTATTCAAAATTGTTTTTCGTAAATTAATGAATTGGTAATCTATGT 
ATTTTTTACACTTTTAAGATCACTCTGACGCGACAGTTTGGTCTGGCCAGGAACCACCACCACCAGCAGAGTCAGCACCACCAGCACAACTACAGTGCCCTTCTCCGCCAGA 
ATGGATCAATGCGTCTGCAGCCGGCCAGTTGCAGTGTAAACAACAACGCGCTGGAGCCATATGGATCCTACCGAGTGGTTCAGTTTCGTTGCCGGGATGCGAACCACCAATT 
GTCCCTGCAGGACAGCATTCGCACCACCACCACAACCACGACGATAAACAGCAACAGCATGGCAGCCGGAAATGGAGTTGGTGGTGGTGCTGGTGGCGGTGGCGGTGGGCGG 
CGTCTGCGGAAACAGGAGCTCTATGGGCCAGGTCCGGGTACAGCAGTTCCCCATCGAATGCTGCAGGCCCAGGTGTCTCAACTCTCGTCGCTGGGCGATGCCAATTCACTGC 
TGGAGGCGGAGGTGGTGGACAGACACTACGCATCCGGGCGAGCCAAAAGAGCTCTCTTGGCCACCAAAAGTGGTGCCCTGCTGGTGACACCACCTCAAAGTGGTGACCCATC 
GGAAGTCAGTCAGCCCGCCACTCGTCTCAAGCTGACTAGGGTGATAAGTAGAAGGGATGAGGTGGCCAACACAAGCACTCCGCCCTTTTGTGGGAGCCACAGTTTGCCGGAT 
CCGGAGACTTGTCAATCGGCATCGGTGGCTGGTCGAAGCTCCCGTAAGTTTCCGTGGCGAAAACGAAGGCAAAAAACGGAGGATCCCAGCAGCGAGGGCTTGACCTATGGCT 
CACCCAAATCCCAGTGATTTCAGCTGGCTGGTAGCTAGAGTTATACTTAGTTAGGCATTGTTTAGTTAGCGCAATAAACTTTTCTATCTGTAAATAAGCCAGAAAAAGTCCT 
GCGATTTGAATGCAAATTTCGGGTGACCATAAAAAATTTAACGAATGGGCATGGTCCAGACAATTTATGAGAGTCATAAAGCCGGGATAATGAGGCTACGGCGTATCGTGAC 
GAATACACGCCCCACATCATTTTGCAATGGCAACCGCATAGCACTCATAAATTAAGTGTTAAGCTACCAGCTAAATAATGTGAATAAATAAAAATTGTTCTCAATATTACAG 
CAAAATCCTTTTAAATGCGTTTTCCTTTCACAATGCATGCCAGTTACATTCAAGTTCGCCTGCATTCCAAGTTGTCCAGGATCAAAGGGAAAATCAAAAAAAACAGTTTACA 
TCTCAAGTTTTGCAGGTGCAGCGTTTCCAAGCAATCTGTTCCTACCTTTTGCGACATTTTTCCTGCAGATTGCCTGGCATTCTGTTGATTTCCCACACAAAAACTAAAAGCG 
CTGAAAAAAATAAAAGAAGCATAAAATAAAATAAAGTGCCAATAAATTAGCAGTCTTAATATTTTCAGAGCCCCTCAGAAGCAAATCTAGGAAGGAAAACATGACATAAATT 
TTAAATGGATTGCTCATCAGGATATCGTGTTGCAGTAGATATATATATTTTCGCAGTGTAAGTCACGACTCCACTTGCACCTGGCAACAAAGTTTCAGTTTGCACCTGGCCC 
ATTGCCGGAGGTCCATTAGTCAAGAGGGCTTTGTAGGCTAGGAAGTCCGGATTGAAACGAATGTGCAGAGAGTGGCACGAATGGGCTTATCCTGCGGTGAATTGGCACGTGG 
AGTGCATGGAGAGTGGTGTCGGTGCCACAAAATAAATCAACTTAAAAGTAGCATACAAATTTAAGGACCCGACTGATGTTATGTCCAAAGGGAATTATTAATGGTTCTTTAA 
AGGCAGGCA 
(SEQ ID NO: 190) 

Exon: 1001.. 1478 

Exon: 1539.. 1704 

Exon: 2959.. 3091 

Exon: 3163.. 3361 

Exon: 4411. .4640 

Exon: 4835.. 5617 

Start ATG : 1001 

Transcript No. : CT25324 

ATGTTGCAAGGCGTCGCCATCACCATAGCCAACGACAGCAACGATGATGGCATCAATCAATCATTCATGGCTCATGTGTCGCCCAGTCCCAATCAGAGTCCCTCCATTGGAG 
TGGGCATCGGTATCGCCTCATCCACAATGGCGAATCCCAGCGAGAGCCCCGAGATGTTGCTGCTGAAGAACGACAAATTCCTAACACATGTCGCCCATCTGCTGAACATAAC 
GACCGAGAATCTTTCAAATCTCCTGGGTTCCACGAACGGCACAAATGCGAGCACAATGGCAGCGGATTCGCCGGTCGACGAGTCCCTGACCCTGCGAACAGCCCTCACCGTT 
TGTTATGCACTCATTTTTGTGGCCGGCGTGCTGGGTAATCTTATCACCTGCATTGTTATATCGCGAAATAACTTTATGCACACGGCCACCAATTTCTATTTGTTCAACCTGG 
CGGTTTCCGACTTAATACTGTTGGTCTCAGGCATCCCCCAGGAGCTGTACAACCTTTGGTATCCGGATATGTATCCCTTCACGGACGCCATGTGCATCATGGGCAGTGTGCT 
CTCGGAAATGGCCGCCAATGCAACGGTTCTCACCATCACAGCATTCACTGTGGAGCGATATATAGCAATCTGTCATCCATTCCGGCAGCACACCATGTCAAAATTGTCGCGA 
GCCATTAAATTTATATTTGCCATTTGGCTGGCGGCCTTCCTTTTGGCCCTTCCGCAGGCCATGCAATTTTCGGTGGTCTACCAGAACGAAGGATACTCGTGCACGATGGAAA 
ACGACTTTTACGCCCATGTGTTCGCCGTTTCGGGATTTATTTTCTTCGGTGGACCCATGACGGCGATTTGTGTACTGTACGTGCTGATCGGAGTGAAACTGAAGAGGAGTCG 
ACTGCTGCAATCGCTGCCGCGAAGGACTTTCGATGCGAATCGCGGCCTAAATGCCCAGGGACGAGTCATCAGAATGTTGGTAGCTGTAGCCGTCGCCTTCTTCCTCTGCTGG 
GCTCCATTCCACGCCCAGCGACTGATGGCAGTTTACGGACTGAATCTGATTAATATAGGCATCAGTCGGGACGCCTTCAACGATTACTTCCGGATACTTGATTACACGTCCG 
GGGTGCTCTACTTTCTCTCCACCTGCATCAATCCGCTACTGTACAACATCATGAGCCACAAGTTCCGCGAGGCCTTCAAGATCACTCTGACGCGACAGTTTGGTCTGGCCAG 
GAACCACCACCACCAGCAGAGTCAGCACCACCAGCACAACTACAGTGCCCTTCTCCGCCAGAATGGATCAATGCGTCTGCAGCCGGCCAGTTGCAGTGTAAACAACAACGCG 
CTGGAGCCATATGGATCCTACCGAGTGGTTCAGTTTCGTTGCCGGGATGCGAACCACCAATTGTCCCTGCAGGACAGCATTCGCACCACCACCACAACCACGACGATAAACA 
GCAACAGCATGGCAGCCGGAAATGGAGTTGGTGGTGGTGCTGGTGGCGGTGGCGGTGGGCGGCGTCTGCGGAAACAGGAGCTCTATGGGCCAGGTCCGGGTACAGCAGTTCC 
CCATCGAATGCTGCAGGCCCAGGTGTCTCAACTCTCGTCGCTGGGCGATGCCAATTCACTGCTGGAGGCGGAGGTGGTGGACAGACACTACGCATCCGGGCGAGCCAAAAGA 
GCTCTCTTGGCCACCAAAAGTGGTGCCCTGCTGGTGACACCACCTCAAAGTGGTGACCCATCGGAAGTCAGTCAGCCCGCCACTCGTCTCAAGCTGACTAGGGTGATAAGTA 
GAAGGGATGAGGTGGCCAACACAAGCACTCCGCCCTTTTGTGGGAGCCACAGTTTGCCGGATCCGGAGACTTGTCAATCGGCATCGGTGGCTGGTCGAAGCTCCCGTAAGTT 
TCCGTGGCGAAAACGAAGGCAAAAAACGGAGGATCCCAGCAGCGAGGGCTTGACCTATGGCTCACCCAAATCCCAGTGA 
(SEQ ID NO: 191) 



MLQGVAITIANDSNDDGINQSFMAHVSPSPNQSPSIGVGIGIASSTMANPSESPEKLLLKNDKFLTHVAHLLNITTENLSKLLGSTNGTNASTMAADSPVDESLTLRTALTV 
CVALIFVAGVLGNIITCIVISRNNFMHTATNFYLFNLAVSDLILLVSGIPQELYNLWYPDMVPFTDAMCIMGSVLSEMAANATVLTITAFTVERYIAICHPFRQHTMSKLSR 
AIKFIFAIWLAAFLLALPQAMQFSVVYQNEGYSCTMENDFYAHVFAVSGFIFFGGPMTAICVLYVLIGVKLKRSRLLQSLPRRTFDANRGLNAQGRVIRMLVAVAVAFFLCW 
APFHAQRLMAVYGLNLINIGISRDAFNDYFRILDYTSGVLYFLSTCINPLLYNIMSHKFREAFKITLTRQFGLARNHHHQQSQHHQHNYSALLRQNGSMRLQPASCSVNNNA 
LEPYGSYRVVQFRCRDANHQLSLQDSIRTTTTTTTINSNSMAAGNGVGGGAGGGGGGRRLRKQELYGPGPGTAVPHRMLQAQVSQLSSLGDANSLLEAEVVDRHYASGRAKR 
ALLATKSGALLVTPPQSGDPSEVSQPATRLKLTRVISRRDEVANTSTPPFCGSHSLPDPETCQSASVAGRSSRKFPWRKRRQKTEDPSSEGLTYGSPKSQ* 
(SEQ ID NO: 192) 

Name: GROWTH HORMONE SECRETAGOGUE RECEPTOR-like 2 
Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384518 

ttgggcaaattaaggaaactttttgtggtttcagttactcgtttaagccacattttcccaacatatgctgcactttatgaattaaaagtcatttattttaggctgcacacac 
gttatgtttgcctgacaaattgcatttgctgaaattaaaacaaaatgcactgggacactgcgttatggcatGctgctaattagtcaccaacagcggctttcagcgaatgttg 
gccaatcgcgagaatggctttcatccactgtctagactccagaccaaaaagcgaatttgtgcgccagataaacatcttaatggccagggattatgatgcagtgcccgctttt 
tactgacagtaattagccagacgcccattgaaccctcttcttgataatccccacttgatccacactctttcaaaaaaaaaaaatgtattcattaaatgggaaaaaaatattt 
actgcaagagaaatgttattaatggatattgaaaaactgacattagataacagtttctattggactgctaacaaagtcggtcagctgtttaaaaaactgagatcgcttttat 
atttaaattgtataataaaaccattagggtatcaaccttaatatgtatttatatatcactgtactccaaatactatttacgcttaactatgtagattcattttatttttatc 
aaaagtccattctttttttattaaactccactttataaatgaaacaacggtcgaatttttgaactgagtacataactacatagtttattcgattacgtgcactataactata 
attcgaactatagctttgggctatattacggttattggtaaggacaaccatataattgaccccaaacgcggaatatttgtctaaattaaaacacaaacaagctggagtggcg 
cttgctgctcgtatcacagttgtgtgtgtgtgtaatgtacgtaggtggcactggaatggactacctcgcttatcgagtacagatagaggatatgctgaccccctttaaaagg 



Start ATG: 1 
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CGGCCCGCTCTTCAGTGTCGCCTCCATTCACCGTCGCCCCCTTTTCGTGCGGTTTGTGGCTGCCTTTGCGTCTGGCflCGCCAATTGAATTTGGTCCTGAGACCCTTGAACGG 

GCTTGAGCTCTTGAGCACTTTCACTTTCGCCTTGCGCACTTTGCCACTCGACTCGTCACCGCTCACGCCCCCCGACCCCGCGGCCCCTGTTGTGCTGCCTCCTGCCACGCCC 

CCCGTCACACCTCCTGGTGGGCGGGTGATGCGCGTAAGTTTAAGGCGTGACCGCGAGATGGCCAAGTCATCTGTGAGCTCGTCAATGGAGCACATCGTGGGCTGGCGCTGCA 

GTGGCTCCCCGGGAAACTGCAAATCCTCCTCCAAGAGGGAATGGGCATCGCCCACCGATGACAGCTGCGATATTTGCGTCTGGAGCATGCGACGCGGATTATCGAAGAGGTC 

TTGCCGGCTGCGATTCTGACCCTGCATCTGGACACTGTCCAAGGACACGCGGTTCAGGCGCACCGACTGACCCGTTGTCGTTCCATTGCCACTGCCATTGGCAGAGCCATTT 

AGCCCAGTGGTGGTCGTCGCCATGGAAGTCATTGTGGTGCGAACGCTGTCCTGCAGGAGGAAATCAAGGAATGGAGGGATAATCAGAAAGGGAAAGTGCAGTCGCCAGACAC 

GAAAATAAGTAACGCAATAAACATCTTATAGTGGTTTTAAAAAGTCTCGTGATTCAAATTTCATTTTGAGAAACAGGACATACACTATATTAATCCTTTTCATTTTCCTCTT 

AAATAATGCCACATAATAGAATTTAAGTGCATATATCGAATGATAATTATGTAATTTTTCTCAGTGCACTCACCGTTGTGTGCAGCCGCAGTGAACCCGTTTGATTACGTCG 

AAGAGCGCTGTAAGTATGAGGAAGCCCCCTTCCCTGATTTTTGCCACCAAGTCCAAAGTGTCTAGCCAAAGTTACCTAAAATGGGATTAAATACGTTTTTTAATTCTAGCTT 

GAAGTTTAAAATCACTAAATTGACCGTTAAAGAATCGATTAATAAACTTGTAACCTGAAGCTTATTTTCAATTTCCTAACGCAAGCTATAGAAATGAATATCCGGCAGCGAA 

ACTCACCTTAAAAGCCTCACGAAACTTGTGGCTCATGATGTTGTAGAGCAGCGGGTTGATGCAAGTTGAGAGGAAGTAGAGAACACCGGACGTATAGTCGAGGATGCTGAAC 

ACGTCGTTGAACCACTGGGACTCAATGCCCGAGGTGGATCCATAGACCGCCATCAGCCGCTGGGCGTGAAAGGGGGCCCAGCAGATGAAGAAGGCCACGGCCACCGCCACTG 

GGAAAGTTGGATAAGGGCGCCTGATGCAACCATGAGTTATTTCGGATATTTCATATCCTGCTTACCCAGCATCCGGATGACTCGCGTTTGGGCGCTTATCCCCCGGTTTACA 

TCGTAACATCGCCTGGGAAGCGCCTGCAGGAGTCGGCTCCGTTTCAACTTCACCCCGATGAGGACATAGAGCACGCAGATGGCCGTCATGGGTCCGCCAAAGAACAGGAAGC 

CCGACACAGCAAACACATGGGCAAAAAAGTCGTTTTTCATCTACAGCAGGCATTTCACAAAAATAATATATATGTGTATTCGCATGCGATACGATATGGTTTTGGCTTACAG 

CCACAGCCGGAAGGCAATGGCTGTAAATTTGATTGCCTAAACTGAGTCGAACCTCGTAAGGGATTGTCGCTGTTTTTATCATGCTCTTAAATTTTTTGTAATTCACAAAGGC 

TCTCTATTCGAACTTTCTATACGGAAAACACATAATGAATCAAAGGATCAAAGGACCGGTTCTACGAATCAATTAACCAGGTAACTCATAACTGGGGGACATACGATTAAGG 

GTCTCGCATCAATCATACGCACCGTGCACGATGTTCCCATGCCCTGCATCACCACGGAGAACTGAATGGCTTGGGGCAGGGCCAGCAAAAGGGCAGCTATCCAGATGGCAAA 

TATGAACTTTACGGCCCGTGACAACTTGGACATCGTGTGCTGCCTGTATCGAAAGCAGAACCAATTAAGCAAGTTTGGGGCAAAATGGAAATGGGCAGTGGAAGGAATGAAG 

GTTCCCTTACCTGAACGGATGACAAATGGCAATATATCGTTCGACTGTGAACGCGGTAATGGTTAGAACTGTCGCATTGGCCGCCGTTTCCGAGAGAACGCTCTCCAATATG 

CAGATGCTGTCACTGAAAGGATAATTATCCGGGTGCCAGAGGTTATAGAGGTCCTGCGGCATTCCTAAGAACATAAATAGCAAAATAGCAAAATATATAGTGTGATTATAAA 

TATTTAATAATAGCTGAAATTATAGCCTATAGCCTCAAAATGCAAATCGATTTGAGCATCTGCCGAATATCTCACTCTACATTTTCTTGTTTATTTACAAATTTTAATTTAA 

TATGCCTCAGTTCCATTTATTGTTTGCACTACTGAACTGAATTAAATTATAATTCAAAAACCATGAACATTTTAGTTTTGCACTCACCTGAGCATAACAAAATCATGTCGGA 

TATAGCGAGGTTAAACAGATAAAAGTTGGTGGCCGTGTGCATAAAGTTGTTCCGCGAAATGACAATGCATGTGATGAGGTTGCCCAAAACGCCGGCGATAAATATGAGGGCG 

TAGCCCACGCTTAGGGTGGCCAGCAGTGAAAGTGGTGTCATCGGGGTCACCGTGGGCAAAAGCTCCTCCGGCTCCAGGCCCTGGAGAAGGCTGGTCAGGTTGTCCGCGGAGA 

TGTTCAGCACTTGTGTCAGATTTAAGAGAAATTTCTCATTGTGAAACAGTTGTAGATCGGTTGCCAGGACACCGGAACTGTTAGTGGGCAGCATTTTGACTGCCATAGGGTG 

TGTGTCTTTGTGTGTGTGTATACGCACTATTATTTTCGAATTTCTTTGGAAATTTCAGACTTCTGCGTATGTATTCTCAATCGAAGCTTTCGCTTGCAAATAATGTTTTCAT 

AACGCCCCTTACGGGGGCTCATCAATTTCGACTTAGGCCAGAGCCACTCCACTTGCAATCTGTTGTCATGAGTGTGGGATTAAATTATGTGTACGGATGTGTGTGATAAGGT 

GGACCCTATGTGCTATTTAAATAAAACTATGTTTTTCAAAATTTGTTAAAATTTAAATAAAATAATACAAGACAAATAACGCATTTTAACGAATTTAGTAATAAAACAAATT 

CAAACATTAAGGCTTTTTACTCCAGATTTCTAAATTACCATAAATTATACATATACACAATTTGGTTGGTAAGCTGTGCGCTTTGTTTAGCTTGCTAAGGTCCACCTTAGAG 

AGCCCGTCTTAAAGCGTGTTTGTGTCCTGTGTGTTTACCTGGCATCCTGGGAAACTGGCAGGCGGCAGTCAGGCGTGTCCTAGGCGAAAGACCAGGATCAGGGCGAGTGGAA 

GCTGAAAATTAATTTGAATTGCTAAGTGGCATTTAGTCAAAATATTTGTGTACACTGGATGAATTTAAGGATGTTGCTACAAGGCTGAGAAGGACAGGCCTGCCGATCCGTC 

TCCCACCCAACACCAGGTGGCAAACGAATGAAACGAGCCAGAAGCTCAAGGTCAGGTGGTGGATAAGACGGATAAGGAGACGACGATGGCAAATGGACACCGGGTGACTTAA 

TAACATTAACATAGTTTCATTCCATTCAGATGATGAGTGTTTTCGGCTGCTGCTGTTCTGCTCTGTTCTGTTCACATTCTGGCTATACTTTTCGTTATTGTTGCTGCTGACC 

GCTGTTGACCGTTTTTGGACGTTTTCATTATCATTATCATAAAAAAGTTTCTACAATATATGCACAACCAAAAGAATGCTGTCGAA 

(SEQ ID NO: 193) 

Exon: 3902.. 3560 
Exon: 3312. .3147 
Exon: 3067-.2935 
Exon: 2616.. 2418 
Exon: 2349.. 2135 
Exon: 1979.. 1866 
Exon: 1618. .1001 

Start ATG : 3902 (Reverse strand: CAT) 
Transcript No. : CT25350 

ATGCTGCCCACTAACAGTTCCGGTGTCCTGGCAACCGATCTACAACTGTTTCACAATGAGAAATTTCTCTTAAATCTGACACAAGTGCTGAACATCTCCGCGGACAACCTGA 

CCAGCCTTCTCCAGGGCCTGGAGCCGGAGGAGCTTTTGCCCACGGTGACCCCGATGACACCACTTTCACTGCTGGCCACCCTAAGCGTGGGCTACGCCCTCATATTTATCGC 

CGGCGTTTTGGGCAACCTCATCACATGCATTGTCATTTCGCGGAACAACTTTATGCACACGGCCACCAACTTTTATCTGTTTAACCTCGCTATATCCGACATGATTTTGTTA 

TGCTCAGGAATGCCGCAGGACCTCTATAACCTCTGGCACCCGGATAATTATCCTTTCAGTGACAGCATCTGCATATTGGAGAGCGTiTCTCTCGGAAACGGCGGCCAATGCGA 

CAGTTCTAACCATTACCGCGTTCACAGTCGAACGATATATTGCCATTTGTCATCCGTTCAGGCAGCACACGATGTCCAAGTTGTCACGGGCCGTAAAGTTCATATTTGCCAT 

CTGGATAGCTGCCCTTTTGCTGGCCCTGCCCCAAGCCATTCAGTTCTCCGTGGTGATGCAGGGCATGGGAACATCGTGCACGATGAAAAACGACTTTTTTGCCCATGTGTTT 

GCTGTGTCGGGCTTCCTGTTCTTTGGCGGACCCATGACGGCCATCTGCGTGCTCTATGTCCTCATCGGGGTGAAGTTGAAACGGAGCCGACTCCTGCAGGCGCTTCCCAGGC 

GATGTTACGATGTAAACCGGGGGATAAGCGCCCAAACGCGAGTCATCCGGATGCTGGTGGCGGTGGCCGTGGCCTTCTTCATCTGCTGGGCCCCCTTTCACGCCCAGCGGCT 

GATGGCGGTCTATGGATCCACCTCGGGCATTGAGTCCCAGTGGTTCAACGACGTGTTCAGCATCCTCGACTATACGTCCGGTGTTCTCTACTTCCTCTCAACTTGCATCAAC 

CCGCTGCTCTACAACATCATGAGCCACAAGTTTCGTGAGGCTTTTAAGGTAACTTTGGCTAGACACTTTGGACTTGGTGGCAAAAATCAGGGAAGGGGGCTTCCTCATACTT 

ACAGCGCTCTTCGACGTAATCAAACGGGTTCACTGCGGCTGCACACAACGGACAGCGTTCGCACCACAATGACTTCCATGGCGACGACCACCACTGGGCTAAATGGCTCTGC 

CAATGGCAGTGGCAATGGAACGACAACGGGTCAGTCGGTGCGCCTGAACCGCGTGTCCTTGGACAGTGTCCAGATGCAGGGTCAGAATCGCAGCCGGCAAGACCTCTTCGAT 

AATCCGCGTCGCATGCTCCAGACGCAAATATCGCAGCTGTCATCGGTGGGCGATGCCCATTCCCTCTTGGAGGAGGATTTGCAGTTTCCCGGGGAGCCACTGCAGCGCCAGC 

CCACGATGTGCTCCATTGACGAGCTCACAGATGACTTGGCCATCTCGCGGTCACGCCTTAAACTTACGCGCATCACCCGCCCACCAGGAGGTGTGACGGGGGGCGTGGCAGG 

AGGCAGCACAACAGGGGCCGCGGGGTCGGGGGGCGTGAGCGGTGACGAGTCGAGTGGCAAAGTGCGCAAGGCGAAAGTGAAAGTGCTCAAGAGCTCAAGCCCGTTCAAGGGT 

CTCAGGACCAAATTCAATTGGCGTGCCAGACGCAAAGGCAGCCACAAACCGCACGAAAAGGGGGCGACGGTGAATGGAGGCGACACTGAAGAGCGGGCCGCCTTTTAA 

(SEQ ID NO: 194) 

Start ATG: 1 (Reverse strand: CAT) 

MLPTNSSGVLATDLQLFHNEKFLLNLTQVLNISADNLTSLLQGLEPEELLPTVTPMTPLSLLATLSVGYALIFIAGVLGNLITCIVISRNN FMHTATNFYLFNLAISDMILL 

CSGMPQDLYNLWHPDNYPFSDSICILESVLSETAANATVLTITAFTVERYIAICHPFRQHTMSKLSRAVKFIFAIWIAALLLALPQAIQFSVVMQGMGTSCTKKNDFFAHVF ■ 

AVSGFLFFGGPMTAICVLYVLIGVKLKRSRLLQALPRRCYDVNRGISAQTRVIRKLVAVAVAFF1CWAPFHAQRLMAVYGSTSGIESQWFNDVFSILDYTSGVLYFLSTCIN 

PLLYNIMSHKFREAFKVTLARHFGLGGKNQGRGLPHTYSALRRNQTGSLRLHTTDSVRTTMTSMATTTTGLNGSANGSGNGTTTGQSVRLNRVSLDSVQMQGQNRSRQDLFD 

NPRRMLQTQISQLSSVGDAHSLLEEDLQFPGEPLQRQPTMCSIDELTDDLAISRSRLKLTRITRPPGGVTGGVAGGSTTGAAGSGGVSGDESSGKVRKAKVKVLKSSSPFKG 

LRTKFNWRARRKGSHKPHEKGATVNGGDTEERAAF* 

(SEQ ID NO: 195) 

Name: GROWTH HORMONE SECRETAGOGUE RECEPTOR TYPE 1/like 
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Classification: G_protein_linked_receptor 



Celera Sequence No. : 142000013384638 

TAATTGCTGCCGGGTAATTGCAACTGAGCTCAGGTCfiGCGAGTTTGCTAACCATGGCGTCflTTATfiCTGTTGCACTATAAAATCATTCGACCCAAGTCCGATTAACTTAAGT 
GGGTCTTAGACCAAAAGATGTGTATTTCGAATACTTCTCAAACGTTATTTCCTTAAGTCAAATCATTTCAAGAACTGGGAGAGCAAATAAATATGCGATTTCCTCGACTTAT 
AGTTACTTGTGTGAAATTTGCGCGCTTGTGCAAATTCGAATGTTTGCAACTTAATTGGACTGCATAATAGATTGGGCAAAAGGATTTTATAACAATTCCAGTGTCCAGCCCA 
TCGTACTGATGTGGCTAATTGTTTTGCTGACTTGCACCTAGGCAAAGCAATTTCCCTTGAAAACAATCAAATTCAAAACAATCAAGTAAACAAGACAGCTGAGTGAAAGACT 
TTGGCTTCATGGGTGAACATATTTGTCGGTTGAAGGTTGAGTTTTTGTTTAAGATGATTGCACATGGCACACAGGTTGATTTTAATCTAAATTTAATTTGAGCATTTGAGCA 
TTTTTCTGCTGAACCGGGAAAACCCACATTTGCTTACCTAGAAAAACCGGTTTTCAGAGAAAGTTGCGAGTAATACATATTTAATACGATTTCAAGTCCTGTGTGTACAACG 
TTTTATCTAAGAAAACCAGAAAACCAGAAGCTTTTTAGCTTAGACAGCTTTAATATTTAATTACAGTGTTCACAAAAAAAAAAAACGAAATTCCCCTTTTCTTTGCTTCAAA 
TTTAATTAGACATGTATCTATTGTTTTTTTTTTTCCACTTGCAGCTTATTTATCCAACCTAAGCCAACTAAATGATGCAATGCTGGTGACATATAAAACGAAAGTGGAAACG 
GAGTTCATTCCTTAAGTAACCAATTTAAGATGTTGTGCTCCCAAGATCCCTAAACCACCGTCCAAACCCTTCGAATAACTTATTTAAGCCAAGATATAGACAAAATGGAGGA 
CGAGTGGGGCTCCTTTGATCGTCTGCCCAGTGTTCCGAGTGCCTCGATGGATTTGGAGACGGAAAACGAGGTGGTCAGCAATTGGTCCACACTGGCCAACTTCACGCGACTT 
GTGGCTGGTGCCGCTCCTGAAATCGTCAACTATACGCTCAACATGATCGACGTGGGTGTGGGCATGGCCACGGATATATCCAATTTGAGCGTGAGCACCACGCCCCTGCCCG 
CCTACGCAATCTCCAATAGCTCCTCACTGGCGCACACCAATAGTCGCCATGAAGCACCACCGATGGCGGAACAGGTTCCGGAGCACGTGATGGATCACGCACCTCAACTATC 
CCGATCCGGGTTGCTGAAAGTGTATGTCCTGGCGGTAATGGCACTGTTCTCTCTGCTGGGCAACCTGCTGACCATCTGGAATATCTACAAAACCCGCATCTCAAGAAGAAAC 
TCACGGCACACGTGGAGTGCTATCTACTCACTGATGTTCCATCTGTCCATCGCCGATGTCCTGGTCACCTGGTTCTGCATTATCGGGGAGGCCGCGTGGTGCTACACCGTCC 
AGTGGCTGGCCAATGAGCTCACCTGCAAGCTGGTGAAGCTCTTCCAGATGTTTAGCCTCTACCTGAGCACCTATGTCCTGGTCCTCATCGGAGTGGACCGCTGGATAGCGGT 
CAAGTATCCGATGAAGTCGCTCAACATGGCCAAGAGGTGTCATAGGCTACTTGGCGGTACTTACATCCTGTCGCTGGTGCTCAGCTTGCCACAGGTAAGTTTTGATACACTG 
AAAATAAGACCTTTAAAGTTCTTATAAAATCATTTGCCTAGAACAAACAGATAAGATAGTTACACAAATAAGCAAGATTGTAATAATATAAAAGTCATTTTTGGCCTATAAA 
TGGTCTAAAAATATTTGTGGGATATGTTAGATTCAATCTTTAGATAATTTAAATGAAAAAAAGTTAAAGGTGCCCTTTGATTAGGAAATCAGTAAAAGCTTTAAGATCTCCT 
CTTAATTTATGTACCCTTTCGGAAAATTTCTGTTCGACGTAAATGCAAAAAGCCTTGAGGGTACCCGTAAAAACATTATTCTATTTAAAAAGATTTAGCCAAAGGGCAAATT 
TAAAGCGTCTGCATCTGAAAAAATGGCCCTTCCAGTGAAGTGCGAAGAAGACACGCTTATGAAAATAATCCAGTCACTTTTGGTGGGGCGTTCCATTACGCTAAGTTAAAGC 
TGCGACTTAGCCGTGGTAATATTCACCTGCCACTGCCCCTGCCTGTGCCCCCTACTCCTGGCACTCCATTTGATCCTTACGCTCGCGCTGATGAAGTTGAAATAATATGTTT 
ACTTTATAACACGTATACGCCCCGTGGCACAGTGCAAATAGTCTTGAGTAGGCGCCAGAATGTGTGCAGACATTAAAAATGATTGCCTTTCGCACAATAAGTGTACACTAAA 
TGTGGCTTTAGGCCGCTGCCACGCCCACACACACGCGTGCTGATGGGCGTAGGAATGGGCATAGGGTTGGGCGTATGGGGACACCCAATCGAATTCCGGTCGTCCGACCGAC 
AAAGGGTTTTTTTCTTCTGGTTTTTCCTCTCGGCTTTTCGCTTGGGGACGGGCGACCAAATAAAAAAAAAAAAAGGTCCGAACAAGTGTGTCTTCTGCCTGACGACAGACAT 
TATTCATATTCCCGATTTCGGTTCTTTGTGCGCAGATAGGCACACTGGGTGATATATTCCTTTGCCCTGGCAGTCAGCTGGCGGATAAGCCTGAATGGGACTGCATTAATGG 
GCGTCACTCGCTTTGTTCTACAACTCTAGAACGACTCGGCTTCAGATTTCCGGCACCCTGATTGTATAATATATGCCGCGTGTCTCCACATCCCGATTTCATACTCGATTCG 
CCTGGCCTTTCCTTGATTTATGGCCCCGGGTGAGGCACTTCACTTCTCGAATTCAGTTAATCTTTAGCTGGAGTGCTACGAGCTCCTGCTCGTGTTTTTCCTTAAAGTGGAA 
ATTCGTGGACACTTCAGTTTAGGGTGTCACTAAGGTAGGTAATTAGCACGAAATCTCACTTCAAAGTTGCCAAGGGTTAGTAATATAGTTAGATTGAAAAGTGACTGCAAAC 
TTTCCACTGCTCTTTATGTGACCCGGGTCATAAATATCGTTCGCAGAATTTGGTTAGCCATAAGTGGGGACAAGTGGGTAGAAAGGTCAAACCTTCTGTATAAAAAGATTTG 
TCACCTGAGAGATGCCAAAAATAGATGAAAAAGAACCAAAGGGAAATTTCCTCCTAAGTGGATGGTAAGTGGATTTCCCGCACGCAAGTTATGCAACTAGCACACTTTCCCC 
ACTGCTCGCCCGCAAATGTATGCTAAAATGTAGCTCATTTAATTAGGGAGCCCAGCAACCCGACGCCATTTCCTTTTATCTGAATAATGTATTGCGTGGAACAATAATTACA 
GACGTACCGTACAAAAATCGAGGCGAGATTATATTTGCTCACTTTGGGGCAAAATGTGTGTGTGGTTTTTCTGGGTGCGAAGGCATGCGCGAGTCAGTTTGTTAAATCATCT 
GACTTGTTTAAATGCCGCAAAAAATCTACTGAGCTTAATAACATTTTGATGTTTTTTTCCAATCGATGTCAAGCGGAAAATTTGGCCGTTTTGTTGCTGGGAATTTGGGCCG 
ACTTTTGGTCCAGCCGACCCATAATGGAAAGGACATAATGTATTGGTTTTGGATTTTTAAAATAGAACCAGACTTTGATGAACTGTCCAGCTTGGGTGTAAATATTTGATGA 
CGAACGTCACAAATTACGATGATATGACAGAAGTCATTTATGGATGGCCACTGACACCCAATTAAAAGAAACTTTGCAATCATTAAGCCAATTTTTATGGTCCACTTTTCGA 
CATCTCGCTGAGATCCCGGGAAACTTTCTTCATTTCGCTTCTACTCGAATATGCGACATAAAATTTATGCCGCTTACAGACGTGACGGTCGAAACGTTCCCGCCGGAAACGT 
AGGAATCATCTCAAACAATTTTCCGGATAACTTTTTCCATGTCACACGTTCATGGTGCAGGCAACCAAACAGCCAACCAACCAACCAAGTGGCACATGTATTGGTAAGGAGC 
CACATCGAATCCAATATGAGACGCGCCGCTGAACTCGTCGGTTTCAAATAAAGGAGCGAAATGAAAATTCACTTCGACTGCCAATAAACAACAAATGTTTTCAACAGCAGCA 
TTTATTTGGCTCCACTTCTATCGACCCCAACTCCTCCGCGTCCTTCGAACCCAAGGAAAAAAGAAAAAAAGTTTAGTACCACACGCACACAACCACACAGAAGAATCCTGCC 
ACAATAACTCGAAGTGTGAGCCCAAAAATGTGGCACAATGAAAACAGGCCGCCATAATACAGTCAAACACAAAGTTATCCACAGGTCCTCCGCAGAGTATTATTGCGAATTG 
AATTTCACCCACATATCATCAAATTCAGCAGCCTTAGGTGGCATAATTAAAATTTCGTAGAATTGTTTACAAATTGGAGGGAACTAGTCAATATTACAAGAGTCCCTTGGCG 
ATTTCACCTTCAGTGTGCGCGGGCTTTTGATAATCGATAGGGGTCCTTAAATGCTTAGCTTTAAGTGAATTAAAATTTGAATTTAGTTTCAGAATATTTACAGCAGAATTTG 
AATCAATAATTGCAATTTATATGTGCAAGCATTGGCGAATAAAGTAAATAACCGGATTTAATTATTAGGTGACAAAATCGTAACATTTAAATATATTTACCTGAATAACAAT 
TGGAAATAAGTGCAAAAACAGCAAAAAGTATAATAAAGTTTAGATAAAAAATGAAATCCATTTAATATTTTAATTTGGGCTCATCTGGATTCATATTTTAGGTGACAAAATT 
AATTATTATAATTATTAAAACAAATGAGCAGGTAACAAGTCAGTAAGCTCTAACAATCCTTGTTAAATCTAACTTAAACGTAGCTGAGTGTCCAATTTAAATGGTCAATTTA 
ATCGTGCCTTTAAGCCAAACAAATATGTATATTTTAACAATCTGATAGTTTTACAAACGTCTTAAACGATTGCCAAGCACCTGACCGATGTGTGAAAACTACGAACTTGAGC 
AATACAATTTGGTTAGAACGGAAGTTCTTGAAGTTGTTCCCTCACAATGAATCAATCTTGTTTTATTTTACTGTCGCAAGTTTTTGTCTAAAGTTTTTCGTTGCAACTTGTC 
TAACTTTGCCGCAAATAGTGCTGAAGGCTTGCTTGTGTGTGGCATGGTTTTAGCGTGGAGCGCCCTTTAAACTTTTCGAGCATTTTTAATGAATGGCAACATGGAAATAATG 
AACAAACACTTGAAAAGTTGAAATAATTTAAGAAAGAATATTGAAAGCCACAATTATCCTTGCTTTCTTTTACTAGTCATGACCTATAAAAATATATTTTTCTACTATAAAA 
AGCCACTGGCACTTTACTTCGGTTTCGGCGGAAGGCAATGGCTTGGAAGTTATAGGCGAAATTAATAAAAATCAACCATTAGACGATTATCCGCTTCTCGTTATTTCTGCGC 
GGGGGGTTTTTGTGTAGTTTTTACTACGCCACTCGGCGGAGCGCGGAAAGTTTTCCCATTTCCTCCATTTGGGTCAGCTGTTGACACAGCGGCCGCCAGAAAACCCATAAAA 
AGCGCAAATAAAAATACTTTTAGCGTATTAGTTATGCGTAGACACATCCATCGTTGGTGTATTTAAGATGCTACACTTCCTTTATCAGTTATGGCCACACATTTTCCTGCTC 
GAAAAGCTTTTTCATGGTACGAGAATTTCCACCTTTCTACGTGGCACTCTGCTAAACTCGGTAAGACTTGAACAATGGAGCAGTAGCAACTGCAGAATGTATACATTTTCCA 
AAAATGTTCCATCAATTTGGAAATGATTTCAGCCAAACAAGAAAATCAAGCAAAAGCACTTTTCACCCATAACCAAGCCGCTTGCCACCCACAATGGAAAGTAGATCAAAGC 
CACAACGTTTTGAGTCAGCAGTTTGTTACCATGGACCGCATGTGGGTATTTCCATGTAAACATGCCTGTACTTAGTCCTTGGTGCAATCAGGACATGGATGGTTTGCTAATA 
AAAATCTTATTACGGCACCATCTTATTTACACAACACGGTAGAGATATATGGTTTTAAAAATTTCCATTTTCCTGCTTAAAACAGGCAATAAATTGATAGGAAGGTAAATAT 
CCTTAAGAAACACTTGTATAAAGTTCAAGTTCAATCCATGTGTCTTAACCTTTAATTGTTTACTATGTTGCATGTTCGTTCCAGTTCTTCATCTTCCATGTAGCGCGTGGCC 
CATTCGTGGAGGAGTTTTACCAGTGCGTCACCCACGGATTCTACACGGCGGATTGGCAGGAGCAGATGTACGCCACCTTCACGCTGGTCTTCACCTTCCTGCTGCCGCTGTG 
CATCCTGTTTGGCACCTACATGTCCACCTTCCGCACCATTTCCAGTGAGTTTTCAGCGAGAAGGAGTTTACATTCCCGTACTAACCTGATAACCTGATAATCGTGCAGGCAG 
CGAAAAGATGTTCCAGGGATCAAAGTTGGCCAACTACTCAACGGCCAAATTGCCCACACAGACGAATCGCCAGAGGCTGATACACAAGGCCAAGATGAAGTCGCTTCGCATA 
TCCGTGGTGATCATCATAGCGTTTCTCATCTGCTGGACGCCCTACTACGTCATGATGATTATGTTCATGTTCCTCAATCCGGACAAAAGGGTGAGTTTCCGCCCGTTCCCCT 
GCCGAATTTGAAGCTGAATGACAAGCAGATATCGATTCCTGATACCAAAAGGCTCATCAATCAGCAGTCGATTTGTCAGTTTGTAGCGCACGGTTTGGGCGAATCCTTTGTC 
CTGTTCCCTACCATATTTACGACTTGGGGTCATGGTTGTACGAGTAAAATAAAAACGAGAATACGACACGACCTAGTCGCAGCTGGCAAATCCGTTTGGAGCTATACCCACT 
GCCATAAAAATGATTTCTTATGCTCAATTTTTAATTAATCGTTACGCCACCCGACCGACCACACGGTCGCAATTTTTCCTTAACGGCGTACCCAACTTTGATGACTTGCCAG 
AGATGGTCCGTAAAAGCTGGCCCAAGTTGCAAGATTACGGCGAGGCAATCCGGGCAGAAGGAACACAAGGTCGAATTGCTACAAACCGAGCATTCTGTACAGTAGGGGAAAA 
TGAATGAGTAGCAAAATATTTCCAAAGTTTATTTGTTTCGATTTAAATTTTAAAAATGCTATAATTGACAAAATAGAATCATTTTTGCTGTCACTATTTTTAAGTGAATGGG 
ATATTTGTCCGCCGATTAACCTGAATTTCATATCCTAATGAATTGGATTGCACATTCATAATGGCATTTCCGCTGAAACTCTTTCTCGCTGTGGAAATCCCAGCGATGGCAG 
TGTGCAAAATAATTAATTTGGAAGCCAAATTTTTAGCGCAGTGTATCCATAAGCGACCTGAGCCCCAATTCGACCGCAGGAACGCCCACAATGCGCCCACGCGAATGTAGAG 
GGGAAAAGGTGAAAGGTGTGCGACAAGGGCAAGGAAATTAATTTGAATTTCACGGATTGCTTGCAGCTGGGCGACGATCTGCAGGACGCCATCTTCTTCTTCGGCATGTCAA 
ACAGCCTGGTCAACCCACTCATCTACGGTGCCTTCCACCTGTGTCCTGGCAAAGGGGGCAAGTCGAGCGGCGGGGGCGGCAACAACAACGCC1ACAGCTTGAACAGGTGAGT 
GGTGAGCCAACAGAATAAGTTAATTCTATTAACCACCACATTTCGTACTTGTTTTTGCGCTTTTATCGTGTCAAAAAATAGTTACTTTTCACTCAAGCAATACACTTTTAAG 
ACAGCCAAGCAAGCTTTTGAAGTCTGAATTGCTGCTTTTCTGATCAAAGAGTTTTTATTCTTGGTAATATTTCACATTTAAGAGTATAAATGATAGTGTAAATTTAAATAAA 
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ATAGTAGAAAGTTTTCCCTAAAGATCGATGTATAACTATTAATACATATTTTACTTAATCTAGTGTTGCTTTTTAATGACAGTTTTTGCTGTGCTTAAATTTTTACAAAATA 

AATCTTAAATCCTTGCAAAGAGTTTTCTTTTATGCAAATGGAAAACATTGTTTCATCGCTTGTTAATATCATTTACTTTATTTATTTATTATTCTAAACAGCTGCCAAAGCA 

GCGAACTATCTCAGGATCAGATATGATTTGCTATGATTTTGTCACACTTCCTATGTTTGTTTTGATAACGCAAAAAGCACACACACCGTGATTGGCAATAAACTGATAAATT 

CTTTTCCGTCTGGTTTTTATCACCTCACCGCATCGTCCATTCCCCCGTCCACTTTCCATCCGCAGGGGCGACTCGCAGCGCACTCCATCCATGCTAACGGCGGTGACGCAGG 

TGGACGGCACAGGTGGCAGTTCCCGCCAGATGCGGGCCTTCCGCCAGCAGAGCTACTACCGCAGCTCCTCCAACGGCACAGCCGGACCGGGTGCAGCTCCCTTTAAGGAGCA 

GGTTGGCCTGCTGCACGTGGGTCCCGGCAATGGGACGCCCGGTGGCTCCGTCTCTAGCGGCGCCACGCCGCAGTTGATCCGTAAGGGATCGGCTCTGTTGGCCCGACAACCC 

AGCTGTCTGAGGGAGCAGGAGCACCAGCAGCGTTTGCTGCTGCACGAGAAGCCCTCGACCCTGGTGCTCAGCTACGACAGCCAGCGGGGCGGAGTGGGCGTGGGCGTGGCCA 

GCGGTCTGCTGGACAACAACGAGCGAGTGTCGAGCGTGTGAGAGCAGCTCGCGATGGCGGTAGCGGCGGCGGCGGTGGCGGTGGATGCACCGGACACAGTAGAAGCAGCAGC 

AGCAACAGCATCAGCCGATGCGGATGCGAATGTGGCGATGGAGGACGCCTGCTCCTGCTGCCATTGCCTGAGCAGGGAGCTGGTGCGGCGGCAGTTGGAGCTCCAAGTGCTT 

CCAGTGCAGTGAGGCGACTGCCAGTGGATTTGATCCGGCTGGCTGAGTGGCCGGAGCTGGGGATTCCAGCGATTCCCTCGGTCAACGTGGGGAGTGGCGTCGATGATTAGAT 

CACCGGGGCGTGCAACATGTGCTTAGATAGTGCTAATCGAGAGTTAAGTCTTCAAACACAATGGGTGTGCTTAGAAGTAGCGTAGGATTAGAGCTAGTCCGTACGAAGTGGT 

TGCCCTACACTGGGAGAAACTCGTGGTCACTTTGCATGTTCTCTGAATGTTCTACTAATACTTGACTTCAACAAGGATACAAATATAAGAGATAATATCTATGTACATCCCA 

TAAAACCATAAAAAATTGCAATTAAATCTGGTTGTGAAAAAAATAGATTTAACTTCTGAATGTTTCAAATGTTGCCAATTCGCCAGCGTGTAATCCGATATAAGCAACAATA 

ATTACAATTATAATCAATGTGCAATCAGAGTCAGTTACAATTTACAATTATATGACTGAATAATTGGACTATGAATATATACGACTTCAAATGAGTTTCTCTCATTAGCAGG 

GCAACTTTGTAATGTTAATTAACAGCGTTTTTCAGTTTTTCATTTTGTTCCCCCAGCCGAGTAACTCAAAAATCGATTCTATAATTGTGTAAAATAGAAATCATCTAAAAAC 

AGGGTAAATCTAACTATTGTAAATACATTTACTCATTATTTTGAGAAGAATATTGTAAAAAAAAAAGATTGTAAAGTTTAAAATAAATATGAGTTCATGTTCAACGACACAA 

GGAATTATTTTGATTAATTTAAAGTACTAAAAA 

(SEQ ID NO: 196) 

Exon: 1001.. 1210 

Exon: 1307.. 1774 

Exon: 6357.. 6540 

Exon: 6605.. 6810 

Exon: 7683.. 7834 

Exon: 8466.-8889 

Start ATG : 1001 

Transcript No. : CT29989 

ATGGAGGACGAGTGGGGCTCCTTTGATCGTCTGCCCAGTGTTCCGAGTGCCTCGATGGATTTGGAGACGGAAAACGAGGTGGTCAGCAATTGGTCCACACTGGCCAACTTCA 
CGCGACTTGTGGCTGGTGCCGCTCCTGAAATCGTCAACTATACGCTCAACATGATCGACGTGGGTGTGGGCATGGCCACGGATATATCCAATTTGAGCGTTCCGGAGCACGT 
GATGGATCACGCACCTCAACTATCCCGATCCGGGTTGCTGAAAGTGTATGTCCTGGCGGTAATGGCACTGTTCTCTCTGCTGGGCAACCTGCTGACCATCTGGAATATCTAC 
AAAACCCGCATCTCAAGAAGAAACTCACGGCACACGTGGAGTGCTATCTACTCACTGATGTTCCATCTGTCCATCGCCGATGTCCTGGTCACCTGGTTCTGCATTATCGGGG 
AGGCCGCGTGGTGCTACACCGTCCAGTGGCTGGCCAATGAGCTCACCTGCAAGCTGGTGAAGCTCTTCCAGATGTTTAGCCTCTACCTGAGCACCTATGTCCTGGTCCTCAT 
CGGAGTGGACCGCTGGATAGCGGTCAAGTATCCGATGAAGTCGCTCAACATGGCCAAGAGGTGTCATAGGCTACTTGGCGGTACTTACATCCTGTCGCTGGTGCTCAGCTTG 
CCACAGTTCTTCATCTTCCATGTAGCGCGTGGCCCATTCGTGGAGGAGTTTTACCAGTGCGTCACCCACGGATTCTACACGGCGGATTGGCAGGAGCAGATGTACGCCACCT 
TCACGCTGGTCTTCACCTTCCTGCTGCCGCTGTGCATCCTGTTTGGCACCTACATGTCCACCTTCCGCACCATTTCCAGCAGCGAAAAGATGTTCCAGGGATCAAAGTTGGC 
CAACTACTCAACGGCCAAATTGCCCACACAGACGAATCGCCAGAGGCTGATACACAAGGCCAAGATGAAGTCGCTTCGCATATCCGTGGTGATCATCATAGCGTTTCTCATC 
TGCTGGACGCCCTACTACGTCATGATGATTATGTTCATGTTCCTCAATCCGGACAAAAGGCTGGGCGACGATCTGCAGGACGCCATCTTCTTCTTCGGCATGTCAAACAGCC 
TGGTCAACCCACTCATCTACGGTGCCTTCCACCTGTGTCCTGGCAAAGGGGGCAAGTCGAGCGGCGGGGGCGGCAACAACAACGCCTACAGCTTGAACAGGGGCGACTCGCA 
GCGCACTCCATCCATGCTAACGGCGGTGACGCAGGTGGACGGCACAGGTGGCAGTTCCCGCCAGATGCGGGCCTTCCGCCAGCAGAGCTACTACCGCAGCTCCTCCAACGGC 
ACAGCCGGACCGGGTGCAGCTCCCTTTAAGGAGCAGGTTGGCCTGCTGCACGTGGGTCCCGGCAATGGGACGCCCGGTGGCTCCGTCTCTAGCGGCGCCACGCCGCAGTTGA 
TCCGTAAGGGATCGGCTCTGTTGGCCCGACAACCCAGCTGTCTGAGGGAGCAGGAGCACCAGCAGCGTTTGCTGCTGCACGAGAAGCCCTCGACCCTGGTGCTCAGCTACGA 
CAGCCAGCGGGGCGGAGTGGGCGTGGGCGTGGCCAGCGGTCTGCTGGACAACAACGAGCGAGTGTCGAGCGTGTGA 
(SEQ ID NO: 197) 



MEDEWGSFDRLPSVPSASMDLETENEVVSNWSTLANFTRLVAGAAPEIVNYTLNMIDVGVGMATDISNLSVPEHVMDHAPQLSRSGLIiKVYVLAVMALFSLLGNLLTIWNI Y 
KTRISRRNSRHTWSAIYSLMFHLSIADVLVTWFCIIGEAAWCYTVQWLANELTCKLVKLFQMFSLYLSTYVLVLIGVDRWIAVKYPMKSLNMAKRCHRLLGGTYILSLVLSL 
PQFFIFHVARGPFVEEFYQCVTHGFYTADWQEQMYATFTLVFTFLLPLCILFGTYMSTFRTISSSEKMFQGSKLANYSTAKLPTQTNRQRLIHKAKMKSLRISVVIITAFM 
CWTPYYVMMIMFMFLNPDKRLGDDLQDAIFFFGMSNSLVNPLI YGAFHLCPGKGGKSSGGGGNNNAYSLNRGDSQRTPSMLTAVTQVDGTGGSSRQMRAFRQQSYYRSSSNG 
TAGPGAAPFKEQVGLLHVGPGNGTPGGSVSSGATPQLIRKGSALLARQPSCLREQEHQQRLLLHEKPSTLVLSYDSQRGGVGVGVASGLLDNNERVSSV* 
(SEQ ID NO: 198) 

Name: GONADOTROPIN- RELEASING HORMONE RECEPTOR-li ke 
Classification : G_protein_linked_receptor 



Start ATG: 1 
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